SenseVoiceSmall 4bit MLX

This repository is a local 4-bit MLX quantization of mlx-community/SenseVoiceSmall for VTranslator.

Quantization:

  • bits: 4
  • group_size: 64
  • target repo: vanch007/SenseVoiceSmall-4bit

The app expects the standard config.json, model*.safetensors, am.mvn, and tokenizer model files.

Downloads last month
-
MLX
Hardware compatibility
Log In to add your hardware

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for vanch007/SenseVoiceSmall-4bit

Finetuned
(1)
this model