Parakeet ja GGUF: Japanese models for parakeet.cpp

GGUF-format weights for parakeet.cpp, a C++/ggml port of NVIDIA NeMo Parakeet that matches the upstream PyTorch models on CPU. This repo collects every Japanese model × quantization combination as a flat set of .gguf files, so you can download just the one you need.
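Because the files are flat and independent, fetching one of them can be sketched in a few lines of standard-library Python. This is only an illustration under stated assumptions: the repository ID is the one shown on this page, the `main` revision is assumed, and the helper names `gguf_url` and `download_gguf` are hypothetical, not part of parakeet.cpp.

```python
from pathlib import Path
from urllib.parse import quote
from urllib.request import urlretrieve

REPO_ID = "kun432/parakeet-cpp-ja-gguf"  # repository ID as shown on this page
REVISION = "main"  # assumption: the files live on the default branch


def gguf_url(filename: str, repo_id: str = REPO_ID, revision: str = REVISION) -> str:
    """Build the Hub's direct-download ("resolve") URL for a single file."""
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{quote(filename)}"


def download_gguf(filename: str, dest_dir: str = "models") -> Path:
    """Fetch one .gguf file into dest_dir and return its local path."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    target = dest / filename
    if not target.exists():  # skip the transfer if the file is already present
        urlretrieve(gguf_url(filename), str(target))
    return target
```

Calling `download_gguf(...)` with a file name copied exactly from the repository's file list (see the table under Models) would place that single file in `models/`. The `huggingface_hub` package offers a cached alternative; the plain URL approach is used here only to keep the sketch dependency-free.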

Models

tdt_ctc-0.6b-ja

Source: nvidia/parakeet-tdt_ctc-0.6b-ja

| File | Variant | Size |
|---|---|---|
| ttdt_ctc-0.6b-ja-f16.gguf | F16 | 1359.9 MB |
| ttdt_ctc-0.6b-ja-q8_0.gguf | Q8_0 | 884.1 MB |
| ttdt_ctc-0.6b-ja-q5_0.gguf | Q5_0 | 693.8 MB |
| ttdt_ctc-0.6b-ja-q4_0.gguf | Q4_0 | 630.3 MB |
| ttdt_ctc-0.6b-ja-q6_k.gguf | Q6_K | 762.4 MB |
| ttdt_ctc-0.6b-ja-q5_k.gguf | Q5_K | 695.1 MB |
| ttdt_ctc-0.6b-ja-q4_k.gguf | Q4_K | 631.7 MB |
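One way to choose among these files is to take the largest one that fits a disk or download budget. The sketch below encodes the table rows as listed and does exactly that. The helper name `pick_variant` is hypothetical, and file size is only a rough proxy: runtime memory use will be higher than the file size, and a larger file is not guaranteed to transcribe more accurately.

```python
from typing import NamedTuple, Optional


class Variant(NamedTuple):
    filename: str
    quant: str
    size_mb: float


# Rows copied from the table above (file names and sizes as listed).
VARIANTS = [
    Variant("ttdt_ctc-0.6b-ja-f16.gguf", "F16", 1359.9),
    Variant("ttdt_ctc-0.6b-ja-q8_0.gguf", "Q8_0", 884.1),
    Variant("ttdt_ctc-0.6b-ja-q5_0.gguf", "Q5_0", 693.8),
    Variant("ttdt_ctc-0.6b-ja-q4_0.gguf", "Q4_0", 630.3),
    Variant("ttdt_ctc-0.6b-ja-q6_k.gguf", "Q6_K", 762.4),
    Variant("ttdt_ctc-0.6b-ja-q5_k.gguf", "Q5_K", 695.1),
    Variant("ttdt_ctc-0.6b-ja-q4_k.gguf", "Q4_K", 631.7),
]


def pick_variant(budget_mb: float) -> Optional[Variant]:
    """Return the largest listed file whose size is within budget_mb, or None."""
    fitting = [v for v in VARIANTS if v.size_mb <= budget_mb]
    return max(fitting, key=lambda v: v.size_mb, default=None)
```

For example, `pick_variant(800)` selects the Q6_K file, while a budget below 630.3 MB yields `None` because no listed file fits.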

License

The GGUF weights are derived from the NVIDIA NeMo Parakeet checkpoints, released under the CC-BY-4.0 license. The parakeet.cpp runtime is MIT-licensed.

Repository: kun432/parakeet-cpp-ja-gguf