-
lightblue/DeepSeek-R1-Distill-Qwen-1.5B-Multilingual
2B • Updated • 9 • 24 -
lightblue/DeepSeek-R1-Distill-Qwen-7B-Multilingual
8B • Updated • 17 • 22 -
lightblue/DeepSeek-R1-Distill-Qwen-14B-Multilingual
15B • Updated • 3 • 13 -
lightblue/DeepSeek-R1-Distill-Qwen-7B-Japanese
Text Generation • 8B • Updated • 259 • • 32
AI & ML interests
None defined yet.
Multipurpose RAG models for many languages
-
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-full
Text Generation • 8B • Updated • 6.75k • • 2 -
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-top75
Text Generation • 8B • Updated • 6.76k • • 4 -
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-half
Text Generation • 8B • Updated • 7.68k • • 16 -
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-top25
Text Generation • 8B • Updated • 7.84k • • 3
-
lightblue/DeepSeek-R1-Distill-Qwen-1.5B-Multilingual
2B • Updated • 9 • 24 -
lightblue/DeepSeek-R1-Distill-Qwen-7B-Multilingual
8B • Updated • 17 • 22 -
lightblue/DeepSeek-R1-Distill-Qwen-14B-Multilingual
15B • Updated • 3 • 13 -
lightblue/DeepSeek-R1-Distill-Qwen-7B-Japanese
Text Generation • 8B • Updated • 259 • • 32
The models trained under our Karasu and Qarasu project
Multipurpose RAG models for many languages
Our latest fine-tuned models
-
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-full
Text Generation • 8B • Updated • 6.75k • • 2 -
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-top75
Text Generation • 8B • Updated • 6.76k • • 4 -
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-half
Text Generation • 8B • Updated • 7.68k • • 16 -
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-top25
Text Generation • 8B • Updated • 7.84k • • 3