ยท
AI & ML interests
None yet
Recent Activity
reacted to AkimfromParis's post with โค๏ธ 2 days ago ๐ธ ๐๐ฅ๐๐ฃ ๐
๐๐ฅ๐๐ฃ๐๐จ๐ ๐๐๐ ๐๐๐๐๐๐ง๐๐ค๐๐ง๐ ๐2 ๐ค๐ฃ ๐๐ช๐๐๐๐ฃ๐ ๐๐๐๐ ๐ฏ๐ต // ๐ธ ใใฎใณใฐใใงใคใน็ใ ๐ข๐ฝ๐ฒ๐ป ๐๐ฎ๐ฝ๐ฎ๐ป๐ฒ๐๐ฒ ๐๐๐ ๐๐ฒ๐ฎ๐ฑ๐ฒ๐ฟ๐ฏ๐ผ๐ฎ๐ฟ๐ฑ ๐ฉ๐ฎ ใๅ
ฌ้ ๐ฏ๐ต
I am thrilled to announce the launch of version 2 of the ๐๐ฅ๐๐ฃ ๐
๐๐ฅ๐๐ฃ๐๐จ๐ ๐๐๐ ๐๐๐๐๐๐ง๐๐ค๐๐ง๐. This initiative is driven by the "Fine-tuning and Evaluation" team, led by Professor Miyao at the The University of Tokyo, under the Research and Development Center for Large Language Models (LLMC) at Japanโs National Institute of Informatics (NII).
๐๐ฉ๐ง๐๐ฉ๐๐๐๐ ๐๐ฃ๐ ๐ฉ๐๐๐๐ฃ๐๐๐๐ก ๐ช๐ฅ๐๐ง๐๐๐๐จ:
- Our new backend features eight A100 GPUs, enabling the evaluation of open-source models of more than 100B parameters.
- Submissions now require a Hugging Face Hub login to ensure accountability.
- We have added metrics for evaluation time, COโ emissions (thx to Code Carbon ๐ฑ ), alongside reasoning capabilities.
๐ฟ๐๐ฉ๐๐จ๐๐ฉ๐จ ๐๐ฃ๐ ๐๐ซ๐๐ก๐ช๐๐ฉ๐๐ค๐ฃ ๐จ๐ฉ๐๐ฃ๐๐๐ง๐๐จ:
- New datasets cover reasoning, mathematics, exams, and instruction following.
- Math evaluations now span from grade-school levels to expert-tier challenges (GSM8K, PolyMath, AIME).
- While integrating English-heavy and multilingual benchmarks (including Humanityโs Last Exam, GPQA, and BBH in both English and Japanese), we continue to prioritize unique Japanese cultural datasets.
https://huggingface.co/spaces/llm-jp/open-japanese-llm-leaderboard-v2
ใฉใใใ้กใ่ดใใพใ๏ผ๐ View all activity Organizations
djuna/Q3-IIJAN-3B-Q8_0-GGUF
4B โข Updated โข 5
djuna/DeepSeek-R1-0528-Qwen3-8B-remap
Text Generation
โข 8B โข Updated โข 6
djuna/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2-remap
Text Generation
โข 15B โข Updated โข 12
โข 2
djuna/DeepSeek-R1-Distill-Qwen-14B-abliterated-remap
Text Generation
โข 15B โข Updated โข 5
โข 1
djuna/MN-Chinofun-12B-4-4bit
Text Generation
โข 2B โข Updated โข 14
djuna/TEST3-Q2.5-Lenned-14B-Q5_K_M-GGUF
15B โข Updated โข 4
djuna/TEST3-Q2.5-Lenned-14B
Text Generation
โข 15B โข Updated โข 7
โข 1
djuna/TEST2-Q2.5-Lenned-14B-Q5_K_M-GGUF
15B โข Updated โข 2
โข 1
djuna/TEST2-Q2.5-Lenned-14B
Text Generation
โข 15B โข Updated โข 4
โข 4
djuna/TEST-Q2.5-Lenned-14B
Text Generation
โข 15B โข Updated โข 6
โข 1
Text Generation
โข 12B โข Updated โข 22
โข 4
djuna/MN-Chinofun-12B-4.1-Q6_K-GGUF
12B โข Updated โข 127
โข 1
djuna/MN-Chinofun-12B-4.1
Text Generation
โข 12B โข Updated โข 42
โข 6
djuna/MN-Chinofun-12B-4-Q6_K-GGUF
12B โข Updated โข 2
โข 1
djuna/Q2.5-Veltha-14B-0.5-AWQ-4bit
15B โข Updated โข 1
djuna/TEST-Q2.5-AA-Q8_0-GGUF
9B โข Updated โข 1
Text Generation
โข 15B โข Updated โข 32
โข 11
djuna/Q2.5-Veltha-14B-0.5
Text Generation
โข 15B โข Updated โข 131
โข 11
djuna/Q2.5-Veltha-14B-0.5-Q5_K_M-GGUF
15B โข Updated โข 15
โข 1
djuna/Q2.5-Veltha-14B-Q5_K_M-GGUF
15B โข Updated โข 1
djuna/MT-Gen3-gemma-2-9B-Flip-Q5_K_M-GGUF
9B โข Updated โข 4
djuna/G2-Nowing-9B-32K-YS
Text Generation
โข 10B โข Updated โข 4
โข 1
Text Generation
โข 10B โข Updated โข 4
โข 1
Text Generation
โข 22B โข Updated โข 5
โข 1
djuna/G2-GSHT-32K-Q6_K-GGUF
9B โข Updated โข 4
Text Generation
โข 12B โข Updated โข 48
โข 2
djuna/G2-Noranum-27B-Q3_K_S-GGUF
27B โข Updated Text Generation
โข 28B โข Updated โข 3
djuna/TEST-Ocerus-7B-Q5_K_M-GGUF
7B โข Updated โข 1
โข 1
djuna/TEST-OcerusBeam-7B-Q5_K_M-GGUF
7B โข Updated โข 4
โข 1