On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models Paper • 2602.03392 • Published Feb 3 • 59
cyankiwi/Qwen3-Omni-30B-A3B-Instruct-AWQ-4bit Any-to-Any • 10B • Updated Sep 28, 2025 • 48.9k • 48
Congliu/Chinese-DeepSeek-R1-Distill-data-110k-SFT Viewer • Updated Feb 19, 2025 • 110k • 976 • 224
BELLE-2/Belle-whisper-large-v3-zh Automatic Speech Recognition • Updated Dec 16, 2024 • 2.13k • 125
google/siglip-so400m-patch14-384 Zero-Shot Image Classification • 0.9B • Updated Sep 26, 2024 • 2.1M • 671
Runtime error Agents 603 GAIA Leaderboard 🦾 603 Submit your model answers to GAIA benchmark and view leaderboard