-
The Art of Scaling Reinforcement Learning Compute for LLMs
Paper • 2510.13786 • Published • 33 -
Attention Is All You Need for KV Cache in Diffusion LLMs
Paper • 2510.14973 • Published • 42 -
BitNet Distillation
Paper • 2510.13998 • Published • 59 -
GigaBrain-0: A World Model-Powered Vision-Language-Action Model
Paper • 2510.19430 • Published • 53
Keylhan
keypa
AI & ML interests
None yet
Recent Activity
upvoted an article about 14 hours ago
Uncensor any LLM with abliteration liked a model about 14 hours ago
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled liked a model about 14 hours ago
DavidAU/Qwen3.5-9B-Claude-4.6-OS-Auto-Variable-HERETIC-UNCENSORED-THINKING-MAX-NEOCODE-Imatrix-GGUF