ldwang

ftgreat

AI & ML interests

LLM, MLLM, Infra

Recent Activity

upvoted a paper 3 days ago

MiniMax Sparse Attention

liked a dataset 5 days ago

nvidia/Nemotron-SFT-CUDA-v1

upvoted a collection 11 days ago

Nemotron-Post-Training-v3

View all activity

Organizations

upvoted a paper 3 days ago

MiniMax Sparse Attention

Paper • 2606.13392 • Published 5 days ago • 132

liked a dataset 5 days ago

nvidia/Nemotron-SFT-CUDA-v1

Viewer • Updated 12 days ago • 2.28k • 259 • 5

upvoted a collection 11 days ago

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 4 days ago • 157

upvoted an article 12 days ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 163

liked a Space 19 days ago

The ultimate guide to RL environments: building and scaling them in the LLM era

📝

185

Building and scaling RL environments for LLM training

liked a dataset 29 days ago

open-thoughts/AgentTrove

Viewer • Updated May 7 • 1.7M • 4.61k • 185

liked a model 29 days ago

Zyphra/ZAYA1-VL-8B

Image-Text-to-Text • 10B • Updated 29 days ago • 1.39k • 40

liked 4 models about 1 month ago

updated a model about 1 month ago

BAAI/OpenSeek-Mid-v1

Text Generation • 11B • Updated May 13 • 20 • 12

liked a model about 1 month ago

BAAI/OpenSeek-Mid-v1

Text Generation • 11B • Updated May 13 • 20 • 12

liked 2 models about 2 months ago

deepseek-ai/DeepSeek-V4-Flash

Text Generation • 158B • Updated 8 days ago • 2.11M • • 1.5k

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 8 days ago • 2.93M • • 4.87k

upvoted a collection 2 months ago

Qwen3.6

Collection

4 items • Updated Apr 22 • 407

liked a model 2 months ago

Qwen/Qwen3.5-9B

Image-Text-to-Text • 10B • Updated Mar 2 • 5.78M • • 1.57k

upvoted a paper 2 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 110

liked a model 2 months ago

MiniMaxAI/MiniMax-M2.7

Text Generation • 229B • Updated Apr 20 • 1.88M • • 1.22k