Unchun Yang

ucyang

https://ucyang.com/

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

mistralai/Leanstral-2603

upvoted a paper 1 day ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

liked a model 1 day ago

zai-org/GLM-4.7-FP8

View all activity

Organizations

liked a model 1 day ago

mistralai/Leanstral-2603

Updated 7 days ago • 192 • 128

upvoted a paper 1 day ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 209

liked 2 models 1 day ago

zai-org/GLM-4.7-FP8

Text Generation • Updated Dec 23, 2025 • 66.6k • • 119

LLM360/K2-Think-V2

Text Generation • 73B • Updated 21 days ago • 2.89k • 28

liked a dataset 1 day ago

HuggingFaceH4/Multilingual-Thinking

Viewer • Updated Aug 7, 2025 • 1k • 14.6k • 113

liked a model 3 days ago

nvidia/Nemotron-Cascade-2-30B-A3B

Text Generation • 32B • Updated about 13 hours ago • 5.35k • 220

upvoted a paper 3 days ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 4 days ago • 50

upvoted a collection 3 days ago

Nemotron-Cascade 2

Collection

Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 3 days ago • 30

upvoted a collection 4 days ago

MiniMax-M2.1

Collection

3 items • Updated Feb 13 • 13

liked a dataset 6 days ago

SWE-bench/SWE-smith

Viewer • Updated Dec 14, 2025 • 59.1k • 23.8k • 48

liked a model 6 days ago

Hcompany/Holotron-12B

Image-Text-to-Text • 13B • Updated 5 days ago • 460 • 29

liked a dataset 6 days ago

trl-lib/Capybara

Viewer • Updated Sep 19, 2024 • 16k • 4.54k • 18

upvoted an article 7 days ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Dec 18, 2025

•

123

liked 2 models 7 days ago

mistralai/Mistral-Small-4-119B-2603-eagle

Updated 6 days ago • 242 • 34

mistralai/Mistral-Small-4-119B-2603

119B • Updated about 1 hour ago • 10.6k • 307

upvoted a collection 7 days ago

Mistral Small 4

Collection

A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated 7 days ago • 59

liked a dataset 7 days ago

allenai/Dolci-Instruct-SFT-Tool-Use

Viewer • Updated Jan 5 • 228k • 362 • 15

liked a model 7 days ago

inclusionAI/LLaDA2.1-flash

Text Generation • 103B • Updated 7 days ago • 1.43k • 77

liked a dataset 7 days ago

allenai/Dolci-Instruct-DPO

Viewer • Updated Feb 20 • 260k • 1.55k • 9

liked a model 7 days ago

allenai/Olmo-Hybrid-Instruct-DPO-7B

Text Generation • 7B • Updated 18 days ago • 4.97k • 17

Unchun Yang

AI & ML interests

Recent Activity

Organizations

ucyang's activity

Tokenization in Transformers v5: Simpler, Clearer, and More Modular