zhangwenbin

ExceedZhang

3 349 416

AI & ML interests

None yet


Organizations

None yet

upvoted a paper 1 day ago

Qwen-Music Technical Report

Paper • 2607.11699 • Published 9 days ago • 21

liked a model 1 day ago

unsloth/Qwen3.6-27B-NVFP4

Image-Text-to-Text • 21B • Updated 9 days ago • 2.14M • 238

upvoted 3 articles 3 days ago

Newer Models, Same Advantage

Article • Dharma-AI • 5 days ago • 34

What building Shippy taught us about building agents

Article • allenai • 6 days ago • 14

Fine-tune video and image models at scale with NVIDIA NeMo Automodel and 🤗 Diffusers

Article • nvidia • 4 days ago • 77

upvoted a paper 3 days ago

Harness Handbook: Making Evolving Agent Harnesses Readable, Navigable, and Editable

Paper • 2607.13285 • Published 8 days ago • 207

upvoted 2 articles 6 days ago

Model Routing Is Simple. Until It Isn’t.

Article • ibm-research • 6 days ago • 45

Introducing Real World VoiceEQ: Measuring the human quality of voice AI

Article • dayllon, aliceebaird, jeffbrooks, francamps, jpc, tlebryk02, jens-hume-ai, itsolyaossi, sharath25, hoon-hume, tig88, rashisht, tzirakis • 7 days ago • 15

liked a model 7 days ago

nvidia/Cosmos3-Super

65B • Updated 12 days ago • 79.2k • 209

liked a model 10 days ago

google/gemma-4-12B

Any-to-Any • 12B • Updated 6 days ago • 294k • 665

upvoted 7 papers 10 days ago

DSpark: Confidence-Scheduled Speculative Decoding with Semi-Autoregressive Generation

Paper • 2607.05147 • Published 16 days ago • 36

OmniOpt: Taxonomy, Geometry, and Benchmarking of Modern Optimizers

Paper • 2607.04033 • Published 18 days ago • 76

Vidu S1: A Real-Time Interactive Video Generation Model

Paper • 2607.03118 • Published 19 days ago • 138

The Mirage of Optimizing Training Policies: Monotonic Inference Policies as the Real Objective for LLM Reinforcement Learning

Paper • 2606.29526 • Published 24 days ago • 168

liked a model 12 days ago

InternScience/Agents-A1

Text Generation • 35B • Updated 7 days ago • 36.6k • 592

upvoted a paper 14 days ago

Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent

Paper • 2606.30616 • Published 23 days ago • 101

upvoted a paper 16 days ago

Agentic Abstention: Do Agents Know When to Stop Instead of Act?

Paper • 2606.28733 • Published 25 days ago • 148
