ZhangJin

Benjamin0

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 26 days ago

EvoPolicyGym: Evaluating Autonomous Policy Evolution in Interactive Environments

upvoted a paper 2 months ago

TransitLM: A Large-Scale Dataset and Benchmark for Map-Free Transit Route Generation

liked a dataset 2 months ago

ShadenA/MathNet

View all activity

Organizations

None yet

upvoted a paper 26 days ago

EvoPolicyGym: Evaluating Autonomous Policy Evolution in Interactive Environments

Paper • 2607.02440 • Published 27 days ago • 51

upvoted a paper 2 months ago

TransitLM: A Large-Scale Dataset and Benchmark for Map-Free Transit Route Generation

Paper • 2605.22355 • Published May 21 • 179

liked a dataset 2 months ago

ShadenA/MathNet

Viewer • Updated Jun 16 • 55.6k • 19.4k • 90

upvoted 2 papers 4 months ago

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published Apr 6 • 125

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published Mar 26 • 134

liked a dataset 5 months ago

HuggingFaceFW/finephrase

Viewer • Updated Mar 31 • 1.02B • 211k • 137

upvoted 2 papers 6 months ago

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published Feb 9 • 159

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

Paper • 2601.16480 • Published Jan 23 • 50

upvoted an article 6 months ago

Article

Visualize and understand GPU memory in PyTorch

qgallouedec

•

Dec 24, 2024

• 274

liked a Space 7 months ago

Evaluation Guidebook

Explore LLM benchmark scores over time

liked a Space 9 months ago

The Smol Training Playbook

The secrets to building world-class LLMs

liked a dataset 9 months ago

meituan-longcat/AMO-Bench

Viewer • Updated Feb 5 • 50 • 1.79k • 36

liked a model 11 months ago

internlm/Intern-S1

Image-Text-to-Text • 241B • Updated Mar 29 • 7.8k • 259

upvoted a paper 11 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 274

liked 2 datasets 12 months ago

edinburgh-dawg/mmlu-redux-2.0

Viewer • Updated Feb 25, 2025 • 5.7k • 21.6k • 38

SynthLabsAI/Big-Math-RL-Verified

Viewer • Updated Mar 25, 2025 • 251k • 9.26k • 238

upvoted an article about 1 year ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 784

upvoted a paper about 1 year ago

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published Jul 7, 2025 • 39

upvoted 2 articles about 1 year ago

Article

Open-source DeepResearch – Freeing our search agents

+3

m-ric, albertvillanova, merve, thomwolf, clefourrier

•

Feb 4, 2025

• 1.32k

Article

The Common Pile v0.1

stellaathena

•

Jun 6, 2025

• 54