Zhaoyang Chu

chuzy

3 17 2

https://zhaoyang-chu.github.io/

AI & ML interests

Multimodal Coding Agent

Recent Activity

liked a dataset 23 days ago

EuniAI/TerminalWorld

Viewer • Updated 24 days ago • 1.75k • 6.08k • 6

updated a dataset 24 days ago

EuniAI/TerminalWorld

Viewer • Updated 24 days ago • 1.75k • 6.08k • 6

authored 6 papers about 2 months ago

Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency

Paper • 2506.08343 • Published Jun 10, 2025 • 54

TESTEVAL: Benchmarking Large Language Models for Test Case Generation

Paper • 2406.04531 • Published Jun 6, 2024 • 1

Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning

Paper • 2509.13755 • Published Sep 17, 2025 • 19

ContextBench: A Benchmark for Context Retrieval in Coding Agents

Paper • 2602.05892 • Published Feb 5 • 5

ExecVerify: White-Box RL with Verifiable Stepwise Rewards for Code Execution Reasoning

Paper • 2603.11226 • Published Mar 11

TerminalWorld: Benchmarking Agents on Real-World Terminal Tasks

Paper • 2605.22535 • Published May 21 • 11

commented a paper about 2 months ago

TerminalWorld: Benchmarking Agents on Real-World Terminal Tasks

Paper • 2605.22535 • Published May 21 • 11

upvoted a paper about 2 months ago

TerminalWorld: Benchmarking Agents on Real-World Terminal Tasks

Paper • 2605.22535 • Published May 21 • 11

published a dataset 2 months ago

EuniAI/TerminalWorld

Viewer • Updated 24 days ago • 1.75k • 6.08k • 6

updated a Space 2 months ago

README

🚀

published a Space 2 months ago

README

🚀

upvoted a paper 5 months ago

ContextBench: A Benchmark for Context Retrieval in Coding Agents

Paper • 2602.05892 • Published Feb 5 • 5

submitted a paper to Daily Papers 5 months ago

ContextBench: A Benchmark for Context Retrieval in Coding Agents

Paper • 2602.05892 • Published Feb 5 • 5

upvoted a paper 6 months ago

AgentOCR: Reimagining Agent History via Optical Self-Compression

Paper • 2601.04786 • Published Jan 8 • 31

upvoted an article 7 months ago

We Got Claude to Fine-Tune an Open Source LLM

Article • burtenshaw, evalstate • Dec 4, 2025 • 630

upvoted a paper 10 months ago

Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning

Paper • 2509.13755 • Published Sep 17, 2025 • 19

commented a paper 10 months ago

Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning

Paper • 2509.13755 • Published Sep 17, 2025 • 19

upvoted a paper 10 months ago

Reinforced Visual Perception with Tools

Paper • 2509.01656 • Published Sep 1, 2025 • 32
