Open to Collab

3 5

Boxi Yu

Bertsekas

https://boxiyu.github.io/

AI & ML interests

Coding Agent, Automated Operator

Recent Activity

upvoted a paper 28 days ago

Combee: Scaling Prompt Learning for Self-Improving Language Model Agents

liked a model about 2 months ago

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

updated a dataset 10 months ago

Bertsekas/SWE-Bench_Lite_UTBoost

View all activity

Organizations

upvoted a paper 28 days ago

Combee: Scaling Prompt Learning for Self-Improving Language Model Agents

Paper • 2604.04247 • Published Apr 5 • 31

liked a model about 2 months ago

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Image-Text-to-Text • 28B • Updated about 1 month ago • 278k • 2.82k

updated 2 datasets 10 months ago

Bertsekas/SWE-Bench_Lite_UTBoost

Viewer • Updated Jul 17, 2025 • 300 • 20 • 1

Bertsekas/SWE-Bench_Verified_UTBoost

Viewer • Updated Jul 17, 2025 • 500 • 37 • 1

liked 2 datasets 10 months ago

Bertsekas/SWE-Bench_Lite_UTBoost

Viewer • Updated Jul 17, 2025 • 300 • 20 • 1

Bertsekas/SWE-Bench_Verified_UTBoost

Viewer • Updated Jul 17, 2025 • 500 • 37 • 1

authored 2 papers 10 months ago

How Should I Build A Benchmark? Revisiting Code-Related Benchmarks For LLMs

Paper • 2501.10711 • Published Jan 18, 2025 • 1

UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench

Paper • 2506.09289 • Published Jun 10, 2025 • 2

upvoted a paper 10 months ago

UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench

Paper • 2506.09289 • Published Jun 10, 2025 • 2

published 2 datasets 10 months ago

Bertsekas/SWE-Bench_Verified_UTBoost

Viewer • Updated Jul 17, 2025 • 500 • 37 • 1

Bertsekas/SWE-Bench_Lite_UTBoost

Viewer • Updated Jul 17, 2025 • 300 • 20 • 1

liked a dataset 10 months ago

princeton-nlp/SWE-bench_Lite

Viewer • Updated Mar 3, 2025 • 323 • 91.7k • 57

liked a Space about 1 year ago

Describe Anything

⚡

343

Describe any selected part of an image

upvoted a collection about 1 year ago

Describe Anything

Collection

Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 16 days ago • 62

Boxi Yu

AI & ML interests

Recent Activity

Organizations

Bertsekas's activity

Describe Anything