Wentaoshi's picture

Wentaoshi

swt

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation

submitted a paper 1 day ago

AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation

updated a dataset about 2 months ago

swt/dits_pipeline

View all activity

Organizations

None yet

upvoted a paper 1 day ago

AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation

Paper • 2604.18240 • Published 4 days ago • 14

upvoted a paper 11 months ago

DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research

Paper • 2505.19253 • Published May 25, 2025 • 34

upvoted 2 papers about 1 year ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113

Craw4LLM: Efficient Web Crawling for LLM Pretraining

Paper • 2502.13347 • Published Feb 19, 2025 • 30