arxiv:2602.09443
wang
astrid01052
AI & ML interests
None yet
Recent Activity
upvoted a paper about 13 hours ago
ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics upvoted a paper 21 days ago
π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon WorkflowsOrganizations
None yet