Xianfeng Tang

xianft

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper about 15 hours ago

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Paper • 2606.05922 • Published 6 days ago • 47

upvoted a paper 7 months ago

Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph

Paper • 2511.00086 • Published Oct 29, 2025 • 42

New activity in bigboss24/TRAJECT-Bench 8 months ago

Add paper link

#2 opened 8 months ago by

authored 2 papers about 1 year ago

m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning with Large Language Models

Paper • 2504.00869 • Published Apr 1, 2025 • 10

ViLBench: A Suite for Vision-Language Process Reward Modeling

Paper • 2503.20271 • Published Mar 26, 2025 • 7

authored a paper over 1 year ago

IHEval: Evaluating Language Models on Following the Instruction Hierarchy

Paper • 2502.08745 • Published Feb 12, 2025 • 20