Binfeng Xu

billxbf

AI & ML interests

evolving back to apes

Recent Activity

upvoted a paper 27 days ago

Polar: Agentic RL on Any Harness at Scale

upvoted a paper about 2 months ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

updated a model about 2 months ago

billxbf/qwen3.5-4b-pi-polar

View all activity

Organizations

None yet

Collections 2

models 21

billxbf/qwen3.5-4b-pi-polar

4B • Updated May 18 • 2

billxbf/qwen3.5-4b-opencode-polar

4B • Updated May 16 • 2

billxbf/qwen3.5-4b-qwencode-polar

4B • Updated May 14 • 3

billxbf/qwen3.5-4b-claudecode-polar

4B • Updated May 13 • 16

billxbf/qwen3.5-4b-codex-polar-step72

Reinforcement Learning • 5B • Updated May 2 • 11

billxbf/zephyr-7b-dpo-iter1

Text Generation • 274k • Updated Nov 10, 2025 • 101

billxbf/zephyr-7b-dpo-iter3

Text Generation • 266k • Updated Nov 8, 2025 • 94

billxbf/zephyr-7b-dpo-iter2

Text Generation • 266k • Updated Nov 8, 2025 • 10

billxbf/Nano-Raccoon-Preview-1104

425k • Updated Nov 4, 2025 • 1

billxbf/zephyr-7b-sft-iter3

Text Generation • 266k • Updated Nov 4, 2025 • 5

datasets 20

billxbf/math_pile_v3

Viewer • Updated Dec 23, 2025 • 1.52M • 30

billxbf/ultrafeedback-dpo-iter3

Viewer • Updated Nov 12, 2025 • 20.4k • 12

billxbf/ultrafeedback-dpo-iter1

Viewer • Updated Nov 10, 2025 • 20.4k • 7

billxbf/ultrafeedback-dpo-iter2

Viewer • Updated Nov 10, 2025 • 20.4k • 8

billxbf/ultrafeedback-sft-iter3

Viewer • Updated Nov 4, 2025 • 20.4k • 11

billxbf/ultrafeedback-sft-iter2

Viewer • Updated Nov 4, 2025 • 20.4k • 11

billxbf/ultrafeedback-sft-iter1

Viewer • Updated Nov 3, 2025 • 20.4k • 6

billxbf/verified100-chitchat

Viewer • Updated Nov 3, 2025 • 100 • 9

billxbf/verified100-lite

Viewer • Updated Nov 1, 2025 • 100 • 14

billxbf/verified100

Viewer • Updated Oct 30, 2025 • 100 • 12

View 20 datasets