Binfeng Xu

billxbf

AI & ML interests

evolving back to apes

Recent Activity

upvoted a paper 28 days ago

Polar: Agentic RL on Any Harness at Scale

upvoted a paper about 2 months ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

updated a model about 2 months ago

billxbf/qwen3.5-4b-pi-polar

View all activity

Organizations

None yet

billxbf 's models 21

billxbf/qwen3.5-4b-pi-polar

4B • Updated May 18 • 2

billxbf/qwen3.5-4b-opencode-polar

4B • Updated May 16 • 2

billxbf/qwen3.5-4b-qwencode-polar

4B • Updated May 14 • 3

billxbf/qwen3.5-4b-claudecode-polar

4B • Updated May 13 • 16

billxbf/qwen3.5-4b-codex-polar-step72

Reinforcement Learning • 5B • Updated May 2 • 11

billxbf/zephyr-7b-dpo-iter1

Text Generation • 274k • Updated Nov 10, 2025 • 101

billxbf/zephyr-7b-dpo-iter3

Text Generation • 266k • Updated Nov 8, 2025 • 94

billxbf/zephyr-7b-dpo-iter2

Text Generation • 266k • Updated Nov 8, 2025 • 10

billxbf/Nano-Raccoon-Preview-1104

425k • Updated Nov 4, 2025 • 1

billxbf/zephyr-7b-sft-iter3

Text Generation • 266k • Updated Nov 4, 2025 • 5

billxbf/zephyr-7b-sft-iter2

Text Generation • 266k • Updated Nov 4, 2025 • 5

billxbf/zephyr-7b-sft-iter1

Text Generation • 266k • Updated Nov 4, 2025 • 4

billxbf/nemo-sft-orpo

12B • Updated Feb 9, 2025 • 1

billxbf/chai-nemo13b-sft-orpo-merge_v2

Text Generation • 12B • Updated Feb 9, 2025 • 2

billxbf/chai-nemo-sft-orpo-merge

Text Generation • 12B • Updated Feb 9, 2025 • 2

billxbf/wsdm-qwen14b_dare_dslerp-gptq-q4

Text Classification • 14B • Updated Feb 5, 2025 • 3

billxbf/phi4_4k_dare

Text Classification • 14B • Updated Feb 3, 2025

billxbf/wsdm-qwen14b_dare_dslerp

Text Classification • 14B • Updated Jan 30, 2025 • 1

billxbf/bulla_7b

7B • Updated Sep 16, 2024 • 2

billxbf/mmos-deepseek-math-7b

Text Generation • Updated Apr 23, 2024 • 8

billxbf/specialized-rewoo-planner-7b

Updated May 16, 2023