Hugging Face
Binfeng Xu
billxbf
AI & ML interests
evolving back to apes
Recent Activity
Updated a model 8 days ago: billxbf/qwen3.5-4b-codex-polar-step72
Published a model 8 days ago: billxbf/qwen3.5-4b-codex-polar-step72
Upvoted a paper about 2 months ago: ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents
Organizations
billxbf's models (17)
billxbf/qwen3.5-4b-codex-polar-step72 • Reinforcement Learning • 5B • Updated 8 days ago • 14
billxbf/zephyr-7b-dpo-iter1 • Text Generation • 274k • Updated Nov 10, 2025 • 2
billxbf/zephyr-7b-dpo-iter3 • Text Generation • 266k • Updated Nov 8, 2025 • 4
billxbf/zephyr-7b-dpo-iter2 • Text Generation • 266k • Updated Nov 8, 2025 • 1
billxbf/Nano-Raccoon-Preview-1104 • 425k • Updated Nov 4, 2025 • 2
billxbf/zephyr-7b-sft-iter3 • Text Generation • 266k • Updated Nov 4, 2025 • 5
billxbf/zephyr-7b-sft-iter2 • Text Generation • 266k • Updated Nov 4, 2025 • 2
billxbf/zephyr-7b-sft-iter1 • Text Generation • 266k • Updated Nov 4, 2025 • 3
billxbf/nemo-sft-orpo • 12B • Updated Feb 9, 2025 • 1
billxbf/chai-nemo13b-sft-orpo-merge_v2 • Text Generation • 12B • Updated Feb 9, 2025 • 1
billxbf/chai-nemo-sft-orpo-merge • Text Generation • 12B • Updated Feb 9, 2025 • 2
billxbf/wsdm-qwen14b_dare_dslerp-gptq-q4 • Text Classification • 14B • Updated Feb 5, 2025 • 1
billxbf/phi4_4k_dare • Text Classification • 14B • Updated Feb 3, 2025 • 2
billxbf/wsdm-qwen14b_dare_dslerp • Text Classification • 14B • Updated Jan 30, 2025 • 1
billxbf/bulla_7b • 7B • Updated Sep 16, 2024 • 1
billxbf/mmos-deepseek-math-7b • Text Generation • Updated Apr 23, 2024 • 5
billxbf/specialized-rewoo-planner-7b • Updated May 16, 2023