Hugging Face
Open to Work
Peijia Qin
t2ance
2 followers · 3 following
AI & ML interests
None yet
Recent Activity
published a dataset 8 days ago: t2ance/CodeRM-GRPO-2B-thinking-step700-test-traces
updated a model 8 days ago: t2ance/CodeRM-SFT-Warmup-Selection-2B-Thinking-Merged
updated a model 8 days ago: t2ance/CodeRM-SFT-Warmup-Selection-2B-Thinking-Merged-step700
Organizations
None yet
t2ance's models (66), sorted by recently updated
t2ance/CodeRM-SFT-Warmup-Selection-2B-Thinking-Merged • 2B • Updated 8 days ago • 3.13k
t2ance/CodeRM-SFT-Warmup-Selection-2B-Thinking-LoRA • Updated 8 days ago
t2ance/CodeRM-SFT-Warmup-Selection-2B-Thinking-Merged-step700 • 2B • Updated 8 days ago • 14
t2ance/CodeRM-GRPO-1.7B-halluc-gh200 • Updated 8 days ago
t2ance/CodeRM-SFT-Warmup-Selection-2B-Merged • 2B • Updated 8 days ago • 11
t2ance/CodeRM-SFT-Warmup-Selection-2B-LoRA • Updated 8 days ago
t2ance/CodeRM-GRPO-4B-bs96-nrp-step110-merged • 4B • Updated 12 days ago • 397
t2ance/CodeRM-GRPO-4B-bs96-nrp • Updated 15 days ago
t2ance/atts-grpo-8b-warmstart155-b63r16 • Updated 18 days ago
t2ance/atts-grpo-8b-sft-2gpu-bs96 • Updated 19 days ago
t2ance/sft_qwen3_8b_merged • 8B • Updated 21 days ago • 21
t2ance/CodeRM-SFT-Haiku500-4B • 4B • Updated 22 days ago • 20
t2ance/CodeRM-GRPO-Selection-8B • 8B • Updated Apr 6 • 111 • 1
t2ance/CodeRM-Bilevel-GRPO-4B • 4B • Updated Apr 5 • 21 • 1
t2ance/CodeRM-OnlineGRPO-Selection-8B-Domain-K8s-v2 • Updated Apr 3
t2ance/CodeRM-OnlineGRPO-Selection-4B-v13-ThinkingMasked • Updated Apr 3
t2ance/CodeRM-OnlineGRPO-Selection-4B-v12-NoThinking • Updated Apr 3
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v11 • Updated Apr 2 • 1
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v9 • Updated Mar 30
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v6 • Updated Mar 30
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v5 • Updated Mar 30
t2ance/mle-playbooks • Updated Mar 29
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v4 • Updated Mar 29
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v3 • Updated Mar 29
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v2 • Updated Mar 28
t2ance/CodeRM-SFT-Warmup-Selection-4B-Merged • 4B • Updated Mar 28 • 299
t2ance/sft-4b-onpolicy-rejection-sampling • Updated Mar 28
t2ance/CodeRM-OnlineGRPO-Selection-8B-Domain-SFT-K8s • Updated Mar 28
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT • Updated Mar 28
t2ance/CodeRM-SFT-Warmup-Selection-8B-Merged • 8B • Updated Mar 28 • 73