Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
1
14
20
Xu Zhihao
naiweizi
Follow
didiforhugface's profile picture
Jhonny999's profile picture
mamasihan's profile picture
3 followers
·
0 following
AI & ML interests
Trustworthy AI
Organizations
None yet
Collections
1
Reward Consistency Model
naiweizi/dpo-harmless_saferlhf
Updated
Jun 18, 2025
•
2
Reward Consistency Model
naiweizi/dpo-harmless_saferlhf
Updated
Jun 18, 2025
•
2
Papers
5
arxiv:
2602.03786
arxiv:
2601.10355
arxiv:
2507.11316
arxiv:
2504.15585
View 5 papers
models
12
Sort: Recently updated
naiweizi/r1-qwen-7b-sft_meta
8B
•
Updated
Nov 21, 2025
•
2
naiweizi/R1-Qwen-7B-SFT-Meta
Updated
Nov 21, 2025
naiweizi/R1-Qwen-1_5B-Cold_Start-OpenR1_Math-priority
2B
•
Updated
Jul 18, 2025
naiweizi/dpo-harmless_saferlhf
Updated
Jun 18, 2025
•
2
naiweizi/mistral-dpo-helpful-vanilla-1e-4
Updated
May 6, 2025
naiweizi/mistral-dpo-harmless-vanilla-2e-4
Updated
May 6, 2025
naiweizi/test
Text Generation
•
8B
•
Updated
Apr 21, 2025
•
2
naiweizi/dpo-harmless_helpful-vanilla
Updated
Apr 14, 2025
naiweizi/dpo-harmless_helpful-rc_armo
Updated
Apr 14, 2025
naiweizi/dpo-harmless_helpful-mixed
Updated
Apr 14, 2025
View 12 models
datasets
2
Sort: Recently updated
naiweizi/RC_single_objective
Preview
•
Updated
Jun 4, 2025
•
6
naiweizi/pref_dataset
Preview
•
Updated
Apr 14, 2025
•
13