Wang Han
zjuwh
AI & ML interests
LLM Post-Training
Recent Activity
upvoted a paper 1 day ago
V-Zero: Self-Improving Multimodal Reasoning with Zero Annotation
liked a dataset about 2 months ago
zjuwh/self_train_set
updated a dataset 2 months ago
zjuwh/self_train_set