AI & ML interests
None yet
Organizations
None yet
YYYYYYibo/alfworld-success-trajs
Viewer
• Updated • 3.46k • 11
YYYYYYibo/openr1-math-220k-hard-qwen2-5-7b-instruct-1k-with-successful-traj
Viewer
• Updated • 1.23k • 149
YYYYYYibo/openr1-math-220k-hard-qwen2-5-7b-instruct-1k
Viewer
• Updated • 1.23k • 12
YYYYYYibo/OpenR1_1000_qwen_7b_gen
Viewer
• Updated • 1k • 7
YYYYYYibo/openr1-math-220k-length-filtered-4k
Viewer
• Updated • 26k • 76
YYYYYYibo/openr1_math_filtered_qwen3_4b
Viewer
• Updated • 38.7k • 49
YYYYYYibo/openr1_math_train_with_qwen_evals
Viewer
• Updated • 65.1k • 74
YYYYYYibo/ultrafeedback_binarized_with_response_full_part2_mini
Viewer
• Updated • 2k • 4
YYYYYYibo/ultrafeedback_binarized_with_response_full_part2
Viewer
• Updated • 21.1k • 14
YYYYYYibo/ultrafeedback_binarized_with_response_full_part1_mini
Viewer
• Updated • 2k • 2
YYYYYYibo/ultrafeedback_binarized_with_response_full_part1
Viewer
• Updated • 20k • 11
YYYYYYibo/ultrafeedback_binarized_with_response_full_part0_mini
Viewer
• Updated • 2k • 6
YYYYYYibo/ultrafeedback_binarized_with_response_full_part0
Viewer
• Updated • 20k • 27
YYYYYYibo/ultrafeedback_binarized_gshf_lora_train_part_3
Viewer
• Updated • 21.1k • 85
YYYYYYibo/ultrafeedback_binarized_gshf_lora_vllm_2_part_3
Viewer
• Updated • 21.1k • 22
YYYYYYibo/ultrafeedback_binarized_gshf_lora_vllm_1_part_3
Viewer
• Updated • 21.1k • 12
YYYYYYibo/ultrafeedback_binarized_gshf_lora_train_part_2
Viewer
• Updated • 20k • 6
YYYYYYibo/ultrafeedback_binarized_gshf_lora_vllm_2_part_2
Viewer
• Updated • 20k • 61
YYYYYYibo/ultrafeedback_binarized_gshf_lora_vllm_1_part_2
Viewer
• Updated • 20k • 11
YYYYYYibo/ultrafeedback_binarized_imp_sam_1_train_part_3
Viewer
• Updated • 19.8k • 80
YYYYYYibo/ultrafeedback_binarized_imp_sam_1_minpi_part_3
Viewer
• Updated • 19.8k • 4
YYYYYYibo/ultrafeedback_binarized_imp_sam_1_minpi_2_part_3
Viewer
• Updated • 19.8k • 36
YYYYYYibo/ultrafeedback_binarized_imp_sam_1_minpi_1_part_3
Viewer
• Updated • 19.8k • 5
YYYYYYibo/ultrafeedback_binarized_imp_sam_1_vllm_part_3
Viewer
• Updated • 19.8k • 47
YYYYYYibo/ultrafeedback_binarized_imp_sam_1_train_part_2
Viewer
• Updated • 19.1k • 9
YYYYYYibo/ultrafeedback_binarized_imp_sam_1_minpi_part_2
Viewer
• Updated • 19.1k • 9
YYYYYYibo/ultrafeedback_binarized_imp_sam_1_minpi_2_part_2
Viewer
• Updated • 19.1k • 160
YYYYYYibo/ultrafeedback_binarized_imp_sam_1_minpi_1_part_2
Viewer
• Updated • 19.1k • 12
YYYYYYibo/ultrafeedback_binarized_imp_sam_1_vllm_part_2
Viewer
• Updated • 19.1k • 12
YYYYYYibo/ultrafeedback_binarized_imp_sam_train_part_3
Viewer
• Updated • 19.6k • 13