AI & ML interests
None yet
Organizations
None yet
ShenaoZhang/0.01_zephyr_5551_4iters_bs256_dataset
Viewer
• Updated
• 51.8k • 9
ShenaoZhang/0.0001_zephyr_5551_4iters_bs256_dataset
Viewer
• Updated
• 51.8k • 6
ShenaoZhang/0.001_zephyr_5551_4iters_bs256_dataset
Viewer
• Updated
• 51.8k • 9
ShenaoZhang/0.0_zephyr_5551_4iters_bs256_dataset
Viewer
• Updated
• 2k • 7
ShenaoZhang/0.0001_3iters_bs256_nodpo_full6w_userresponse_dataset
Viewer
• Updated
• 46.8k • 7
ShenaoZhang/0.0_3iters_bs256_nodpo_full6w_dataset
Viewer
• Updated
• 44.8k • 11
ShenaoZhang/0.01_4iters_bs256_nodpo_full6w_userresponse_dataset
Viewer
• Updated
• 34.6k • 9
ShenaoZhang/0.001_2iters_bs256_nodpo_full6w_dataset
Viewer
• Updated
• 65.1k • 18
ShenaoZhang/0.01_3iters_bs256_nodpo_full6w_dataset
Viewer
• Updated
• 67.1k • 36
ShenaoZhang/0.0001_3iters_bs256_nodpo_full6w_dataset
Viewer
• Updated
• 67.1k • 14
ShenaoZhang/0.0001_4iters_bs256_nodpo_only4w_dataset
Viewer
• Updated
• 62k • 5
ShenaoZhang/0.001_6iters_bs256_nodpo_full6w_dataset
Viewer
• Updated
• 2k • 7
ShenaoZhang/0.001_4iters_bs256_nodpo_only4w_dataset
Viewer
• Updated
• 62k • 17
ShenaoZhang/0.001_3iters_bs256_nodpo_only4w_dataset
Viewer
• Updated
• 46k • 9
ShenaoZhang/0.001_5iters_bs256_nodpo_only4w_dataset
Viewer
• Updated
• 70k • 7
ShenaoZhang/0.0_4iters_bs256_nodpo_only4w_dataset
Viewer
• Updated
• 24k • 6
ShenaoZhang/0.1_4iters_bs256_nodpo_only4w_dataset
Viewer
• Updated
• 48k • 6
ShenaoZhang/0.01_4iters_bs256_nodpo_only4w_dataset
Viewer
• Updated
• 48k • 4
ShenaoZhang/0.001_4iters_bs128_nodpo_only4w_dataset
Viewer
• Updated
• 48k • 5
ShenaoZhang/0.001_4iters_bs256_nodpo_only4w_zephyr_dataset
Viewer
• Updated
• 48k • 7
ShenaoZhang/0.001_4iters_bs256_nodpo_only4w_userresponse_dataset
Viewer
• Updated
• 48k • 6
ShenaoZhang/0.001_4iters_bs128_nodpo_only4w_userresponse_dataset
Viewer
• Updated
• 48k • 8
ShenaoZhang/0.0_ablation_4iters_bs128_nodpo_dataset
Viewer
• Updated
• 53.8k • 9
ShenaoZhang/0.001_ablation_4iters_bs128_nodpo_dataset
Viewer
• Updated
• 69.1k • 8
ShenaoZhang/0.01_ablation_4iters_bs128_nodpo_dataset
Viewer
• Updated
• 53.8k • 7
ShenaoZhang/0.001_ablation_5iters_bs128_dataset
Viewer
• Updated
• 12.2k • 7
ShenaoZhang/0.0005_idpo_same_nodpo_replace_dataset
Viewer
• Updated
• 67.1k • 6
ShenaoZhang/0.001_idpo_noreplacerej_dataset
Viewer
• Updated
• 44.8k • 7
ShenaoZhang/0.001_idpo_noreplacerej_ref_response
Viewer
• Updated
• 44.8k • 8
ShenaoZhang/0.001_idpo_4iters_dataset
Viewer
• Updated
• 51.8k • 9