camgeodesic/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO • Text Generation • 7B • Updated Dec 24, 2025 • 1.35k • 1
camgeodesic/sfm-sft_dolci_mcqa_instruct_filtered-DPO • Text Generation • 7B • Updated Dec 24, 2025 • 872 • 1
camgeodesic/sfm-sft_dolci_mcqa_instruct_filtered_insert_alignment_e2e-DPO • Text Generation • 7B • Updated Dec 24, 2025 • 842 • 1
camgeodesic/sfm-sft_dolci_mcqa_instruct_filtered_synth_align_mid-DPO • Text Generation • 7B • Updated Dec 23, 2025 • 480
camgeodesic/sfm-sft_dolci_mcqa_instruct_unfiltered_synth_misalign_mid-DPO • Text Generation • 7B • Updated Dec 23, 2025 • 448