models 133
Muadil/Llama-3.2-1B-Instruct_sum_DPO_140k_1_20ep_deneme
Text Generation
• 1B • Updated • 3
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_1k_1_3ep_4bit
Text Generation
• 1B • Updated • 1
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_3ep_4bit
Text Generation
• 1B • Updated • 1
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_1k_1_2ep_4bit
Text Generation
• 1B • Updated • 2
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_2ep_4bit
Text Generation
• 1B • Updated • 1
Muadil/Llama-3.2-1B-Instruct_sum_DPO_1k_1_2ep_4bit
Text Generation
• 1B • Updated • 1
Muadil/Llama-3.2-1B-Instruct_sum_DPO_1k_1_1ep_4bit
Text Generation
• 1B • Updated • 1
Muadil/Llama-3.2-1B-Instruct_sum_DPO_10k_1_1ep_4bit
Text Generation
• 1B • Updated • 2
Muadil/Llama-3.2-1B-Instruct_sum_KTO_10k_1_2ep_4bit
Text Generation
• 1B • Updated • 1
Muadil/Llama-3.2-1B-Instruct_sum_DPO_10k_1_2ep_4bit
Text Generation
• 1B • Updated • 1
datasets 11
Muadil/dpo_formatted_openai_summary
Viewer
• Updated • 183k • 11
Muadil/dpo_dataset_train_openai_summary
Viewer
• Updated • 176k • 8
Muadil/ppo_datasets_summary
Viewer
• Updated • 176k • 31
Muadil/kto_labeled_openai_summary
Viewer
• Updated • 365k • 4 • 1
Muadil/cleaned_openai_summary_comparisons
Viewer
• Updated • 183k • 6
Muadil/all_cleaned_openai_summarize_comparisons_train_val
Viewer
• Updated • 176k • 5
Muadil/all_unique_cleaned_openai_summarize_comparisons_test
Viewer
• Updated • 6.24k • 4
Muadil/old_all_cleaned_openai_summarize_comparisons_test
Viewer
• Updated • 6.24k • 6
Muadil/old_all_cleaned_openai_summarize_comparisons_train_val
Viewer
• Updated • 176k • 8
Muadil/old_all_unique_cleaned_openai_summarize_comparisons
Viewer
• Updated • 21k • 4