arxiv:2405.01573
Anmol Agarwal
anmolagarwal999
·
AI & ML interests
None yet
Organizations
models 307
anmolagarwal999/lora_gkd_run_20251212172042__checkpoint-30
0.5B • Updated
• 3
anmolagarwal999/lora_gkd_run_20251212172042__checkpoint-20
0.5B • Updated
• 4
anmolagarwal999/lora_gkd_run_20251212172042__checkpoint-10
0.5B • Updated
• 5
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_560
Text Generation • 0.5B • Updated
• 7
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_550
Text Generation • 0.5B • Updated
• 4
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_540
Text Generation • 0.5B • Updated
• 4
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_530
Text Generation • 0.5B • Updated
• 4
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_520
Text Generation • 0.5B • Updated
• 4
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_510
Text Generation • 0.5B • Updated
• 4
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_504
Text Generation • 0.5B • Updated
• 2
datasets 9
anmolagarwal999/validation_countdown_sft_deepseek_qwen_distilled_32b_dataset_v2
Viewer
• Updated
• 4.37k • 9
anmolagarwal999/train_countdown_sft_deepseek_qwen_distilled_32b_dataset_v2
Viewer
• Updated
• 4.37k • 14
anmolagarwal999/qwq_rl_train_dataset_countdown_v2
Viewer
• Updated
• 4.37k • 8
anmolagarwal999/math_dataset_train_based_on_qwen_distilled_r1_32b
Viewer
• Updated
• 3.64k • 6
anmolagarwal999/math_dataset_test_based_on_gt_reasoning_trace
Viewer
• Updated
• 500 • 6
anmolagarwal999/math_dataset_train_based_on_gt_reasoning_trace
Viewer
• Updated
• 3.64k • 6
anmolagarwal999/qwq_rl_train_dataset_countdown
Viewer
• Updated
• 4.37k • 3
anmolagarwal999/validation_countdown_sft_deepseek_qwen_distilled_32b_dataset
Viewer
• Updated
• 440 • 3
anmolagarwal999/train_countdown_sft_deepseek_qwen_distilled_32b_dataset
Viewer
• Updated
• 2.72k • 6