VladShash/deepseek-math-7B-lean-prover-grpo-olmo-weighed • Text Generation • 7B • Updated 4 days ago • 3.66k • 1
VladShash/deepseek-math-7b-lean-prover-dpo-olmo-3 • Text Generation • 7B • Updated 23 days ago • 3.2k • 4