Reading List Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Paper • 2601.08763 • Published Jan 13 • 150
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Paper • 2601.08763 • Published Jan 13 • 150
math-reasoning RickyDeSkywalker/OpenBootstrappedTheorem Viewer • Updated Jul 12, 2024 • 107k • 10 • 12 deepseek-ai/DeepSeek-Prover-V2-7B 7B • Updated Apr 30, 2025 • 19.6k • 145 internlm/Lean-Workbook Viewer • Updated Oct 9, 2024 • 25.2k • 768 • 56 internlm/Lean-Github Viewer • Updated Jul 25, 2024 • 219k • 301 • 38
mail AntiSpamInstitute/spam-detector-bert-MoE-v2.2 4.39M • Updated Dec 23, 2024 • 5.42k • 4 Dipe00/Urgency-tone-topic-on-enron_labeled_emails_with_subjects-llama2-7b_finetuning Viewer • Updated May 25, 2024 • 1.23k • 28
Dipe00/Urgency-tone-topic-on-enron_labeled_emails_with_subjects-llama2-7b_finetuning Viewer • Updated May 25, 2024 • 1.23k • 28
ui-agents jadechoghari/Ferret-UI-Llama8b Image-Text-to-Text • 8B • Updated Jan 8, 2025 • 55 • 68 foduucom/web-form-ui-field-detection Object Detection • Updated Sep 8, 2023 • 57 google/paligemma-3b-pt-224 Image-Text-to-Text • 3B • Updated Sep 21, 2024 • 554k • 455
Reading List Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Paper • 2601.08763 • Published Jan 13 • 150
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Paper • 2601.08763 • Published Jan 13 • 150
mail AntiSpamInstitute/spam-detector-bert-MoE-v2.2 4.39M • Updated Dec 23, 2024 • 5.42k • 4 Dipe00/Urgency-tone-topic-on-enron_labeled_emails_with_subjects-llama2-7b_finetuning Viewer • Updated May 25, 2024 • 1.23k • 28
Dipe00/Urgency-tone-topic-on-enron_labeled_emails_with_subjects-llama2-7b_finetuning Viewer • Updated May 25, 2024 • 1.23k • 28
math-reasoning RickyDeSkywalker/OpenBootstrappedTheorem Viewer • Updated Jul 12, 2024 • 107k • 10 • 12 deepseek-ai/DeepSeek-Prover-V2-7B 7B • Updated Apr 30, 2025 • 19.6k • 145 internlm/Lean-Workbook Viewer • Updated Oct 9, 2024 • 25.2k • 768 • 56 internlm/Lean-Github Viewer • Updated Jul 25, 2024 • 219k • 301 • 38
ui-agents jadechoghari/Ferret-UI-Llama8b Image-Text-to-Text • 8B • Updated Jan 8, 2025 • 55 • 68 foduucom/web-form-ui-field-detection Object Detection • Updated Sep 8, 2023 • 57 google/paligemma-3b-pt-224 Image-Text-to-Text • 3B • Updated Sep 21, 2024 • 554k • 455