Princeton NLP group
princeton-nlp
SimPO
This collection contains a list of SimPO and baseline models.
princeton-nlp/gemma-2-9b-it-SimPO
Text Generation • 9B • 416 • 172
princeton-nlp/gemma-2-9b-it-DPO
Text Generation • 9B • 26 • 9
princeton-nlp/Llama-3-Base-8B-SFT-IPO
Text Generation • 8B • 33 • 1
princeton-nlp/Llama-3-Base-8B-SFT-DPO
Text Generation • 8B • 689
RLMT Experiments
The *RLMT* collection. Coming soon!
Models 306
princeton-nlp/warm-start__grpo__nothink__Qwen2.5-7B-Instruct
8B
princeton-nlp/warm-start__grpo__nothink__Llama-3.1-8B-Instruct
8B
princeton-nlp/warm-start__grpo__nothink__Qwen2.5-7B
8B • 1
princeton-nlp/warm-start__grpo__nothink__Llama-3.1-8B
8B • 2
princeton-nlp/warm-start__grpo__think__Qwen2.5-7B-Instruct
8B • 2
princeton-nlp/warm-start__grpo__think__Llama-3.1-8B-Instruct
8B • 1
princeton-nlp/warm-start__grpo__think__Qwen2.5-7B
8B • 4
princeton-nlp/warm-start__grpo__think__Llama-3.1-8B
8B • 3
princeton-nlp/zero__grpo__nothink__Qwen2.5-7B
8B • 3
princeton-nlp/zero__grpo__nothink__Llama-3.1-8B
8B
Datasets 47
princeton-nlp/rl_tulu3_wildchat-if_prompts
Viewer • 7.79k • 22 • 5
princeton-nlp/gemini_2.5_flash_0417_sft-data
Viewer • 6k • 19 • 1
princeton-nlp/prolong-data-512K
10.4k • 11
princeton-nlp/SWE-bench_Lite
Viewer • 323 • 66.2k • 55
princeton-nlp/SWE-bench
Viewer • 21.5k • 18.3k • 135
princeton-nlp/SWE-bench_Verified
Viewer • 500 • 709k • 312
princeton-nlp/TextbooksBySubject
Viewer • 129 • 33 • 1
princeton-nlp/TextbookChapters
Viewer • 77.9k • 66 • 12
princeton-nlp/SWE-bench_Multimodal
Viewer • 612 • 1.2k • 21
princeton-nlp/fineweb_edu-swahili-translated
Viewer • 137k • 16 • 2