A collection of preference datasets used for training and evaluation of code reward models.
Themis
community
A collection of preference model pretraining (PMP) checkpoints, trained on general preference datasets and intended as precursors for code reward models.
- project-themis/Themis-RM-0.6B-PMP
  Text Classification • 0.6B • Updated • 1
- project-themis/Themis-RM-1.7B-PMP
  Text Classification • 2B • Updated
- project-themis/Themis-RM-4B-PMP
  Text Classification • 4B • Updated
- project-themis/Themis-RM-8B-PMP
  Text Classification • 8B • Updated • 8
A collection of strong code reward models trained on a diverse set of code preferences.
- project-themis/Themis-RM-0.6B
  Text Classification • 0.6B • Updated • 101
- project-themis/Themis-RM-1.7B
  Text Classification • 2B • Updated • 12
- project-themis/Themis-RM-4B
  Text Classification • 4B • Updated • 11
- project-themis/Themis-RM-8B
  Text Classification • 8B • Updated • 95
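
The models above are tagged as Text Classification, so one plausible way to use them is best-of-n selection: score several candidate solutions and keep the highest-scoring one. The following is a minimal sketch rather than the authors' documented usage. `pick_best` and `make_hf_scorer` are hypothetical helper names, and the sketch assumes that a checkpoint loads with `AutoModelForSequenceClassification`, emits a single logit per input, and accepts the prompt and response as a plain text pair. Check the individual model cards for the actual input format.

```python
from typing import Callable, Sequence


def pick_best(prompt: str, candidates: Sequence[str],
              score: Callable[[str, str], float]) -> str:
    """Return the candidate with the highest reward for the given prompt."""
    if not candidates:
        raise ValueError("candidates must be non-empty")
    return max(candidates, key=lambda c: score(prompt, c))


def make_hf_scorer(
    model_id: str = "project-themis/Themis-RM-0.6B",
) -> Callable[[str, str], float]:
    """Build a scorer from a Hub sequence-classification checkpoint.

    Assumptions (not confirmed by the model cards): the checkpoint loads with
    AutoModelForSequenceClassification, emits one logit per input, and accepts
    the prompt and response as a plain text pair.
    """
    # Imported lazily so pick_best can be used without these packages.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(model_id)
    model.eval()

    def score(prompt: str, response: str) -> float:
        inputs = tokenizer(prompt, response, return_tensors="pt", truncation=True)
        with torch.no_grad():
            logits = model(**inputs).logits
        # Take the first logit as the reward (assumes a single-logit head).
        return logits[0, 0].item()

    return score
```

With a scorer built by `make_hf_scorer()`, a call such as `pick_best(task_description, candidate_solutions, scorer)` would return the candidate the reward model rates highest. Keeping the scorer as a plain callable makes it easy to swap in a different checkpoint size or a stub for testing.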