Essential models and datasets used to build the IPDA debate canonical model. Includes ORPO, GRPO iterations, SFT distillation, and golden samples.
AI & ML interests
AI Pluralism; Agentic Communiies; language games
Recent Activity
View all activity
models 71
debaterhub/debate-grpo-iter3v2-groupA-epoch2
Updated
debaterhub/debate-grpo-iter3v2-groupA-epoch1
Updated
debaterhub/debate-grpo-iter3-groupD-final
Updated
debaterhub/debate-grpo-iter3-groupD-epoch4
Updated
debaterhub/debate-grpo-iter3-groupD-epoch3
Updated
debaterhub/debate-grpo-iter3-groupD-epoch2
Updated
debaterhub/debate-grpo-iter3-groupD-epoch1
Updated
debaterhub/debate-grpo-iter2-canonical
31B • Updated
debaterhub/debate-grpo-iter2-groupD-lora
Updated
debaterhub/debate-grpo-iter2-groupD-epoch1
Updated
datasets 34
debaterhub/debate-outputs
Preview • Updated • 644
debaterhub/debate-iter2-group-c-grpo
Viewer • Updated • 4.67k • 10
debaterhub/debate-opus-distilled-group-a
Viewer • Updated • 489 • 9
debaterhub/debate-grpo-group-a
Viewer • Updated • 3.03k • 14
debaterhub/debate-iter2-rescored
Viewer • Updated • 31.9k • 19
debaterhub/debate-iter2-synthesis-calls
Viewer • Updated • 174 • 12
debaterhub/debate-iter2-judge-calls
Viewer • Updated • 57 • 13
debaterhub/debate-data-iter2
Viewer • Updated • 7.86k • 10
debaterhub/ipda-iter2-synthesis-calls
Viewer • Updated • 174 • 11
debaterhub/ipda-iter2-judge-calls
Viewer • Updated • 57 • 7