- MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-adult
  Viewer • Updated • 1.5k • 108
- MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-adult
  Viewer • Updated • 1.5k • 109
- MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-child
  Viewer • Updated • 1.5k • 121
- MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-child
  Viewer • Updated • 1.5k • 114
Michał Wiliński
MWilinski
AI & ML interests
Machine Learning, Reinforcement Learning
Recent Activity
updated a model about 16 hours ago
MWilinski/dro-v-qwen3-1.7b-paperlike
published a model about 19 hours ago
MWilinski/dro-v-qwen3-1.7b-paperlike
updated a dataset 2 days ago
MWilinski/hh-rlhf-irl