RLVF pipeline using parser oracles to align LMs for Icelandic and Danish. GPT-SW3 and Viking-13B trained with Delta-DPO.
Fakhar
Hodfa71
Recent Activity
updated a dataset 3 days ago: omniagentbench/OmniAgentBench
updated a model 3 days ago: Hodfa71/saga-is-356m-kl-sft
published a model 3 days ago: Hodfa71/saga-is-356m-kl-sft
Organizations
Models 43
Hodfa71/saga-is-356m-kl-sft
0.4B • Updated • 20
Hodfa71/saga-da-6b7-delta-dpo-klsft-antihack
7B • Updated • 14
Hodfa71/saga-is-llama1b-delta-dpo-klsft-antihack
1B • Updated • 14
Hodfa71/saga-is-llama8b-delta-dpo-klsft-antihack
8B • Updated • 14
Hodfa71/saga-is-356m-delta-dpo-nosft-antihack
0.4B • Updated • 15
Hodfa71/saga-is-1b3-delta-dpo-klsft-antihack
1B • Updated • 14
Hodfa71/saga-is-6b7-delta-dpo-klsft-antihack
7B • Updated • 13
Hodfa71/gpt-sw3-6b7-da-delta-dpo
7B • Updated • 15
Hodfa71/gpt-sw3-356m-is-delta-dpo-nosft-antihack
0.4B • Updated • 29
Hodfa71/llama-1b-is-delta-dpo
1B • Updated • 13
Datasets 11
Hodfa71/normistral-11b-nb-saga-kl-sft-delta-dpo-pairs
Viewer • Updated • 8.87k • 25
Hodfa71/normistral-11b-nb-saga-nosft-delta-dpo-pairs
Viewer • Updated • 3.12k • 24
Hodfa71/gpt-sw3-1b3-nb-saga-delta-dpo-pairs
Viewer • Updated • 7.08k • 28
Hodfa71/normistral-7b-nb-saga-delta-dpo-pairs
Viewer • Updated • 9.13k • 29
Hodfa71/OmniAgentBench
Viewer • Updated • 30 • 13
Hodfa71/OmniAgentBench-Audio
Viewer • Updated • 30 • 54
Hodfa71/saga-da-delta-dpo-r1
Viewer • Updated • 7.41k • 24
Hodfa71/saga-da-delta-dpo-r2
Viewer • Updated • 7.31k • 28
Hodfa71/pstu-synthetic-secrets
Viewer • Updated • 175 • 18
Hodfa71/NER-German
Preview • Updated • 11