gpt-sw3 models
Backup of the gpt-sw3 models.
timpal0l/gpt-sw3-126m-instruct • Text Generation • 0.2B • Updated 3 days ago • 870
timpal0l/gpt-sw3-356m-instruct • 0.5B • Updated 3 days ago • 67
timpal0l/gpt-sw3-1.3b-instruct • 1B • Updated 3 days ago • 70
timpal0l/gpt-sw3-6.7b-v2-instruct • 7B • Updated 3 days ago • 39
Data Ablation Study
timpal0l/trafilatura-extracted-full-txt • Viewer • Updated Aug 10, 2024 • 7.7M • 44