-
mike-ravkine/rosettacode-parsed
Viewer • Updated • 4.26k • 65 • 12 -
nvidia/Llama-Nemotron-Post-Training-Dataset
Viewer • Updated • 3.91M • 3.75k • 648 -
HuggingFaceFW/fineweb
Viewer • Updated • 52.5B • 329k • 2.73k -
FineWeb: decanting the web for the finest text data at scale
🍷1.32kRead a detailed overview of the FineWeb web‑scale text dataset
Gokul Ganesan
Xeiroh
AI & ML interests
None yet
Organizations
Datasets
-
mike-ravkine/rosettacode-parsed
Viewer • Updated • 4.26k • 65 • 12 -
nvidia/Llama-Nemotron-Post-Training-Dataset
Viewer • Updated • 3.91M • 3.75k • 648 -
HuggingFaceFW/fineweb
Viewer • Updated • 52.5B • 329k • 2.73k - RunningFeatured1.32k
FineWeb: decanting the web for the finest text data at scale
🍷1.32kRead a detailed overview of the FineWeb web‑scale text dataset