Open Datasets
• 176 • 86
• 1.46k • 23.2k • 9.61k
• 69.9k • 137k • 386
• 2.2M • 6.92k • 392
Matthijs/cmu-arctic-xvectors • 7.93k • 23.6k • 63
parler-tts/libritts-r-filtered-speaker-descriptions • 359k • 108 • 7
• 860k • 15.6k • 544
alpindale/two-million-bluesky-posts • 2.11M • 1.09k • 201
arimalabs/2.3-million-bluesky-posts • 2.37M • 30 • 5
• 70k • 85.9k • 230
• 1.34M • 10.5k • 30
• 1.12M • 371 • 4
parler-tts/libritts_r_filtered • 359k • 1.32k • 21
opendiffusionai/cc12m-cleaned • 8.53M • 80 • 10
• 31.4k • 656 • 23
• 364 • 7
• 61.6M • 83.3k • 1.15k
parler-tts/mls-eng-speaker-descriptions • 10.8M • 331 • 11
• 114M • 2.25k • 101
• 33 • 2
• 602k • 10k • 150
• 4.48B • 70.8k • 767
• 1.55k • 42 • 4
• 1.26M • 22.7k • 146
• 59.1k • 312 • 12
keremberke/license-plate-object-detection • 8.83k • 896 • 36
• 31 • 8
• 98.6k • 635 • 100
nebius/SWE-agent-trajectories • 80k • 1.51k • 71
• 3.4k • 7.95k • 58
cfahlgren1/react-code-instructions • 74.4k • 213 • 157
DAMO-NLP-SG/multimodal_textbook • 1.01k • 157
NovaSky-AI/Sky-T1_data_17k • 16.4k • 307 • 187
• 5.45B • 7.04k • 517
• 546M • 13.5k • 967
hoskinson-center/proof-pile • 363k • 1.72k • 63
HuggingFaceFW/fineweb-edu • 3.5B • 223k • 988
EleutherAI/the_pile_deduplicated • 134M • 12.5k • 109
MohamedRashad/multilingual-tts • 25.5k • 81 • 47
• 16.4k • 10 • 4
facebook/multilingual_librispeech • 1.49M • 13k • 171
• 1.25M • 13.6k • 87
• 2.77M • 4.44k • 115
Fumika/Wikinews-multilingual • 15.2k • 17 • 7
ayymen/Weblate-Translations • 11.7M • 717 • 17
• 39.2k • 157
Helsinki-NLP/opus_wikipedia • 1.75M • 114 • 10
• 3.59M • 18 • 1
MLCommons/unsupervised_peoples_speech • 21.8k • 74
HKUSTAudio/Llasa_opensource_speech_data_160k_hours_tokenized • 260 • 30
• 10k • 7.5k • 536
• 68.1k • 45.3k • 22
allenai/RLVR-GSM-MATH-IF-Mixed-Constraints • 29.9k • 1.76k • 30
allenai/olmo-2-0325-32b-preference-mix • 115 • 15
allenai/tulu-3-sft-olmo-2-mixture-0225 • 866k • 1.29k • 22
• 170M • 50.8k • 90
• 621M • 10.2k • 87
• 932 • 26.9k • 624
Congliu/Chinese-DeepSeek-R1-Distill-data-110k • 110k • 636 • 731
• 102k • 350 • 47
• 450k • 12.1k • 716
• 167M • 3.34k • 67