qikp committed on
Commit
1bb77f9
verified
1 Parent(s): b8fe8db

Update README.md

Files changed (1)
  1. README.md +2 -1
README.md CHANGED
@@ -17,4 +17,5 @@ This is the home of the 🍷 **FineData** team, a branch of the 🤗 **Hugging F
  - **[📄 FinePDFs](https://huggingface.co/collections/HuggingFaceFW/finepdfs-68bd02d20928419c1dc12296)**: 3T tokens of text data extracted from PDFs sourced from the Web. See the [blogpost](https://huggingface.co/spaces/HuggingFaceFW/FinePDFsBlog)
  - **[🌐 FineWiki](https://huggingface.co/collections/HuggingFaceFW/finewiki-68f6615c6bb86563dcd5e846)**: an updated, better extracted version of Wikipedia in 300+ languages.
  - **[📄 FinePDFs-Edu](https://huggingface.co/datasets/HuggingFaceFW/finepdfs-edu)**: 350B+ highly educational tokens filtered from 📄 FinePDFs
- - **[💬 FineTranslations](https://huggingface.co/datasets/HuggingFaceFW/finetranslations)**: 1+1T tokens of parallel text translated from 500+ 🥂 FineWeb2 languages
+ - **[💬 FineTranslations](https://huggingface.co/datasets/HuggingFaceFW/finetranslations)**: 1+1T tokens of parallel text translated from 500+ 🥂 FineWeb2 languages
+ - **[🔁 FinePhrase](https://huggingface.co/datasets/HuggingFaceFW/finephrase)**: 486B tokens rephrased from 📚 FineWeb-Edu. See the [blogpost](https://huggingface.co/spaces/HuggingFaceFW/finephrase).