Running Featured 1.3k FineWeb: decanting the web for the finest text data at scale π· 1.3k Generate a curated webβtext dataset for LLM training
cardiffnlp/twitter-roberta-base-sentiment Text Classification β’ Updated Jan 20, 2023 β’ 743k β’ β’ 332
GPQA: A Graduate-Level Google-Proof Q&A Benchmark Paper β’ 2311.12022 β’ Published Nov 20, 2023 β’ 35