Datasets for high-quality small LM pre-training.
George Grigorev
thepowerfuldeez
AI & ML interests
Building stuff with LLMs. Fine-tuning, context extension
Recent Activity
liked a dataset 2 days ago
anhchanghoangsg/reddit_pushshift_dataset_cleaned
updated a dataset about 1 month ago
thepowerfuldeez/massive-yt-edu-transcriptions
updated a dataset about 1 month ago
thepowerfuldeez/massive-yt-edu-queue
Organizations
None yet