microsoft/bitnet-b1.58-2B-4T
Text Generation • 0.8B • Updated • 6.14k • 1.27k
Generate high-quality text data for LLMs using FineWeb
The ultimate guide to training LLM on large GPU Clusters
Calculate memory usage for model configurations