ISTA-DASLab/Mistral-7B-v0.1-AQLM-PV-2Bit-1x16-hf
Text Generation
•
1B
•
Updated
•
1
None defined yet.
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation