ISTA-DASLab/Llama-3.2-1B-AQLM-PV-2Bit-2x8
Text Generation • 0.5B
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation