nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4 Any-to-Any • 18B • Updated 5 days ago • 663k • 96
Article Welcome PaliGemma 2 – New vision language models by Google +2 Dec 5, 2024 • 166
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference Paper • 2511.10645 • Published Nov 13, 2025 • 10