NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model Paper • 2508.14444 • Published Aug 20, 2025 • 48
view article Article TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell apsys • Jan 5 • 14