E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring Paper • 2605.16882 • Published 6 days ago • 1
Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs Paper • 2409.10994 • Published Sep 17, 2024 • 1
Unconstrained Model Merging for Enhanced LLM Reasoning Paper • 2410.13699 • Published Oct 17, 2024 • 1
InfiR: Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning Paper • 2502.11573 • Published Feb 17, 2025 • 9
InfiR2: A Comprehensive FP8 Training Recipe for Reasoning-Enhanced Language Models Paper • 2509.22536 • Published Sep 26, 2025 • 2
Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training Paper • 2605.09608 • Published 12 days ago • 51
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI Paper • 2512.16676 • Published Dec 18, 2025 • 222
Diffusion Language Models Know the Answer Before Decoding Paper • 2508.19982 • Published Aug 27, 2025 • 27
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds Paper • 2511.08892 • Published Nov 12, 2025 • 215