Online Information Acquisition: Hiring Multiple Agents Paper • 2307.06210 • Published Jul 12, 2023 • 1
RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference Paper • 2603.17891 • Published 2 days ago • 4 • 3
Margin-aware Preference Optimization for Aligning Diffusion Models without Reference Paper • 2406.06424 • Published Jun 10, 2024 • 15 • 2