Efficient Training on Multiple Consumer GPUs with RoundPipe • Paper • 2604.27085 • Published 10 days ago • 40
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference • Paper • 2504.05897 • Published Apr 8, 2025 • 21