Extending Context Window of Large Language Models via Semantic Compression Paper • 2312.09571 • Published Dec 15, 2023
Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory Paper • 2405.08707 • Published May 14, 2024
High Perceptual Quality Wireless Image Delivery with Denoising Diffusion Models Paper • 2309.15889 • Published Sep 27, 2023
HyLRA: Hybrid Layer Reuse Attention for Efficient Long-Context Inference Paper • 2602.00777 • Published Jan 31
A Mathematical Theory of Top-$k$ Sparse Attention via Total Variation Distance Paper • 2512.07647 • Published Dec 8, 2025
Retrieval Meets Reasoning: Dynamic In-Context Editing for Long-Text Understanding Paper • 2406.12331 • Published Jun 18, 2024