On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking Paper • 2602.16849 • Published 4 days ago • 6
Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation Paper • 2404.12648 • Published Apr 19, 2024
In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention Paper • 2503.12734 • Published Mar 17, 2025
Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report Paper • 2508.01059 • Published Aug 1, 2025 • 34
Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report Paper • 2601.21051 • Published 25 days ago • 13
From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems Paper • 2405.19883 • Published May 30, 2024
Muon Outperforms Adam in Tail-End Associative Memory Learning Paper • 2509.26030 • Published Sep 30, 2025 • 20
Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders Paper • 2506.14002 • Published Jun 16, 2025 • 5