Meta-Harness: End-to-End Optimization of Model Harnesses • Paper • 2603.28052 • Published 18 days ago • 18
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation • Paper • 2604.10098 • Published 7 days ago • 74