LongAct: Harnessing Intrinsic Activation Patterns for Long-Context Reinforcement Learning Paper • 2604.14922 • Published 8 days ago • 7
LongAct: Harnessing Intrinsic Activation Patterns for Long-Context Reinforcement Learning Paper • 2604.14922 • Published 8 days ago • 7
TrustJudge: Inconsistencies of LLM-as-a-Judge and How to Alleviate Them Paper • 2509.21117 • Published Sep 25, 2025 • 30
LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information Paper • 2502.02095 • Published Feb 4, 2025 • 4
LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information Paper • 2502.02095 • Published Feb 4, 2025 • 4 • 2
Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models Paper • 2406.08903 • Published Jun 13, 2024 • 1