JudgeSense: A Benchmark for Prompt Sensitivity in LLM-as-a-Judge Systems Paper • 2604.23478 • Published 10 days ago • 2