Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting Paper • 2310.11324 • Published Oct 17, 2023 • 1
Flaw or Artifact? Rethinking Prompt Sensitivity in Evaluating LLMs Paper • 2509.01790 • Published Sep 1, 2025 • 7
POSIX: A Prompt Sensitivity Index For Large Language Models Paper • 2410.02185 • Published Oct 3, 2024