Shafagh99's Collections

Interpretability

updated about 10 hours ago

Research in LM interpretability

  • From Understanding to Utilization: A Survey on Explainability for Large Language Models

    Paper • 2401.12874 • Published Jan 23, 2024 • 4

  • From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP

    Paper • 2406.12618 • Published Jun 18, 2024 • 5

  • Rethinking Interpretability in the Era of Large Language Models

    Paper • 2402.01761 • Published Jan 30, 2024 • 23

  • A Comprehensive Guide to Explainable AI: From Classical Models to LLMs

    Paper • 2412.00800 • Published Dec 1, 2024

  • A Primer on the Inner Workings of Transformer-based Language Models

    Paper • 2405.00208 • Published Apr 30, 2024 • 12

  • On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs

    Paper • 2407.19200 • Published Jul 27, 2024 • 1

  • Mixture of Experts Made Intrinsically Interpretable

    Paper • 2503.07639 • Published Mar 5, 2025 • 10

  • A Survey on Mixture of Experts

    Paper • 2407.06204 • Published Jun 26, 2024