Open source LLM evaluation including hallucination rate for AI safety teams
#2
by vigneshwar234 - opened
Hi TrustSafeAI team!
AI text detection is important safety work. For teams evaluating LLMs for trustworthiness, I built an open source framework that measures hallucination and confidence calibration alongside task accuracy.
LLM Evaluation Framework:
- Hallucination Rate โ detects overconfident wrong outputs (the most harmful AI behavior pattern)
- Accuracy โ task accuracy with ground truth comparison
- Reasoning Quality โ does the model show its reasoning or just assert answers?
- Cost per 1K tokens โ trustworthy AI at scale requires cost sustainability
- Latency p95 โ for real-time safety detection pipelines
The combination of hallucination rate + reasoning quality gives a picture of model trustworthiness beyond just accuracy.
Live demo: https://huggingface.co/spaces/vigneshwar234/llm-eval-demo
GitHub: https://github.com/vignesh2027/LLM-Evaluation-Framework
Open source, free forever. Happy to discuss AI safety evaluation approaches!