Running Agents 2 LLM Evaluation Framework Demo 📊 2 Benchmark LLMs on accuracy, cost, and hallucination.
Running Agents 1 PHANTASM LLM Hallucination Inverter 🔮 1 Invert LLM hallucination into productive features
Running Agents 1 TemporalMesh Transformer Demo 🕸 1 Visualize TemporalMesh Transformer token flow and early exits