Run your AI agent on benchmark questions and see the score
Generate code solutions from natural language prompts