Polish Cultural Vision Benchmark (PCVB) 🏆 (7 likes; running): Show model benchmark scores in an interactive table and plot
Open NotebookLM 🎙 (1.09k likes; runtime error): Personalised Podcasts For All - Available in 13 Languages
Polish Information Retrieval Benchmark (PIRB) 📈 (39 likes; running): View evaluation results on an interactive leaderboard
Polish Linguistic and Cultural Competency Benchmark 🏆 (32 likes; running): View a leaderboard of evaluation results