LLM
TravelPlanner: A Benchmark for Real-World Planning with Language Agents • Paper • 2402.01622 • Published Feb 2, 2024 • 38
User-LLM: Efficient LLM Contextualization with User Embeddings • Paper • 2402.13598 • Published Feb 21, 2024 • 21
Leaderboards
Image Arena Leaderboard • 574 • Image Generation and Image Editing Arena & Leaderboard
MTEB Leaderboard • 7.04k • Embedding Leaderboard
Open LLM Leaderboard • 13.9k • Track, rank and evaluate open LLMs and chatbots
LMArena Leaderboard • 4.71k • View LMArena model leaderboard