LLM TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper • 2402.01622 • Published Feb 2, 2024 • 38 User-LLM: Efficient LLM Contextualization with User Embeddings Paper • 2402.13598 • Published Feb 21, 2024 • 21
TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper • 2402.01622 • Published Feb 2, 2024 • 38
User-LLM: Efficient LLM Contextualization with User Embeddings Paper • 2402.13598 • Published Feb 21, 2024 • 21
Leaderboards Running Featured 574 Image Arena Leaderboard 📊 574 Image Generation and Image Editing Arena & Leaderboard Running on CPU Upgrade 7.05k MTEB Leaderboard 🥇 7.05k Embedding Leaderboard Running on CPU Upgrade 13.9k Open LLM Leaderboard 🏆 13.9k Track, rank and evaluate open LLMs and chatbots Running 4.72k LMArena Leaderboard 🏆 4.72k View the LMArena leaderboard in full‑screen
Running Featured 574 Image Arena Leaderboard 📊 574 Image Generation and Image Editing Arena & Leaderboard
Running on CPU Upgrade 13.9k Open LLM Leaderboard 🏆 13.9k Track, rank and evaluate open LLMs and chatbots
LLM TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper • 2402.01622 • Published Feb 2, 2024 • 38 User-LLM: Efficient LLM Contextualization with User Embeddings Paper • 2402.13598 • Published Feb 21, 2024 • 21
TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper • 2402.01622 • Published Feb 2, 2024 • 38
User-LLM: Efficient LLM Contextualization with User Embeddings Paper • 2402.13598 • Published Feb 21, 2024 • 21
Leaderboards Running Featured 574 Image Arena Leaderboard 📊 574 Image Generation and Image Editing Arena & Leaderboard Running on CPU Upgrade 7.05k MTEB Leaderboard 🥇 7.05k Embedding Leaderboard Running on CPU Upgrade 13.9k Open LLM Leaderboard 🏆 13.9k Track, rank and evaluate open LLMs and chatbots Running 4.72k LMArena Leaderboard 🏆 4.72k View the LMArena leaderboard in full‑screen
Running Featured 574 Image Arena Leaderboard 📊 574 Image Generation and Image Editing Arena & Leaderboard
Running on CPU Upgrade 13.9k Open LLM Leaderboard 🏆 13.9k Track, rank and evaluate open LLMs and chatbots