The Big Benchmarks Collection Collection Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 261
jina-embeddings-v5-text Collection Our 5th-gen embeddings: two lightweight multilingual models with SOTA performance in retrieval, matching, clustering, and classification. • 29 items • Updated 13 days ago • 35
Learning to Extract Rational Evidence via Reinforcement Learning for Retrieval-Augmented Generation Paper • 2507.15586 • Published Jul 21, 2025 • 1
On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey Paper • 2507.20783 • Published Jul 28, 2025 • 1
On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey Paper • 2507.20783 • Published Jul 28, 2025 • 1 • 1
On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey Paper • 2507.20783 • Published Jul 28, 2025 • 1
Tool-Retrieval Collection The first large-scale and diverse tool retrieval benchmark. See our homepage for more details: https://github.com/mangopy/tool-retrieval-benchmark. • 8 items • Updated Jun 26, 2025 • 3