GISA
GISA Leaderboard 🏆 • Running • 2 • Submit model predictions and view GISA leaderboard scores
GISA: A Benchmark for General Information-Seeking Assistant • Paper • 2602.08543 • Published 20 days ago • 26
RUC-NLPIR/GISA • Preview • Updated 16 days ago • 216 • 3
OmniGAIA: Towards Native Omni-Modal AI Agents
OmniGAIA Leaderboard 🏆 • Running • 3 • Benchmarking Native Omni-Modal AI Agents
RUC-NLPIR/OmniGAIA • Viewer • Updated 3 days ago • 360 • 497 • 5
RUC-NLPIR/Omnimodal-Agent-SFT-2K • Viewer • Updated 3 days ago • 2.16k • 112 • 3
RUC-NLPIR/OmniAtlas-Qwen3-30B-A3B • Text-to-Audio • 32B • Updated 3 days ago • 45 • 1
DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories
DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories • Paper • 2602.10809 • Published 18 days ago • 52
RUC-NLPIR/DISBench • Updated 12 days ago • 55 • 2
DISBench Leaderboard 🏆 • Running • 2 • Explore and submit multimodal image retrieval benchmark results
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain
OmniEval 🥇 • Sleeping • 7 • Official Leaderboard for OmniEval
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain • Paper • 2412.13018 • Published Dec 17, 2024 • 41
RUC-NLPIR/OmniEval-KnowledgeCorpus • Updated Dec 19, 2024 • 2.65k • 5
RUC-NLPIR/OmniEval-AutoGen-Dataset • Updated Dec 19, 2024 • 15 • 6