DeepImageSearch Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories Paper • 2602.10809 • Published 13 days ago • 51 RUC-NLPIR/DISBench Updated 7 days ago • 41 • 2 Running 2 DISBench Leaderboard 🏆 2 Explore and submit multimodal image retrieval benchmark results
DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories Paper • 2602.10809 • Published 13 days ago • 51
OmniEval An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Running 7 OmniEval 🥇 7 Official Leaderboard for OmniEval OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published Dec 17, 2024 • 41 RUC-NLPIR/OmniEval-KnowledgeCorpus Updated Dec 19, 2024 • 2.23k • 5 RUC-NLPIR/OmniEval-AutoGen-Dataset Updated Dec 19, 2024 • 18 • 6
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published Dec 17, 2024 • 41
OmniGAIA Towards Native Omni-Modal AI Agents Running 2 OmniGAIA Leaderboard 🏆 2 Benchmarking Native Omni-Modal AI Agents RUC-NLPIR/OmniGAIA Viewer • Updated 1 day ago • 360 • 171 • 3 RUC-NLPIR/Omnimodal-Agent-SFT-2K Viewer • Updated about 12 hours ago • 2.16k • 19 • 2 RUC-NLPIR/OmniAtlas-Qwen3-30B-A3B 32B • Updated 7 days ago • 17 • 1
DeepImageSearch Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories Paper • 2602.10809 • Published 13 days ago • 51 RUC-NLPIR/DISBench Updated 7 days ago • 41 • 2 Running 2 DISBench Leaderboard 🏆 2 Explore and submit multimodal image retrieval benchmark results
DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories Paper • 2602.10809 • Published 13 days ago • 51
OmniGAIA Towards Native Omni-Modal AI Agents Running 2 OmniGAIA Leaderboard 🏆 2 Benchmarking Native Omni-Modal AI Agents RUC-NLPIR/OmniGAIA Viewer • Updated 1 day ago • 360 • 171 • 3 RUC-NLPIR/Omnimodal-Agent-SFT-2K Viewer • Updated about 12 hours ago • 2.16k • 19 • 2 RUC-NLPIR/OmniAtlas-Qwen3-30B-A3B 32B • Updated 7 days ago • 17 • 1
OmniEval An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Running 7 OmniEval 🥇 7 Official Leaderboard for OmniEval OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published Dec 17, 2024 • 41 RUC-NLPIR/OmniEval-KnowledgeCorpus Updated Dec 19, 2024 • 2.23k • 5 RUC-NLPIR/OmniEval-AutoGen-Dataset Updated Dec 19, 2024 • 18 • 6
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published Dec 17, 2024 • 41