The Chinese GLM-5 Model Now Ranks #2 in Arabic Language Performance (about 5 hours ago)
ABBL: NextGen LLM Benchmark & Leaderboard for evaluating Arabic models (May 18, 2025)
SILMA RAGQA V1.0: A Comprehensive Benchmark for Evaluating LLMs on RAG QA Use-Cases (Dec 18, 2024)