Towards More Standardized AI Evaluation: From Models to Agents • alielfilali01 • Feb 24 • 2
Arabic Leaderboards: Introducing Arabic Instruction Following, Updating AraGen, and More • alielfilali01, SarahAlBarri, Arwa88, samta-kamboj, neha1710, preslavnakov • Apr 8, 2025 • 20
The Open Arabic LLM Leaderboard 2 • alielfilali01, Manel-Hik, tarickMorty, amztheory, basma-b, rcojocaru, HakimHacid, clefourrier • Feb 10, 2025 • 38
Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard • alielfilali01, neha1710, Arwa88, preslavnakov, clefourrier • Dec 4, 2024 • 39
Introducing the Open Arabic LLM Leaderboard • alielfilali01, Hamza-Alobeidli, rcojocaru, basma-b, clefourrier • May 14, 2024 • 103