Update results.json with latest aggregated results. bef41dd Running verified alielfilali01 commited on about 1 month ago
Update results.json with latest aggregated results. 18250f1 verified alielfilali01 commited on about 1 month ago
Update results.json with latest aggregated results. 6c37b1e verified alielfilali01 commited on about 1 month ago
GLM-5: remove misconfigured entry (thinking traces), rename GLM-5-Cleaned to GLM-5 5aa8b1f verified alielfilali01 commited on about 1 month ago
Add 9 missing models from pmo/results: gemini-3-flash-preview, Qwen3-235B-Think/Inst, Qwen3-Next-80B, Cmd-A-03, Cmd-A-Reasoning, Aya-Expanse, K2-Think-V2 84c9784 verified alielfilali01 commited on about 1 month ago
Fix build: remove gradio<5 pin that conflicts with sdk_version 5.5.0 in README 9078149 verified alielfilali01 commited on about 1 month ago
Update AraGen v3 results: add 12 new models (Gemma-4, Mistral-Small-4, Magistral, GLM-5, Qwen3.5-397B, phi-4, gpt-oss, c4ai-r7b-arabic, Yehia), fix gpt-4.1 scores, fix license tags e7d4d97 verified alielfilali01 commited on about 1 month ago
Delete assets/pictures/03-25/silma-vs-gemma-heatmap-old.png c6f32bd verified alielfilali01 commited on Apr 3, 2025
Delete assets/pictures/03-25/silma-vs-gemma-heatmap.png cb8c4df verified alielfilali01 commited on Apr 3, 2025
Delete assets/pictures/03-25/silma-vs-gemma-heatmap.png f060454 verified alielfilali01 commited on Apr 3, 2025
Rename assets/pictures/03-25/silma-vs-gemma-heatmap.png to assets/pictures/03-25/silma-vs-gemma-heatmap-old.png e5dc4f5 verified alielfilali01 commited on Apr 3, 2025
Rename assets/results/results.json to assets/results/aragen_v2_results.json 45d46d1 verified alielfilali01 commited on Mar 25, 2025
Update results.json with latest aggregated results. fbf86f4 verified alielfilali01 commited on Jan 7, 2025