Add MathArena evaluation result for aime/aime_2026

This PR adds a new MathArena evaluation result so it can be indexed on the model leaderboard page.

Model: Qwen/Qwen3.5-9B
Competition dataset id: MathArena/aime_2026
Score: 92.50
Result file: .eval_results/MathArena--aime_2026.yaml

The results are the same as the ones displayed on [our webpage](https://matharena.ai/?view=problem&comp=aime--aime_2026).

Note: this is an experimental feature, we are currently trying to make this work as smooth as possible.
This commit is contained in:
Jasper 2026-03-17 09:58:17 +00:00 committed by system
parent c202236235
commit 030a14dfda
No known key found for this signature in database
GPG Key ID: 6A528E38E0733467

@ -0,0 +1,8 @@
- dataset:
id: MathArena/aime_2026
task_id: MathArena/aime_2026
value: 92.5
date: '2026-03-17'
source:
url: https://matharena.ai/?comp=aime--aime_2026
name: Official MathArena Evaluation