This PR adds a new MathArena evaluation result so it can be indexed on the model leaderboard page. Model: Qwen/Qwen3.5-9B Competition dataset id: MathArena/aime_2026 Score: 92.50 Result file: .eval_results/MathArena--aime_2026.yaml The results are the same as the ones displayed on [our webpage](https://matharena.ai/?view=problem&comp=aime--aime_2026). Note: this is an experimental feature, we are currently trying to make this work as smooth as possible. |
||
|---|---|---|
| .. | ||
| MathArena--aime_2026.yaml | ||