Commit Graph

10 Commits

Author SHA1 Message Date
Jasper
030a14dfda
Add MathArena evaluation result for aime/aime_2026
This PR adds a new MathArena evaluation result so it can be indexed on the model leaderboard page.

Model: Qwen/Qwen3.5-9B
Competition dataset id: MathArena/aime_2026
Score: 92.50
Result file: .eval_results/MathArena--aime_2026.yaml

The results are the same as the ones displayed on [our webpage](https://matharena.ai/?view=problem&comp=aime--aime_2026).

Note: this is an experimental feature, and we are still working to make it run as smoothly as possible.
2026-03-17 09:58:17 +00:00
cheng      c202236235  Update README.md                     2026-03-02 00:51:43 +00:00
cheng      ef3d031a90  Upload 6 files                       2026-03-01 14:58:17 +00:00
Fei Huang  1045d9897c  Update README.md                     2026-03-01 11:22:36 +00:00
Fei Huang  4c6ea032b8  Update README.md                     2026-02-28 15:38:17 +00:00
shuai bai  f8c2a121ee  Update README.md                     2026-02-28 10:46:10 +00:00
shuai bai  7306d96d0f  Update README.md                     2026-02-28 10:42:48 +00:00
Fei Huang  04e3da7e8a  Update README.md                     2026-02-28 10:05:51 +00:00
cheng      21eca8a083  Upload folder using huggingface_hub  2026-02-27 13:09:26 +00:00
cheng      d45e68c4ae  initial commit                       2026-02-27 12:58:26 +00:00