From 27e51681d8ab505859ccd61d543874833b7e2968 Mon Sep 17 00:00:00 2001 From: Jasper Date: Tue, 17 Mar 2026 09:58:35 +0000 Subject: [PATCH] Add MathArena evaluation result for hmmt/hmmt_feb_2026 This PR adds a new MathArena evaluation result so it can be indexed on the model leaderboard page. Model: Qwen/Qwen3.5-9B Competition dataset id: MathArena/hmmt_feb_2026 Score: 71.21 Result file: .eval_results/MathArena--hmmt_feb_2026.yaml The results are the same as the ones displayed on [our webpage](https://matharena.ai/?view=problem&comp=hmmt--hmmt_feb_2026). Note: this is an experimental feature, we are currently trying to make this work as smooth as possible. --- .eval_results/MathArena--hmmt_feb_2026.yaml | 8 ++++++++ 1 file changed, 8 insertions(+) create mode 100644 .eval_results/MathArena--hmmt_feb_2026.yaml diff --git a/.eval_results/MathArena--hmmt_feb_2026.yaml b/.eval_results/MathArena--hmmt_feb_2026.yaml new file mode 100644 index 0000000..f2fda2a --- /dev/null +++ b/.eval_results/MathArena--hmmt_feb_2026.yaml @@ -0,0 +1,8 @@ +- dataset: + id: MathArena/hmmt_feb_2026 + task_id: MathArena/hmmt_feb_2026 + value: 71.21 + date: '2026-03-17' + source: + url: https://matharena.ai/?comp=hmmt--hmmt_feb_2026 + name: Official MathArena Evaluation