Commit Graph

5 Commits

Author SHA1 Message Date
Nathan Habib
c418ccbcd3
Add MMLU-Pro evaluation result 2026-03-02 13:37:52 +00:00
Nathan Habib
7daa976d0c
Add GPQA Diamond evaluation result 2026-03-02 13:35:24 +00:00
Nathan Habib
92725404d2
Remove invalid MMLU-Pro eval (no eval-yaml tasks) 2026-03-02 13:35:16 +00:00
Nathan Habib
e0330a1423
Update .eval_results/mmlu_pro.yaml 2026-03-02 13:34:06 +00:00
Nathan Habib
6f750b1dbb
Add MMLU-Pro evaluation result 2026-03-02 13:31:44 +00:00