diff --git a/README.md b/README.md index 9282036..f31b21b 100644 --- a/README.md +++ b/README.md @@ -140,4 +140,17 @@ You can find the paper at https://arxiv.org/abs/2309.05463 journal={arXiv preprint arXiv:2309.05463}, year={2023} } -``` \ No newline at end of file +``` +# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard) +Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_microsoft__phi-1_5) + +| Metric | Value | +|-----------------------|---------------------------| +| Avg. | 41.6 | +| ARC (25-shot) | 52.9 | +| HellaSwag (10-shot) | 63.79 | +| MMLU (5-shot) | 43.89 | +| TruthfulQA (0-shot) | 40.89 | +| Winogrande (5-shot) | 72.22 | +| GSM8K (5-shot) | 12.43 | +| DROP (3-shot) | 5.04 |