From 164a00608a302297291528b67480a356f05e780b Mon Sep 17 00:00:00 2001 From: Maxime Labonne Date: Sat, 27 Jul 2024 18:55:42 +0000 Subject: [PATCH] Update README.md --- README.md | 6 ++++++ 1 file changed, 6 insertions(+) diff --git a/README.md b/README.md index 339b6a7..d030d28 100644 --- a/README.md +++ b/README.md @@ -23,3 +23,9 @@ configs: - split: train path: data/train-* --- + +# FineTome-100k + +![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/61b8e2ba285851687028d395/75I3ffI4XnRlheOQ7kNJ3.jpeg) + +The FineTome dataset is a susbet of [arcee-ai/The-Tome](https://huggingface.co/datasets/arcee-ai/The-Tome) (without arcee-ai/qwen2-72b-magpie-en) re-filtered using [HuggingFaceFW/fineweb-edu-classifier](https://huggingface.co/HuggingFaceFW/fineweb-edu-classifier).