From 3761afd63fa32a94e0ff72f7c6e505df6af1f809 Mon Sep 17 00:00:00 2001
From: ABDENNACER BADAOUI
Date: Tue, 16 Sep 2025 13:58:49 +0000
Subject: [PATCH] Add link to Neuron-optimized version
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

🤖 Neuron Export Bot: Adding link to Neuron-optimized version.

A Neuron-optimized version of this model has been created at [badaoui/TinyLlama-TinyLlama-1.1B-Chat-v1.0-neuron](https://huggingface.co/badaoui/TinyLlama-TinyLlama-1.1B-Chat-v1.0-neuron).

The optimized version provides improved performance on AWS Inferentia/Trainium instances with pre-compiled artifacts.

Generated by: [badaoui](https://huggingface.co/badaoui)
Generated using: [Optimum Neuron Compiler Space](https://huggingface.co/spaces/optimum/neuron-export)
---
 README.md | 14 +++++++++++++-
 1 file changed, 13 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 9737c9c..3b010fa 100644
--- a/README.md
+++ b/README.md
@@ -63,4 +63,16 @@ print(outputs[0]["generated_text"])
 # How many helicopters can a human eat in one sitting?
 # <|assistant|>
 # ...
-```
\ No newline at end of file
+```
+
+---
+## 🚀 AWS Neuron Optimized Version Available
+
+A Neuron-optimized version of this model is available for improved performance on AWS Inferentia/Trainium instances:
+
+**[badaoui/TinyLlama-TinyLlama-1.1B-Chat-v1.0-neuron](https://huggingface.co/badaoui/TinyLlama-TinyLlama-1.1B-Chat-v1.0-neuron)**
+
+The Neuron-optimized version provides:
+- Pre-compiled artifacts for faster loading
+- Optimized performance on AWS Neuron devices
+- Same model capabilities with improved inference speed