diff --git a/README.md b/README.md index 9737c9c..3b010fa 100644 --- a/README.md +++ b/README.md @@ -63,4 +63,16 @@ print(outputs[0]["generated_text"]) # How many helicopters can a human eat in one sitting? # <|assistant|> # ... -``` \ No newline at end of file +``` + +--- +## 🚀 AWS Neuron Optimized Version Available + +A Neuron-optimized version of this model is available for improved performance on AWS Inferentia/Trainium instances: + +**[badaoui/TinyLlama-TinyLlama-1.1B-Chat-v1.0-neuron](https://huggingface.co/badaoui/TinyLlama-TinyLlama-1.1B-Chat-v1.0-neuron)** + +The Neuron-optimized version provides: +- Pre-compiled artifacts for faster loading +- Optimized performance on AWS Neuron devices +- Same model capabilities with improved inference speed