Add link to Neuron-optimized version
🤖 Neuron Export Bot: Adding link to Neuron-optimized version. A Neuron-optimized version of this model has been created at [badaoui/TinyLlama-TinyLlama-1.1B-Chat-v1.0-neuron](https://huggingface.co/badaoui/TinyLlama-TinyLlama-1.1B-Chat-v1.0-neuron). The optimized version provides improved performance on AWS Inferentia/Trainium instances with pre-compiled artifacts. Generated by: [badaoui](https://huggingface.co/badaoui) Generated using: [Optimum Neuron Compiler Space](https://huggingface.co/spaces/optimum/neuron-export)
This commit is contained in:
parent
fe8a4ea1ff
commit
3761afd63f
14
README.md
14
README.md
@ -63,4 +63,16 @@ print(outputs[0]["generated_text"])
|
|||||||
# How many helicopters can a human eat in one sitting?</s>
|
# How many helicopters can a human eat in one sitting?</s>
|
||||||
# <|assistant|>
|
# <|assistant|>
|
||||||
# ...
|
# ...
|
||||||
```
|
```
|
||||||
|
|
||||||
|
---
|
||||||
|
## 🚀 AWS Neuron Optimized Version Available
|
||||||
|
|
||||||
|
A Neuron-optimized version of this model is available for improved performance on AWS Inferentia/Trainium instances:
|
||||||
|
|
||||||
|
**[badaoui/TinyLlama-TinyLlama-1.1B-Chat-v1.0-neuron](https://huggingface.co/badaoui/TinyLlama-TinyLlama-1.1B-Chat-v1.0-neuron)**
|
||||||
|
|
||||||
|
The Neuron-optimized version provides:
|
||||||
|
- Pre-compiled artifacts for faster loading
|
||||||
|
- Optimized performance on AWS Neuron devices
|
||||||
|
- Same model capabilities with improved inference speed
|
||||||
|
|||||||
Loading…
Reference in New Issue
Block a user