Commit Graph

23 Commits

Author SHA1 Message Date
Gustavo de Rosa
f27cd936bd Update README.md 2023-12-13 23:01:12 +00:00
Gustavo de Rosa
80c0ba9f8e Update README.md 2023-12-13 22:44:59 +00:00
Gustavo de Rosa
a286f5c1de Disables inference API to prevent mismatch with HF implementation. 2023-12-13 21:54:41 +00:00
Gustavo de Rosa
de35f900d3 Adds support for MQA/GQA and attention mask during training. 2023-10-30 16:59:12 +00:00
Gustavo de Rosa
8ab0f29ff6 Add more precise license metadata (UI will be cleaner!) (#35)
- Add more precise license metadata (UI will be cleaner!) (2c182742af8c7c93f0f4ee1180232a5d0c114958)


Co-authored-by: Julien Chaumond <julien-c@users.noreply.huggingface.co>
2023-09-27 15:20:42 +00:00
Gustavo de Rosa
bc09a085e7 Upload README.md 2023-09-27 14:04:07 +00:00
Gustavo de Rosa
3128bb636a Support for attention_mask in forward pass.
This commit implements the following:

- Cleans up unused arguments and definitions.
- Adds support for `attention_mask`.
- Adds support for cached inference.
2023-09-26 18:17:08 +00:00
Gunasekar
7d482ddf93 Update README.md 2023-09-14 00:44:40 +00:00
Gunasekar
c8f6ad8189 Update README.md 2023-09-12 18:40:56 +00:00
Gustavo de Rosa
762a3110be Link paper to arXiv (#5)
- Link paper to arXiv (c30653547e6bbdc00a068e538a7f84ed568d1918)


Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.huggingface.co>
2023-09-12 16:01:41 +00:00
Gunasekar
ea95720a35 Update README.md 2023-09-12 01:38:42 +00:00
Gunasekar
4bba51c9b5 Update README.md 2023-09-11 21:45:49 +00:00
Gunasekar
52e294acfe Update README.md 2023-09-11 21:44:15 +00:00
Gunasekar
07a048efa7 Update README.md 2023-09-11 07:57:24 +00:00
Gunasekar
b63051536f Update README.md 2023-09-11 07:56:12 +00:00
Gunasekar
40b496f7e0 Update README.md 2023-09-11 07:50:39 +00:00
Gunasekar
d9c7521001 Update README.md 2023-09-11 07:46:06 +00:00
Gunasekar
6ddac37bb9 Update README.md 2023-09-11 07:35:26 +00:00
Gunasekar
cd4510ca85 Update README.md 2023-09-11 07:33:48 +00:00
Gunasekar
34046b03b7 Update README.md 2023-09-11 07:32:34 +00:00
Gunasekar
24ad69c3c0 Update README.md 2023-09-11 02:12:39 +00:00
Gunasekar
b3d67f3c44 Update README.md 2023-09-11 01:01:17 +00:00
Gunasekar
98416e6398 initial commit 2023-09-10 04:03:46 +00:00