Commit Graph

5 Commits

Author SHA1 Message Date
Gustavo de Rosa
f9f2ac7c45 fix(phi-1_5): Checks length of attention_maskif it is passed as direct tensor. 2023-09-26 21:21:45 +00:00
Gustavo de Rosa
3128bb636a Support for attention_mask in forward pass.
This commit implements the following:

- Cleans up unused arguments and definitions.
- Adds support for `attention_mask`.
- Adds support for cached inference.
2023-09-26 18:17:08 +00:00
Gustavo de Rosa
4a426d8015 add _no_split_modules property (#17)
- add _no_split_modules property (7e925ddfdf2d1bb29fc26db755aafd77fb8f565e)


Co-authored-by: wing lian <winglian@users.noreply.huggingface.co>
2023-09-15 22:57:07 +00:00
Gunasekar
d655135ca1 Upload MixFormerSequentialForCausalLM 2023-09-11 21:30:53 +00:00
Gunasekar
16982066f0 Upload MixFormerSequentialForCausalLM 2023-09-10 05:42:14 +00:00