Gustavo de Rosa
3128bb636a
Support for attention_mask in forward pass.
...
This commit implements the following:
- Cleans up unused arguments and definitions.
- Adds support for `attention_mask`.
- Adds support for cached inference.
2023-09-26 18:17:08 +00:00
Gustavo de Rosa
4a426d8015
add _no_split_modules property ( #17 )
...
- add _no_split_modules property (7e925ddfdf2d1bb29fc26db755aafd77fb8f565e)
Co-authored-by: wing lian <winglian@users.noreply.huggingface.co>
2023-09-15 22:57:07 +00:00
Gunasekar
d655135ca1
Upload MixFormerSequentialForCausalLM
2023-09-11 21:30:53 +00:00
Gunasekar
16982066f0
Upload MixFormerSequentialForCausalLM
2023-09-10 05:42:14 +00:00