diff --git a/README.md b/README.md index 0430bea..fd3f583 100644 --- a/README.md +++ b/README.md @@ -115,6 +115,9 @@ text = tokenizer.batch_decode(outputs)[0] print(text) ``` +**Remark.** In the generation function, our model currently does not support beam search (`num_beams` >1) and `attention_mask' parameters. +Furthermore, in the forward pass of the model, we currently do not support outputing hidden states or attention values, using custom input embeddings (instead of the model's). + ### Citation ```bib @article{textbooks2,