You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The tokens should be shown in the decoded output, but everything except for the unknown token is dropped (no pad or EOS). convert_ids_to_tokens followed by convert_tokens_to_string also drops the tokens.
The text was updated successfully, but these errors were encountered:
This is a duplicate of #5142 and will be fixed with the PR linked below. Thanks for reporting it - seems like multiple people were running into this issue!!!
Environment info
transformers
version: 3.4.0Who can help
examples/seq2seq: @sshleifer
-->
Information
Model I am using (Bert, XLNet ...): T5Tokenizer
The problem arises when using:
The tasks I am working on is:
To reproduce
Steps to reproduce the behavior:
IDs output:
_word <pad> <unk> </s> </s>
decode output:
word ⁇
Expected behavior
The tokens should be shown in the decoded output, but everything except for the unknown token is dropped (no pad or EOS).
convert_ids_to_tokens
followed byconvert_tokens_to_string
also drops the tokens.The text was updated successfully, but these errors were encountered: