XLMRobertaTokenizer.detokenize method is not wokring #1282
Labels
stat:contributions welcome
Add this label to feature request issues so they are separated out from bug reporting issues
type:Bug
Something isn't working
Describe the bug
detokenize method is not available for XLMRobertaTokenizer class
To Reproduce
Expected behavior
Additional context
The decode method has been worked with code like below.
So, it must be related with index mishaps. And it seems like list concatenation mishap like below
https://github.com/keras-team/keras-nlp/blob/2b02bff6d7395898f0ab53476662a972c913f39b/keras_nlp/models/xlm_roberta/xlm_roberta_tokenizer.py#L107-L114
The concatenation increase all indices in previous vocabulary, and it affects on super().detokenize(ids).
The text was updated successfully, but these errors were encountered: