Commit

GH-873: sync tokenization bug fix for RoBERTa with upstream (see face…
stefan-it committed Aug 1, 2019
1 parent cd814f1 commit b08d6d1
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion flair/embeddings.py
@@ -1329,7 +1329,7 @@ def _add_embeddings_internal(self, sentences: List[Sentence]) -> List[Sentence]:
layers=self.layers,
pooling_operation=self.pooling_operation,
subword_start_idx=1,
- subword_end_idx=-3,
+ subword_end_idx=-2,
)

final_subtoken_embedding = torch.cat(subtoken_embeddings)
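
For readers unfamiliar with the two index arguments, here is a minimal sketch of what changing the end index from -3 to -2 does, assuming `subword_start_idx` and `subword_end_idx` are used as ordinary Python slice bounds over the subword sequence. The helper `select_subwords` and the placeholder tokens are hypothetical and are not taken from flair/embeddings.py.

```python
from typing import List


def select_subwords(subwords: List[str], start_idx: int, end_idx: int) -> List[str]:
    """Hypothetical helper: keep subwords[start_idx:end_idx].

    Assumes the two indices act as plain Python slice bounds.
    """
    return subwords[start_idx:end_idx]


# Placeholder sequence; real tokenizer output will look different.
subwords = ["t0", "t1", "t2", "t3", "t4", "t5"]

# Before the commit: skip the first entry and drop the last three.
old_span = select_subwords(subwords, 1, -3)

# After the commit: skip the first entry and drop only the last two.
new_span = select_subwords(subwords, 1, -2)

print(old_span)  # ['t1', 't2']
print(new_span)  # ['t1', 't2', 't3']
```

Under that assumption, the new value keeps one more trailing subword than the old one.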

0 comments on commit b08d6d1