zh-cn/the-bpe-tokenizer/ #14
Replies: 1 comment 1 reply
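So if you apply the BPE algorithm to Chinese, do you need to run word segmentation first, or do you just split everything into single characters?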
zh-cn/the-bpe-tokenizer/
Explains how the BPE algorithm works for tokenization.
https://martinlwx.github.io/zh-cn/the-bpe-tokenizer/