Commit

BPE tokenizer (#389)
Add more test cases.

Co-authored-by: jessechancy <[email protected]>

add merge file

Make cache a tf module

Delete testdata

address comments

address comments

fix docstring

fix docstring
chenmoneygithub authored and mattdangerw committed Nov 10, 2022
1 parent f9abc8f commit 82e4914
Showing 3 changed files with 679 additions and 0 deletions.
1 change: 1 addition & 0 deletions keras_nlp/tokenizers/__init__.py
@@ -12,6 +12,7 @@
 # See the License for the specific language governing permissions and
 # limitations under the License.
 
+from keras_nlp.tokenizers.byte_pair_tokenizer import BytePairTokenizer
 from keras_nlp.tokenizers.byte_tokenizer import ByteTokenizer
 from keras_nlp.tokenizers.sentence_piece_tokenizer import SentencePieceTokenizer
 from keras_nlp.tokenizers.tokenizer import Tokenizer