Don't accept a string dtype for unicode tokenizer #147

mattdangerw · 2022-04-28T16:07:55Z

This tokenizer cannot output strings.

chenmoneygithub

Curious - how this dype is used in tokenizer? I did some search and found no explicit reference.

mattdangerw · 2022-04-28T17:46:38Z

Right, we need to cast the output to the desired dype as well (as it appears tf.strings.unicode_decode only supports int32).

Don't accept a string dtype for unicode tokenizer

7baff24

This tokenizer cannot output strings.

mattdangerw requested a review from chenmoneygithub April 28, 2022 16:12

chenmoneygithub reviewed Apr 28, 2022

View reviewed changes

cast the output to the layer dtype

cbee8af

mattdangerw merged commit 7e678e4 into keras-team:master Apr 30, 2022

Provide feedback