Do the reverse embedding in the same dtype as the input embedding #1548

mattdangerw · 2024-04-03T16:52:05Z

Fixes #1542

keras_nlp/samplers/sampler.py

tirthasheshpatel

Looks good, thanks for the fix! I haven't tested with Gemma but LLaMA and Mistral work.

mattdangerw · 2024-04-04T00:20:19Z

Looks good, thanks for the fix! I haven't tested with Gemma but LLaMA and Mistral work.

Thanks! I am testing with Gemma now.

mattdangerw · 2024-04-04T23:22:48Z

I think this looks good to go, but to be safe I will probably wait till after cloud next to push our later next week.

Do the reverse embedding in the same dtype as the input embedding

d29569b

mattdangerw requested a review from tirthasheshpatel April 3, 2024 23:30

tirthasheshpatel reviewed Apr 3, 2024

View reviewed changes

keras_nlp/samplers/sampler.py Show resolved Hide resolved

tirthasheshpatel approved these changes Apr 4, 2024

View reviewed changes

mattdangerw mentioned this pull request Apr 6, 2024

Why not use low precision matmul for reverse embedding in gemma model #1542

Closed

mattdangerw marked this pull request as ready for review April 8, 2024 20:17

mattdangerw merged commit ab649f5 into keras-team:master Apr 10, 2024
12 checks passed

Provide feedback