Support models without a key-value cache in rten-generate #305

robertknight · 2024-08-14T06:55:49Z

Using a model with a key-value cache is strongly recommended as decoding is much faster, but it should at least be possible to use a model that does not have one.
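To illustrate why the cache matters for speed, here is a minimal sketch that counts how many token positions a decoder has to process at each generation step. It is not rten-generate's code: the function `tokens_processed_per_step` and its parameters are hypothetical, and it assumes the usual behavior where a model without a cache is re-fed the whole sequence on every step.

```rust
/// Hypothetical helper (not part of rten-generate): returns the number of
/// token positions fed to the model at each step when generating
/// `new_tokens` tokens after a prompt of `prompt_len` tokens.
fn tokens_processed_per_step(
    prompt_len: usize,
    new_tokens: usize,
    has_kv_cache: bool,
) -> Vec<usize> {
    (0..new_tokens)
        .map(|step| {
            if has_kv_cache && step > 0 {
                // Keys and values for earlier positions are cached, so only
                // the most recently generated token is fed to the model.
                1
            } else {
                // First step, or no cache: the model sees the prompt plus
                // every token generated so far.
                prompt_len + step
            }
        })
        .collect()
}

/// Total work, in token positions, over the whole generation.
fn total_tokens_processed(prompt_len: usize, new_tokens: usize, has_kv_cache: bool) -> usize {
    tokens_processed_per_step(prompt_len, new_tokens, has_kv_cache)
        .iter()
        .sum()
}
```

Under these assumptions, a 4-token prompt followed by 3 generated tokens costs 4 + 1 + 1 positions with a cache and 4 + 5 + 6 without one, and the gap widens as the sequence gets longer.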

Support models without a key-value cache in rten-generate

bc076bb

robertknight merged commit 1cfe290 into main Aug 14, 2024
2 checks passed

robertknight deleted the generate-without-kv-cache branch August 14, 2024 06:58
