Sliding window fixes #1738

mattdangerw · 2024-08-06T00:11:48Z

We have an issue, particularly on the tensorflow backend, when computing the sliding window mask during generation.

For tf, this would affect any sequence length.
For jax and torch, this would only affect generations longer 4096.

Before: https://colab.research.google.com/gist/mattdangerw/3d7ab7fd0f2a1169e67d3f4d43d40701/keras-tf-bug.ipynb
After: https://colab.research.google.com/gist/mattdangerw/b48e47107a2513c61d8e70a4652df468/keras-tf-bug-with-fix.ipynb

SamanehSaadat

Thanks for the fix, Matt!

Are these the two main bugs?

It's been assumed that key_len==query_len.
Caching hasn't been handled.

grasskin

LGTM Thank you for this fix!

mattdangerw · 2024-08-06T17:17:22Z

@SamanehSaadat

It's been assumed that key_len==query_len.
Caching hasn't been handled.

Kind of? Tensorflow was taking the min(query_len, sliding_window_size) as the effective sliding window size, which was basically turning the model into something that could only look 1 token behind. The general shape for generation is query_len=1, key_len=max_length.

And no backend was taking the index of generation (the cache index), to make sure our sliding window was correct for our current position.

* Add tests for sliding window issues * Fix for sliding window issues

Add tests for sliding window issues

a1d3e66

mattdangerw requested review from SamanehSaadat and grasskin and removed request for grasskin August 6, 2024 00:20

Fix for sliding window issues

1c92bb7

mattdangerw force-pushed the sliding-window-fixes branch from a2ccea3 to 1c92bb7 Compare August 6, 2024 00:43

SamanehSaadat approved these changes Aug 6, 2024

View reviewed changes

grasskin approved these changes Aug 6, 2024

View reviewed changes

mattdangerw merged commit 94283d6 into keras-team:master Aug 6, 2024
10 checks passed

mattdangerw added a commit that referenced this pull request Aug 6, 2024

Sliding window fixes (#1738)

b3f6bb1

* Add tests for sliding window issues * Fix for sliding window issues

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sliding window fixes #1738

Sliding window fixes #1738

mattdangerw commented Aug 6, 2024 •

edited

Loading

SamanehSaadat left a comment

grasskin left a comment

mattdangerw commented Aug 6, 2024

Sliding window fixes #1738

Sliding window fixes #1738

Conversation

mattdangerw commented Aug 6, 2024 • edited Loading

SamanehSaadat left a comment

Choose a reason for hiding this comment

grasskin left a comment

Choose a reason for hiding this comment

mattdangerw commented Aug 6, 2024

mattdangerw commented Aug 6, 2024 •

edited

Loading