Add beam search decoding util #237
Conversation
keras_nlp/utils/text_generation.py
Outdated
    prompt: a list or a Tensor, can be 1D or 2D, the initial tokens to
        append generated tokens. The initial beam for beam search.
    max_length: int. The max length of generated text.
    beam_width: int. The number of beams that should be kept at each
Why "width"? What about just `num_beams`?
If inputs are batched, inputs should be `tf.RaggedTensor`s with shape
`[batch_size, None]` and will be packed and converted to a dense tensor with
shape `[batch_size, sequence_length]`.
If inputs are batched, inputs should either be `tf.RaggedTensor`s with shape
This isn't part of this PR I think!
    respective beams, before beginning the next iteration.

    Args:
        token_probability_fn: a callable, which takes in input_sequence
We should document the input shape and output shape expected by this function, as it is a non-standard batch size.
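For illustration, a minimal sketch of the shape contract being requested here (the vocabulary size is invented, and numpy stands in for a real model; this is not the keras-nlp implementation):

```python
import numpy as np

VOCAB_SIZE = 8  # hypothetical vocabulary size

def token_probability_fn(inputs):
    # Takes token ids of shape [batch_size * beam_size, length] and
    # returns next-token probabilities of shape
    # [batch_size * beam_size, VOCAB_SIZE]. A real implementation would
    # call a language model here; this stand-in returns a uniform
    # distribution purely to demonstrate the shapes.
    return np.full((inputs.shape[0], VOCAB_SIZE), 1.0 / VOCAB_SIZE)

# With batch_size=2 and beam_size=3, the flattened input has 6 rows.
probs = token_probability_fn(np.zeros((2 * 3, 4), dtype=np.int32))
```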
keras_nlp/utils/text_generation.py
Outdated
""" | ||
Text generation utility based on beam search algorithm. | ||
|
||
At each time-step, beam search keeps the top `beam_width` beams (sequences), |
Let's include the information that the top `num_beams` beams means the `num_beams` sequences of highest probability.
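As a worked illustration of that pruning step (the probabilities below are made up, and numpy is used as a framework-agnostic stand-in):

```python
import numpy as np

num_beams, vocab_size = 2, 4

# Cumulative log-probability of each live beam so far.
beam_log_probs = np.log(np.array([0.6, 0.4]))

# Next-token probabilities predicted for each beam (one row per beam).
next_probs = np.array([
    [0.5, 0.2, 0.2, 0.1],
    [0.4, 0.3, 0.2, 0.1],
])

# Score every (beam, token) continuation, flatten, and keep the top
# `num_beams` candidates: the sequences of highest total probability.
total = (beam_log_probs[:, None] + np.log(next_probs)).reshape(-1)
top = np.argsort(total)[::-1][:num_beams]
parent_beam, token = top // vocab_size, top % vocab_size
# parent_beam -> [0, 1], token -> [0, 0]: the two best continuations.
```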
keras_nlp/utils/text_generation.py
Outdated
i = length
while i < max_length:
    beam_size = beams.shape[1]
    reshaped_beam = tf.reshape(beams, [batch_size * beam_size, i])
Per our offline discussion, let's retain the loop over beams.
Ah, I was saying we could either document this expectation or retain the loop. Either way, I think we need to fix something here: we currently document the fn input as shape `[batch_size, length]`, but it's actually `[batch_size * beam_size, length]`, right? Fine with either bringing back the loop or just documenting the current behavior correctly.
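To make the shape question concrete, here is a minimal sketch of the flatten/unflatten round trip being discussed (all dimensions are invented, and a uniform distribution stands in for the real probability fn):

```python
import numpy as np

batch_size, beam_size, length, vocab_size = 2, 3, 5, 7
beams = np.zeros((batch_size, beam_size, length), dtype=np.int32)

# Flatten so the probability fn sees [batch_size * beam_size, length],
# not the documented [batch_size, length].
flat_beams = beams.reshape(batch_size * beam_size, length)

# Stand-in for token_probability_fn: uniform next-token distribution.
probs = np.full((flat_beams.shape[0], vocab_size), 1.0 / vocab_size)

# Un-flatten so each beam's scores sit under its batch element again.
probs = probs.reshape(batch_size, beam_size, vocab_size)
```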
keras_nlp/utils/__init__.py
Outdated
from keras_nlp.utils.text_generation import greedy_search
from keras_nlp.utils.text_generation import random_search
from keras_nlp.utils.text_generation import top_k_search
from keras_nlp.utils.text_generation import top_p_search
This change already got in; make sure to rebase.
This looks good to me! Thank you!
Hi, I ran the "English-to-Spanish translation with KerasNLP" Colab notebook by Abheesht Sharma and changed the decoding util from "top_p" to "beam" and got all the dimension mismatch errors alluded to above. Is there a tidy example of how the token_probability_fn and beam_search decoder should be set up in this case? |
No description provided.