This repository has been archived by the owner on Dec 16, 2022. It is now read-only.

Add sampling strategies to beam search #4768

Merged

epwalsh merged 20 commits into master from beam-search-sampler-try on Nov 11, 2020

Conversation

@epwalsh epwalsh (Member) commented Nov 3, 2020

TODO (@jvstokes)

  • implement multinomial sampler
  • implement top-k sampler
  • implement top-p sampler
  • test multinomial sampler
  • test top-k sampler
  • test top-p sampler
  • test Gumbel sampler, especially with edge cases like when the end token is predicted
  • update CHANGELOG
  • finalize documentation

@epwalsh epwalsh changed the title Add sampler strategies to beam search Add sampling strategies to beam search Nov 3, 2020
@epwalsh epwalsh (Member Author) left a comment

Almost there 😬! My suggestions are mostly cosmetic. I also think I found a way to avoid the loop within top-p.

@epwalsh epwalsh marked this pull request as ready for review November 10, 2020 18:01
Comment on lines 248 to 254
# Create a mask for filtering out probabilities that don't make the top `p`.
# shape: (batch_size, num_classes)
exclusion_mask = probabilities_summed >= self.p

# We want to include the firt index where probabilities_summes >= p, so we shift over one.
exclusion_mask[..., 1:] = exclusion_mask[..., :-1].clone()
exclusion_mask[..., 0] = False
@epwalsh (Member Author)

How about this:

Suggested change

- # Create a mask for filtering out probabilities that don't make the top `p`.
- # shape: (batch_size, num_classes)
- exclusion_mask = probabilities_summed >= self.p
- # We want to include the firt index where probabilities_summes >= p, so we shift over one.
- exclusion_mask[..., 1:] = exclusion_mask[..., :-1].clone()
- exclusion_mask[..., 0] = False
+ # Create a mask for filtering out probabilities that don't make the top `p`.
+ # shape: (batch_size, num_classes)
+ exclusion_mask = probabilities_summed > self.p
+ # Make sure there's at least `per_node_beam_size` options.
+ exclusion_mask[..., :per_node_beam_size] = False

Contributor

I think we actually need both. If we don't shift the mask, then we only keep tokens whose cumulative sum is less than p, i.e.

probs = [0.5, 0.31, 0.19], p = 0.8
cumsum = [0.5, 0.81, 1.0]
mask: [False, True, True]
softmax: [1.0, 0.0, 0.0]
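
Here's a minimal PyTorch sketch reproducing that trace (variable names mirror the snippet under review; the standalone setup is just for illustration):

import torch

probs = torch.tensor([0.5, 0.31, 0.19])
p = 0.8

# Cumulative sum of the (already sorted) probabilities.
probabilities_summed = torch.cumsum(probs, dim=-1)  # [0.50, 0.81, 1.00]

# Without the shift, the index that pushes the sum past `p` is excluded too.
exclusion_mask = probabilities_summed >= p  # [False, True, True]

# After masking and renormalizing, only the first token survives, even
# though top-p should keep enough tokens to cover at least `p` of the mass.
filtered = probs.masked_fill(exclusion_mask, 0.0)
print(filtered / filtered.sum())  # tensor([1., 0., 0.])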

@epwalsh epwalsh (Member Author) Nov 10, 2020

Good point. How about this then:

Suggested change

- # Create a mask for filtering out probabilities that don't make the top `p`.
- # shape: (batch_size, num_classes)
- exclusion_mask = probabilities_summed >= self.p
- # We want to include the firt index where probabilities_summes >= p, so we shift over one.
- exclusion_mask[..., 1:] = exclusion_mask[..., :-1].clone()
- exclusion_mask[..., 0] = False
+ # Create a mask for filtering out probabilities that don't make the top `p`.
+ # shape: (batch_size, num_classes)
+ exclusion_mask = probabilities_summed >= self.p
+ # We want to include the first index where probabilities_summed >= p, so we shift over one.
+ exclusion_mask[..., 1:] = exclusion_mask[..., :-1].clone()
+ # We also want to make sure we have at least `per_node_beam_size` options.
+ exclusion_mask[..., :per_node_beam_size] = False

(only change is the last line)
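
For context, here is a self-contained sketch of how that final mask might slot into a full top-p sampler. The surrounding structure (sorting, renormalizing, torch.multinomial) is an assumption for illustration, not the PR's exact code; `logits` is assumed to have shape (batch_size, num_classes):

import torch
import torch.nn.functional as F

def top_p_sample(logits: torch.Tensor, p: float, per_node_beam_size: int,
                 with_replacement: bool = False) -> torch.Tensor:
    # shape: (batch_size, num_classes)
    probabilities = F.softmax(logits, dim=-1)

    # Sort descending so the cumulative sum covers the most probable tokens first.
    sorted_probs, sorted_indices = torch.sort(probabilities, dim=-1, descending=True)
    probabilities_summed = torch.cumsum(sorted_probs, dim=-1)

    # Create a mask for filtering out probabilities that don't make the top `p`.
    exclusion_mask = probabilities_summed >= p
    # Include the first index where probabilities_summed >= p by shifting over one.
    exclusion_mask[..., 1:] = exclusion_mask[..., :-1].clone()
    # Make sure there are at least `per_node_beam_size` options to sample from.
    exclusion_mask[..., :per_node_beam_size] = False

    # Zero out excluded tokens, renormalize, and sample.
    filtered = sorted_probs.masked_fill(exclusion_mask, 0.0)
    filtered = filtered / filtered.sum(dim=-1, keepdim=True)
    sampled = torch.multinomial(filtered, per_node_beam_size, replacement=with_replacement)

    # Map positions in the sorted tensor back to original vocabulary indices.
    return sorted_indices.gather(-1, sampled)

For example, top_p_sample(logits, p=0.9, per_node_beam_size=5) would return 5 distinct token ids per batch row, since the mask guarantees at least 5 tokens survive filtering.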

Contributor

Do we want to allow the client to sample with replacement from < per_node_beam_size options?

Contributor

i.e. maybe they sample 10 examples from the top 8 cumulative probabilities

@epwalsh (Member Author)

Possibly. So maybe fill the first per_node_beam_size entries with False only if with_replacement is False, and otherwise only fill the first entry with False.
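
For example, a minimal sketch of that conditional, assuming the sampler grows a with_replacement flag:

if with_replacement:
    # With replacement, a single valid option is enough to draw from.
    exclusion_mask[..., 0] = False
else:
    # Without replacement we need at least `per_node_beam_size` distinct options.
    exclusion_mask[..., :per_node_beam_size] = False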

Contributor

If you don't think we should build that functionality in, though, I can clean it up!

@epwalsh (Member Author)

I think it's okay to have that.

Contributor

Oops, sorry, I forgot to refresh and missed your comment! And yes, that's the change I ended up making, so it should be all good.

@epwalsh epwalsh merged commit 9f7cc24 into master Nov 11, 2020
@epwalsh epwalsh deleted the beam-search-sampler-try branch November 11, 2020 00:00