
Missing head_mask and decoder_head_mask arguments in encoder-decoder models #9814

Closed
stancld opened this issue Jan 26, 2021 · 0 comments · Fixed by #9819, #9856, #9964 or #9988


stancld commented Jan 26, 2021

🚀 Feature request

Following PRs #9569, #9634, and #9639, there remain other encoder-decoder models that either do not support the head_mask and decoder_head_mask input arguments at all, or accept only a single head_mask argument used for head masking in both the encoder and the decoder. It would therefore be nice to make this feature uniform across all encoder-decoder models. For context, here is a minimal sketch of the target call pattern, using PyTorch BART (which already accepts both arguments after #9569); the mask values below are purely illustrative:
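
```python
import torch
from transformers import BartForConditionalGeneration, BartTokenizer

tokenizer = BartTokenizer.from_pretrained("facebook/bart-base")
model = BartForConditionalGeneration.from_pretrained("facebook/bart-base")

inputs = tokenizer("Masking attention heads is fun.", return_tensors="pt")

cfg = model.config
# head_mask / decoder_head_mask have shape (num_layers, num_heads);
# 1.0 keeps a head active, 0.0 masks it out.
head_mask = torch.ones(cfg.encoder_layers, cfg.encoder_attention_heads)
decoder_head_mask = torch.ones(cfg.decoder_layers, cfg.decoder_attention_heads)
decoder_head_mask[:, 0] = 0.0  # e.g. mask the first head of every decoder layer

outputs = model(**inputs, head_mask=head_mask, decoder_head_mask=decoder_head_mask)
```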


Models:

| Model | PyTorch | TensorFlow | PR | Copy dependency |
| --- | --- | --- | --- | --- |
| BERTGeneration | ☑️ | ✖️ | - | - |
| EncoderDecoderModel | ☑️ | ✖️ | - | - |
| FSMT | ☑️ | ✖️ | #9819 | - |
| LED | ☑️ | ☑️ | PT - #9856 ; TF - #9988 | - |
| ProphetNet | ☑️ | ✖️ | #9964 | - |
| Longformer | ☑️ | ☑️ | PT - #9856; TF - #9988 | LED |
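
As a quick (hypothetical, not part of the issue itself) way to audit which model classes already expose the arguments in their forward() signature:

```python
import inspect
from transformers import FSMTForConditionalGeneration, ProphetNetForConditionalGeneration

for cls in (FSMTForConditionalGeneration, ProphetNetForConditionalGeneration):
    params = inspect.signature(cls.forward).parameters
    # Prints the class name and whether each argument is accepted.
    print(cls.__name__,
          "head_mask" in params,
          "decoder_head_mask" in params)
```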

Your contribution

I'm happy to add this feature over the following days, for both the PyTorch and TensorFlow models (likely in several shorter PRs so as not to create a single large, overwhelming one).


Reviewers: @patrickvonplaten, @jplu, @sgugger, @LysandreJik, @stas00.
