You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Following the PRs #9569, #9634 and #9639, there are other encoder-decoder models, which either do not support head_mask and decoder_head_mask input arguments at all or can be only provided with a single head_mask argument used for head masking both in encoder and decoder. It would be, therefore, nice to make this feature uniform over all the decoder-models.
I'm happy to add this feature in the following days, both for PyTorch and TensorFlow models. (Likely in shorter PRs in order not to create large, overwhelming PRs)
🚀 Feature request
Following the PRs #9569, #9634 and #9639, there are other encoder-decoder models, which either do not support
head_mask
anddecoder_head_mask
input arguments at all or can be only provided with a singlehead_mask
argument used for head masking both in encoder and decoder. It would be, therefore, nice to make this feature uniform over all the decoder-models.Models:
Your contribution
I'm happy to add this feature in the following days, both for PyTorch and TensorFlow models. (Likely in shorter PRs in order not to create large, overwhelming PRs)
Reviewers: @patrickvonplaten, @jplu, @sgugger, @LysandreJik, @stas00 .
The text was updated successfully, but these errors were encountered: