
Adding a FNetMaskedLM task model and preprocessor #740

Merged: 11 commits into keras-team:master on Feb 24, 2023

Conversation

apupneja
Contributor

Solves #722

@google-cla

google-cla bot commented Feb 10, 2023

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up-to-date status, view the checks section at the bottom of the pull request.

@apupneja
Contributor Author

Can you please approve the CI for this PR @mattdangerw?

@mattdangerw mattdangerw self-requested a review February 16, 2023 17:16
Member

@mattdangerw left a comment


Thanks for the PR! Left some initial comments.

keras_nlp/models/f_net/f_net_tokenizer.py (outdated, resolved)
keras_nlp/models/f_net/f_net_masked_lm.py (resolved)
token_ids, segment_ids, padding_mask = (
    x["token_ids"],
    x["segment_ids"],
    x["padding_mask"],
)
Member


I'm pretty sure f_net has no padding mask, so I think we will need to remove this.
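
A minimal sketch of how the unpacking might look with the padding mask dropped (hypothetical; not necessarily the PR's final code):

token_ids, segment_ids = x["token_ids"], x["segment_ids"]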

@@ -174,6 +174,7 @@ def call(self, x, y=None, sample_weight=None):
x = {
"token_ids": token_ids,
"segment_ids": segment_ids,
"padding_mask": token_ids != self.tokenizer.pad_token_id,
Member


We don't actually want to do this, I believe. f_net, because of the transformations it applies to the sequence in its transformer blocks, does not have a padding mask (though it does have a padding token).

cc @abheesht17 to double check this.

Contributor Author

@apupneja commented Feb 16, 2023


Yes, I believe you're correct. I looked at the official implementation (the function at line 57). I'll fix that.

Edit: Found a conversation on the same topic. I'll make the required changes.

Collaborator


Sorry, missed this. That is correct. I've confirmed this with the authors of the paper: they simply mix the entire sequence using the Fourier Transform. The model might be sensitive to examples with very different amounts of padding, but no major issues were observed in practice.
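
For reference, the mixing described in the FNet paper is a 2D discrete Fourier Transform over the sequence and hidden dimensions, keeping only the real part, so every position influences every other and no padding mask is applied. A minimal TensorFlow sketch (illustrative, not the code in this PR):

import tensorflow as tf

def fourier_mix(x):
    # x: float tensor of shape (batch, seq_len, hidden_dim).
    # FNet's token mixing: FFT over the inner two axes, keep the real part.
    return tf.math.real(tf.signal.fft2d(tf.cast(x, tf.complex64)))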

@mattdangerw
Member

> Can you please approve the CI for this PR @mattdangerw?

Done! And left some initial comments.

@apupneja
Contributor Author

I've tested this on the gist you shared in another PR.

Here's the link for it: https://colab.research.google.com/gist/apupneja/8258f796a76940394d4a510ed61d08d7/deberta-masked-lm.ipynb

@apupneja apupneja requested a review from mattdangerw February 18, 2023 04:41
Member

@mattdangerw left a comment


Thank you! This looks great! Just a few small comments to clean up, and this needs a merge/rebase with the master branch.

Disclaimer: Pre-trained models are provided on an "as is" basis, without
warranties or conditions of any kind. The underlying model is provided by a
third party and subject to a separate license, available
[here](https://github.com/facebookresearch/fairseq).
Member


This is not correct for this model! We can use the same disclaimer as BERT.


# Creating sentencepiece tokenizer for FNet LM preprocessor
bytes_io = io.BytesIO()

Member


We can remove the empty newlines past this point in the code block.
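
For context, a rough sketch of how the full docstring snippet might read once cleaned up; the toy corpus, vocab size, and special-token settings below are illustrative assumptions in the style of this repo's tests, not the PR's exact values:

import io
import sentencepiece

# Creating sentencepiece tokenizer for FNet LM preprocessor
bytes_io = io.BytesIO()
sentencepiece.SentencePieceTrainer.train(
    sentence_iterator=iter(["the quick brown fox", "the earth is round"]),
    model_writer=bytes_io,
    vocab_size=12,  # illustrative: 7 words + 4 control pieces + [MASK]
    model_type="WORD",
    pad_id=0,
    unk_id=1,
    bos_id=2,
    eos_id=3,
    pad_piece="<pad>",
    unk_piece="<unk>",
    bos_piece="[CLS]",
    eos_piece="[SEP]",
    user_defined_symbols="[MASK]",
)
proto = bytes_io.getvalue()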


tokenizer = FNetTokenizer(proto=proto)

preprocessor = FNetMaskedLMPreprocessor(
Member


Format this as one line.

Contributor Author

@apupneja commented Feb 23, 2023


By "one line", do you mean that I should create the tokenizer object directly inside the FNetMaskedLMPreprocessor constructor call?

preprocessor = FNetMaskedLMPreprocessor(
tokenizer=FNetTokenizer(proto=proto)
)

Something like this?

Member


Oh, I just mean what's inside the parentheses here:
preprocessor = FNetMaskedLMPreprocessor(tokenizer=tokenizer)

Contributor Author


Right. Fixed it.
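
Putting the resolved pieces together, the docstring example presumably ends up along these lines (sketch; the input sentence is illustrative):

tokenizer = FNetTokenizer(proto=proto)
preprocessor = FNetMaskedLMPreprocessor(tokenizer=tokenizer)
# Maps a raw string to masked token inputs, mask labels, and sample weights.
x, y, sample_weight = preprocessor("the quick brown fox")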

@mattdangerw
Member

Also, thanks so much for testing this out via the gist! Super helpful.

@mattdangerw
Member

Thanks!

@mattdangerw mattdangerw merged commit 696040f into keras-team:master Feb 24, 2023
@mattdangerw mattdangerw mentioned this pull request Apr 18, 2023