Add BartTokenizer and BART Presets #685
Conversation
Just one minor comment. LGTM!
The weights have hopefully all been uploaded. Let me know if any of them aren't showing up.
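If the uploads are live, loading the tokenizer from a preset should look roughly like the sketch below (the preset name is an assumption; check the registered presets for the exact identifier):

```python
import keras_nlp

# Load the BART tokenizer from a registered preset. "bart_base_en" is an
# assumed preset name and may differ from the ones actually uploaded.
tokenizer = keras_nlp.models.BartTokenizer.from_preset("bart_base_en")

# Tokenize a batch of strings into token IDs.
token_ids = tokenizer(["The quick brown fox jumped.", "Call me Ishmael."])
```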
Examples:

Batched inputs.
>>> vocab = {"<s>": 0, "<pad>": 1, "</s>": 2, "reful": 3, "gent": 4}
Can we shrink these vocabs down? They end up distracting from the actual tokenizer, where we want the attention to be. I was playing around with some shorter examples on #653, e.g. https://github.com/keras-team/keras-nlp/blob/dc62952b023602fde8e5c2373894a449be15265f/keras_nlp/models/roberta/roberta_preprocessor.py#L135-L149
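Something along the lines of the linked RoBERTa snippet could work here too; a trimmed-down sketch (the exact vocabulary, merges, and example sentence are illustrative, not the docstring from this PR):

```python
import keras_nlp

# A deliberately tiny vocabulary and merge list, just enough to tokenize the
# example sentence, so the focus stays on the tokenizer call itself.
vocab = {"<s>": 0, "<pad>": 1, "</s>": 2, "a": 3, "Ġquick": 4, "Ġfox": 5}
merges = ["Ġ q", "u i", "c k", "ui ck", "Ġq uick"]
merges += ["Ġ f", "o x", "Ġf ox"]

tokenizer = keras_nlp.models.BartTokenizer(vocabulary=vocab, merges=merges)
token_ids = tokenizer("a quick fox")
```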
    def presets(cls):
        return copy.deepcopy(backbone_presets)

    @classmethod
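For context, this excerpt presumably comes from the usual preset-registration pattern in keras_nlp; a rough reconstruction is below (the import paths, the `classproperty` helper location, and the stand-in class name are assumptions):

```python
import copy

from keras_nlp.models.bart.bart_presets import backbone_presets  # assumed module path
from keras_nlp.utils.python_utils import classproperty  # assumed helper location


class TokenizerWithPresets:  # hypothetical stand-in for BartTokenizer
    @classproperty
    def presets(cls):
        # Return a deep copy so callers cannot mutate the shared preset registry.
        return copy.deepcopy(backbone_presets)

    @classmethod
    def from_preset(cls, preset, **kwargs):
        # Construct a tokenizer from one of the registered presets
        # (sketch only; the real implementation lives in the PR / base class).
        ...
```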
Also, we have now done the base class stuff for the tokenizer! Though it's slightly different, as there are many tokenizer types. #673
You can remove all this.
Thanks!