bert_base_zh, bert_base_multi_cased: Add BERT Base Variants #319
Conversation
@abheesht17 I think we would want to avoid the add_pooling_layer parameter if possible. One thing we could check is the BERT checkpoints on the original BERT repo. Could you see if those include the pooling layer?
My bad. The pooling layers are present in this checkpoint after all: https://storage.googleapis.com/tf_model_garden/nlp/bert/v3/multi_cased_L-12_H-768_A-12.tar.gz. I missed them earlier.
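For reference, a minimal way to inspect a checkpoint for pooler weights, assuming the archive above has been downloaded and extracted (the local path below is a placeholder, not part of this PR):

import tensorflow as tf

# Hypothetical path to the extracted checkpoint prefix; adjust to wherever
# the archive linked above was unpacked.
ckpt = "multi_cased_L-12_H-768_A-12/bert_model.ckpt"

# Checkpoints from the original BERT repo name the pooler explicitly
# (bert/pooler/dense/{kernel,bias}); model-garden object checkpoints use
# generic encoder/layer_with_weights-N slots instead, so listing all
# variables and checking names/shapes is the reliable way to look.
for name, shape in tf.train.list_variables(ckpt):
    print(name, shape)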
This looks good! We recently renamed the inputs to the models in #327; could you make the changes in your colab to match?
I think we are fine with the weights we have now (as the variable names do not depend on the input names), but it might be good to double-check that the rename did not break the weights we uploaded.
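Not something settled in this thread, but one way to double-check would be to reload the uploaded weights and run a forward pass; a minimal sketch, where create_bert_base() is a hypothetical stand-in for the model-building code in the conversion colab and the input names follow the rename in #327 (both are assumptions):

import numpy as np

# create_bert_base() is a hypothetical stand-in for the model-building code
# in the conversion colab; "model.h5" stands in for the uploaded weights.
model = create_bert_base()
model.load_weights("model.h5")

# Weight loading keys off variable names, not Input layer names, so it
# should succeed after the rename; a forward pass on dummy inputs confirms
# the graph is still wired correctly end to end.
dummy = {
    "token_ids": np.zeros((1, 128), dtype="int32"),
    "segment_ids": np.zeros((1, 128), dtype="int32"),
    "padding_mask": np.ones((1, 128), dtype="int32"),
}
print(model(dummy))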
" )\n", | ||
"\n", | ||
"model.get_layer(\"pooled_dense\").kernel.assign(\n", | ||
" weights[f\"encoder/layer_with_weights-{i + 5}/kernel/.ATTRIBUTES/VARIABLE_VALUE\"]\n", |
This is using a variable (i) that is now out of scope. Can't you just do num_layers + X?
Same comment for the other colab.
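Concretely, since i exits the layer-conversion loop at num_layers - 1, the same checkpoint slot can be addressed without the leftover loop variable; a sketch based on the snippet above, where weights and num_layers come from the notebook:

# Suggested fix: derive the pooler's checkpoint slot from num_layers rather
# than reusing the loop variable i after the loop has ended. With
# i == num_layers - 1 at loop exit, i + 5 == num_layers + 4.
model.get_layer("pooled_dense").kernel.assign(
    weights[
        f"encoder/layer_with_weights-{num_layers + 4}"
        "/kernel/.ATTRIBUTES/VARIABLE_VALUE"
    ]
)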
This looks good to me! Thanks!
Resolves #308
Converted model weights and vocab have been uploaded here: https://drive.google.com/drive/folders/19APUi7fobORdQjoe8YDyk8ou0x_L6hWu?usp=sharing.
Edit: Merging #320 with this PR; it might otherwise cause conflicts when merging with master.