Integrate ELECTRA Pretrained Model into Keras_NLP Ecosystem #1281

pranavvp16 · 2023-10-24T13:33:12Z

I would like to add the ELECTRA pretrained model to the keras_nlp ecosystem. I have gone through CONTRIBUTING_MODELS.md, and opening an issue is the first step in adding a model. I have a pretty good understanding of transformers and have gone through the ELECTRA architecture mentioned here. If there is no problem with the integration, I can start working on this!

mattdangerw · 2023-10-25T20:41:56Z

Electra would be a welcome addition! Thanks! As mentioned in the model contribution guide, first thing to do would be to write a backbone and a checkpoint conversion script from a known source that shows we can match outputs.

Hugging Face might be the easiest checkpoint source. The original ELECTRA repo would be totally fine as well, but it looks like the implementation is in TF1, which would be a little painful to work with.
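To make "match outputs" concrete, here is a minimal sketch of the kind of numerical check a conversion script might end with. It is not taken from the contribution guide: the helper name `outputs_match`, the tolerance, and the stand-in arrays are all assumptions for illustration. In a real script the two arrays would be the outputs of the source model and the ported backbone on the same input.

```python
import numpy as np


def outputs_match(reference, candidate, atol=1e-5):
    """Return (ok, max_abs_diff) for two same-shaped output arrays.

    Hypothetical helper: `atol` is an assumed tolerance, not a project rule.
    """
    reference = np.asarray(reference, dtype="float32")
    candidate = np.asarray(candidate, dtype="float32")
    if reference.shape != candidate.shape:
        raise ValueError(
            f"shape mismatch: {reference.shape} vs {candidate.shape}"
        )
    max_abs_diff = float(np.max(np.abs(reference - candidate)))
    return max_abs_diff <= atol, max_abs_diff


# Stand-in arrays shaped (batch, sequence_length, hidden_dim). In a real
# conversion script these would come from the source and ported models.
rng = np.random.default_rng(0)
reference_output = rng.standard_normal((2, 8, 16)).astype("float32")
ported_output = reference_output + 1e-7  # simulate tiny numerical drift

ok, diff = outputs_match(reference_output, ported_output)
print("outputs match:", ok, "max abs diff:", diff)
```

Reporting the maximum absolute difference alongside the pass/fail flag makes it easier to tell a genuine porting bug from ordinary floating-point drift.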

pranavvp16 · 2023-10-27T15:38:18Z

Thanks for assigning the issue, I'll start working on it

shivance · 2023-10-30T04:15:54Z

@mattdangerw I had a Colab floating around that implements ELECTRA in KerasNLP; I will share it if I find it. It would be helpful for @pranavvp16.

shivance · 2023-10-30T11:05:11Z

#794

pranavvp16 · 2023-10-30T14:55:56Z

@shivance thanks for the notebook, but I think it covers the pre-training approach of ELECTRA, while I'm trying to implement the backbone of ELECTRA, which is similar to BERT with some changes. Please let me know if I'm wrong here, because I'm pretty new to this. I have implemented the backbone successfully in Keras according to the ELECTRA architecture, and I am working on weight conversion from the Hugging Face checkpoint.
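One detail that often comes up in this kind of weight conversion, sketched here under the assumption that the Hugging Face checkpoint is in PyTorch format: a PyTorch `Linear` layer stores its weight as `(out_features, in_features)`, while a Keras `Dense` kernel is `(input_dim, units)`, so dense weights need a transpose when ported. The function name below is hypothetical, and the arrays are random stand-ins rather than real ELECTRA weights.

```python
import numpy as np


def torch_linear_to_keras_dense(torch_weight, torch_bias):
    """Convert PyTorch-style Linear parameters to Keras Dense layout.

    Hypothetical helper for illustration; the bias is copied unchanged.
    """
    kernel = np.ascontiguousarray(np.asarray(torch_weight).T)
    bias = np.asarray(torch_bias)
    return kernel, bias


rng = np.random.default_rng(0)
torch_weight = rng.standard_normal((4, 3)).astype("float32")  # (out=4, in=3)
torch_bias = rng.standard_normal(4).astype("float32")
kernel, bias = torch_linear_to_keras_dense(torch_weight, torch_bias)

# Both layouts should compute the same affine map on the same input.
x = rng.standard_normal((2, 3)).astype("float32")
torch_style = x @ torch_weight.T + torch_bias  # PyTorch: x W^T + b
keras_style = x @ kernel + bias                # Keras:   x K + b
print(kernel.shape, np.allclose(torch_style, keras_style))
```

Embedding tables and layer-normalization parameters usually copy over without a transpose, but that should be confirmed against the actual checkpoint rather than assumed.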

mattdangerw · 2023-10-31T06:55:49Z

> I'm trying to implement the backbone of ELECTRA, which is similar to BERT with some changes. Please let me know if I'm wrong here, because I'm pretty new to this. I have implemented the backbone successfully in Keras according to the ELECTRA architecture, and I am working on weight conversion from the Hugging Face checkpoint.

Yes that's definitely a good approach! Let's start with the backbone, not the pretraining approach.

The pretraining approach would make for an excellent keras.io example (which was #794), but beginning with a backbone is the right first step on adding this model to our API.

Will take a look at the issues on the PR shortly!

mattdangerw assigned pranavvp16 Oct 25, 2023

mattdangerw added the type:feature (New feature or request) label Oct 25, 2023

pranavvp16 mentioned this issue Oct 31, 2023

ELECTRA backbone implementation in keras #1291

Merged
