Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ernie-M #21123

Closed
2 tasks done
KnutJaegersberg opened this issue Jan 14, 2023 · 5 comments · Fixed by #21349
Closed
2 tasks done

Ernie-M #21123

KnutJaegersberg opened this issue Jan 14, 2023 · 5 comments · Fixed by #21349

Comments

@KnutJaegersberg
Copy link

Model description

Ernie-M looks pretty good in multilingual benchmarks, beating XLM-Roberta.
Paddlepaddle recently added ernie-m to the huggingface repo, we can use it with paddlenlp.transformers.
Would be nice to have the model supported in huggingface transformer as well.

Open source status

  • The model implementation is available
  • The model weights are available

Provide useful links for the implementation

https://huggingface.co/PaddlePaddle/ernie-m-base
https://huggingface.co/PaddlePaddle/ernie-m-large

@shermansiu
Copy link
Contributor

shermansiu commented Jan 17, 2023

@susnato
Copy link
Contributor

susnato commented Jan 20, 2023

Hi, @shermansiu is there any pytorch/tf implementation of this model?

@shermansiu
Copy link
Contributor

None that I'm aware of.

Anyways, the author of ERNIE-Pytorch ported over a few other Ernie models to Huggingface. I'm sure it could be adapted for this. And the PaddlePaddle syntax is quite similar to that of PyTorch, so I'm sure it should be relatively easy, though it'll probably take some time.

@susnato
Copy link
Contributor

susnato commented Jan 23, 2023

@shermansiu Thanks for the resources!
I am currently trying to port the model to huggingface(pytorch), (done till Embedding Layer with acceptable tolerance of 1e-3)

@susnato
Copy link
Contributor

susnato commented Feb 16, 2023

Hi @KnutJaegersberg, Ernie-M is implemented!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants