
NER task using Flair BertEmbeddings VS HuggingFace scripts #1508

Closed
ChessMateK opened this issue Apr 3, 2020 · 3 comments
Labels: question (Further information is requested)

Comments

@ChessMateK commented Apr 3, 2020

Hi everyone!

I am new to NLP and NER so I'm still trying to understand how exactly different architectures work.

My question is the following: is the architecture used for NER with Flair BertEmbeddings within the Flair SequenceTagger the same as the one implemented by the HuggingFace team in the PyTorch/TF example scripts here?

In particular, my doubt comes from the fact that the Flair SequenceTagger is based on a BiLSTM(-CRF), whose layers I can still see when running it, while the HuggingFace scripts are based purely on the Transformer architecture.

I am running this tutorial in Google Colab.
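For reference, the setup from the tutorial looks roughly like this (a minimal sketch following the standard Flair NER tutorial; the CoNLL-2003 corpus and the `bert-base-cased` model name are just placeholders for whatever I actually use):

```python
from flair.datasets import CONLL_03
from flair.embeddings import BertEmbeddings
from flair.models import SequenceTagger
from flair.trainers import ModelTrainer

# CoNLL-2003 is just an example corpus; any column-formatted NER corpus works
corpus = CONLL_03()
tag_dictionary = corpus.make_tag_dictionary(tag_type='ner')

# BERT used as a feature extractor that feeds the BiLSTM-CRF tagger
embeddings = BertEmbeddings('bert-base-cased')
tagger = SequenceTagger(hidden_size=256,
                        embeddings=embeddings,
                        tag_dictionary=tag_dictionary,
                        tag_type='ner',
                        use_crf=True)

trainer = ModelTrainer(tagger, corpus)
trainer.train('resources/taggers/ner-bert',
              learning_rate=0.1,
              mini_batch_size=32,
              max_epochs=150)
```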

I'd really appreciate any clarification. Thank you all in advance.

Best regards.

ChessMateK added the question label on Apr 3, 2020
@ChessMateK (Author)

Maybe I have to ask the master :-) @alanakbik

@alanakbik (Collaborator)

For the Huggingface scripts @stefan-it is the person to ask :)

Both implementations are very different: in Flair, our default sequence labeling architecture is a BiLSTM-CRF with a feature-based approach (i.e. no fine-tuning of the transformer), trained with many epochs of SGD and learning-rate annealing. HuggingFace, I believe, fine-tunes the transformer as in the BERT paper (a few epochs, a very small learning rate, the Adam optimizer), which is a very different approach.
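For contrast, the fine-tuning recipe looks roughly as below (a stripped-down sketch with illustrative hyperparameters and a hypothetical `train_dataloader`; it is not the actual HuggingFace example script):

```python
from transformers import BertForTokenClassification, AdamW

# Token-level classification head on top of BERT; all weights are updated during training
model = BertForTokenClassification.from_pretrained('bert-base-cased', num_labels=9)
optimizer = AdamW(model.parameters(), lr=3e-5)  # very small learning rate

model.train()
for epoch in range(3):  # only a few epochs
    for batch in train_dataloader:  # hypothetical DataLoader of tokenized, label-aligned batches
        outputs = model(input_ids=batch['input_ids'],
                        attention_mask=batch['attention_mask'],
                        labels=batch['labels'])
        loss = outputs[0]  # the loss is the first element of the model output
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()
```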

We are just now adding this transformer fine-tuning approach to Flair as well - it's on the master branch and undergoing testing (see #1494), so it will be part of the next release. It should allow the community to directly compare both approaches.
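Once that lands, the fine-tuning route in Flair would look roughly like the sketch below (an assumption based on the work in progress, reusing the corpus and tag dictionary from the earlier sketch; the embeddings class name and hyperparameters are illustrative, not final API):

```python
import torch
from flair.embeddings import TransformerWordEmbeddings
from flair.models import SequenceTagger
from flair.trainers import ModelTrainer

# Transformer weights are updated during training instead of being frozen
embeddings = TransformerWordEmbeddings('bert-base-cased', fine_tune=True)
tagger = SequenceTagger(hidden_size=256,
                        embeddings=embeddings,
                        tag_dictionary=tag_dictionary,  # as built from the corpus above
                        tag_type='ner',
                        use_crf=True)

# Fine-tuning style schedule: Adam-family optimizer, tiny learning rate, few epochs
trainer = ModelTrainer(tagger, corpus, optimizer=torch.optim.AdamW)
trainer.train('resources/taggers/ner-bert-finetuned',
              learning_rate=5e-6,
              mini_batch_size=16,
              max_epochs=3)
```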

@ChessMateK (Author) commented Apr 15, 2020

Thank you @alanakbik :)
