implement pure Python "span-F1" evaluation routine applicable to BIO/BIOS #75

miracle24 · 2018-08-17T08:50:02Z

Hello. Thanks for your wonderful code. I see that you also calculate the F1 score using the Perl script. I think there is a problem if this script is used to calculate the F1 score for the BIOS tag scheme, since the Perl script cannot deal with tags that start with 'S-'. So I wonder whether you corrected the script, or whether you convert the tag scheme of the tokens before using it. If I am wrong, please let me know. Thanks again.

alanakbik · 2018-08-17T11:54:21Z

Hi Miracle, many thanks for spotting this. The script indeed does not have any special provisions for the S-tags, but it seems to default to chunkstart=True and chunkend=True for any non-BIE entity tag, which is good. I've run some tests and found some rare cases where it fails to work as intended, so this will probably have a minor effect on evaluation numbers and means that I have to take a closer look. I think for the upcoming release we might end up just implementing our own evaluation routine in Python, which will make a lot of this easier!
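To make the idea of a pure Python span-F1 routine concrete, here is a minimal sketch. It is not the code that was eventually committed for this issue; the names `extract_spans` and `span_f1` are hypothetical, and the handling of edge cases is an assumption: an 'S-' tag is treated as a complete single-token span, and a span that begins with 'I-' is accepted leniently as a span start.

```python
from typing import List, Optional, Set, Tuple

# (start index, end index inclusive, entity type); illustrative type alias
Span = Tuple[int, int, str]


def extract_spans(tags: List[str]) -> Set[Span]:
    """Collect entity spans from one sentence tagged with BIO or BIOES/BIOS labels."""
    spans: Set[Span] = set()
    start: Optional[int] = None
    current_type: Optional[str] = None
    # A trailing sentinel "O" flushes a span that runs to the end of the sentence.
    for i, tag in enumerate(list(tags) + ["O"]):
        prefix, _, etype = tag.partition("-")
        outside = prefix == "O" or etype == ""
        # An open span ends when we leave entities, see a new start marker,
        # or the entity type changes.
        if start is not None and (
            outside or prefix in ("B", "S") or etype != current_type
        ):
            spans.add((start, i - 1, current_type))
            start = None
        if not outside:
            if start is None:
                # Assumption: any entity tag may open a span, including "I-".
                start, current_type = i, etype
            if prefix in ("S", "E"):
                # "S-" and "E-" close the span on the current token.
                spans.add((start, i, current_type))
                start = None
    return spans


def span_f1(
    gold: List[List[str]], pred: List[List[str]]
) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) over exactly matching spans."""
    tp = fp = fn = 0
    for gold_tags, pred_tags in zip(gold, pred):
        gold_spans = extract_spans(gold_tags)
        pred_spans = extract_spans(pred_tags)
        tp += len(gold_spans & pred_spans)
        fp += len(pred_spans - gold_spans)
        fn += len(gold_spans - pred_spans)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


gold = [["B-PER", "I-PER", "O", "S-LOC"]]
pred = [["B-PER", "I-PER", "O", "O"]]
precision, recall, f1 = span_f1(gold, pred)
```

In this example the gold sentence contains two spans and the prediction recovers only the PER span, so precision is 1.0 and recall is 0.5. Because the spans are compared as sets of (start, end, type) triples, the same routine applies to BIO and BIOS input without converting the tag scheme first.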

tabergma · 2018-10-11T07:43:59Z

in release-0.3

alanakbik added the release-0.3 label Aug 17, 2018

alanakbik changed the title ~~what the tag scheme during evaluation for NER on Conll 2003?~~ implement pure Python "span-F1" evaluation routine applicable to BIO/BIOS Aug 17, 2018

alanakbik added the bug label Aug 17, 2018

stefan-it mentioned this issue Aug 19, 2018

call eval.pl bugs #79

Closed

alanakbik mentioned this issue Sep 3, 2018

Named entity starts with "I" tag #97

Closed

alanakbik pushed a commit that referenced this issue Sep 14, 2018

GH-75: added span class

49d73e8

alanakbik pushed a commit that referenced this issue Sep 14, 2018

GH-75: added native span-F1 evaluation method

044e56c

alanakbik pushed a commit that referenced this issue Sep 17, 2018

GH-75: inferspaceafter() is not public method

791bdd5

tabergma added a commit that referenced this issue Sep 17, 2018

Merge pull request #113 from zalandoresearch/GH-75-spans

72f82f3

GH 75 spans

tabergma pushed a commit that referenced this issue Sep 20, 2018

GH-75: added span class

a48ae75

tabergma pushed a commit that referenced this issue Sep 20, 2018

GH-75: added native span-F1 evaluation method

e191ac4

tabergma pushed a commit that referenced this issue Sep 20, 2018

GH-75: inferspaceafter() is not public method

27d4212

tabergma closed this as completed Oct 11, 2018

fsonntag mentioned this issue Oct 19, 2018

Added class-based metrics #164

Merged
