Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
#1985
Test:
corpus = EUROPARL_NER_GERMAN(label_name_map={'LOC': 'location test a-b-c', 'PPER': 'person test a-b-c'})
print(corpus.test[83])
Output:
Sentence: "In Wien gab es eine große Konferenz ." [− Tokens: 8 − Token-Labels: "In <in/APPR/I-PC> Wien <Wien/NE/I-PC/S-location test a b c> gab <geben/VVFIN/I-VC> es <es/person test a b c/I-NC> eine <ein/ART/B-NC> große <groß/ADJA/I-NC> Konferenz <Konferenz/NN/I-NC> . <./$.>"]
As you can see, my implementation replaces "-" with " " because e.g. the methods iob2 and iob_iobes in data.py run into problems when tags contain "-".