I don't need this fixed, but maybe it's of interest:
I noticed that this smatch version returns a score of 1.00 for the following two different graphs:
(s / see-01
:ARG0 (p / person
:name (n / name
:op1 "Hans")))
and
(s / see-01
:ARG0 (p / person
:name (n / name
:op1 "Hans_")))
I'm wondering whether this is a bug or whether there is some reason for it. It might be caused by some preprocessing step here that is not obvious. Since AMRs can also contain HTTPS links and similar strings that may include characters like "_", I don't think it is sensible to remove characters.
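To illustrate the kind of preprocessing that could produce this, here is a minimal sketch. It assumes a lenient comparison that lower-cases values and strips trailing underscores; the function names `strict_equal` and `lenient_equal` are hypothetical and not taken from the smatch code base, so treat this as a guess at the mechanism rather than a description of it.

```python
def strict_equal(a: str, b: str) -> bool:
    """Exact comparison: every character counts."""
    return a == b


def lenient_equal(a: str, b: str) -> bool:
    """Hypothetical lenient comparison (an assumption for illustration):
    ignores case and any trailing underscores."""
    def norm(s: str) -> str:
        return s.lower().rstrip("_")
    return norm(a) == norm(b)


# The two :op1 values from the graphs above.
print(strict_equal("Hans", "Hans_"))   # False
print(lenient_equal("Hans", "Hans_"))  # True

# The same leniency would also conflate two different link-like strings.
print(lenient_equal("https://example.org/a_", "https://example.org/a"))  # True
```

Under such a normalization, "Hans" and "Hans_" become indistinguishable, which would be consistent with the 1.00 score, and link-like values ending in "_" would be affected in the same way.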
Edit:
It can be even more severe. The score is also 1.00 for the very different graphs:
(s / see-01
:ARG0 (p / person
:name (n / name
:op1 "Hans Meier")))
and
(s / see-01
:ARG0 (p / person
:name (n / name
:op1 "Hans")))
While the result for the first two graphs could be a preprocessing quirk, the result for the second pair clearly seems like a bug.
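For reference, here is a rough sketch of the score I would expect for the second pair under exact triple matching. It is a simplification and not the smatch implementation: it assumes the variables are already aligned (both graphs use s, p, n), omits any root/TOP triple, and uses the hypothetical helpers `hans_graph` and `triple_f1`.

```python
def triple_f1(test, gold):
    """F1 over exactly matching triples (no variable alignment search)."""
    test, gold = set(test), set(gold)
    matched = len(test & gold)
    if matched == 0:
        return 0.0
    precision = matched / len(test)
    recall = matched / len(gold)
    return 2 * precision * recall / (precision + recall)


def hans_graph(name):
    """Triples for the see-01 graph above, with the given :op1 value."""
    return [
        ("instance", "s", "see-01"),
        ("instance", "p", "person"),
        ("instance", "n", "name"),
        ("ARG0", "s", "p"),
        ("name", "p", "n"),
        ("op1", "n", name),
    ]


score = triple_f1(hans_graph("Hans Meier"), hans_graph("Hans"))
print(round(score, 4))  # 0.8333
```

Five of the six triples match, so under these assumptions the score would be 5/6 rather than 1.00.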
Is this intended behaviour? I noticed that the provided test_input1 and test_input2 contain a similar example, where "Wiilliam" differs in capitalization and in underscores. It is currently counted as a match.
Hi @jzw2, I guess that from the viewpoint of AMR evaluation/similarity it can be acceptable not to distinguish between upper and lower case, and this can be considered a user choice (for very strict evaluation, we might want to distinguish, though). The observed treatment of underscores (the bug you experienced, or the example shown above), however, does seem like wrong/unintended behavior and should not count as a match.
Maybe you can check out my SMATCH++ that has a best-practice evaluation protocol with AMR normalization (and ILP solver).