Demo results do not match when the model is loaded locally #3418
Comments
While the demo models are hosted on Google Cloud for performance reasons (it's faster and cheaper to download from GCS for the running demo), they are identical.
I don't know why you are seeing different performance. Can you give specific examples of the differences you're seeing?
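One way to rule out a difference in the model files themselves is to compare checksums of the two archives after downloading both. This is only a minimal sketch: the helper names `sha256_of` and `same_archive` are made up for illustration, and the paths you pass in are whatever local copies you have.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        # Read fixed-size chunks so large model archives are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_archive(path_a, path_b):
    """True if the two files have identical contents (hypothetical helper)."""
    return sha256_of(path_a) == sha256_of(path_b)
```

If the digests match, the weights are byte-for-byte the same and any difference in output has to come from somewhere else in the environment.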
See here: https://github.com/allenai/allennlp/blob/9a6962f00d2b0d30b81900b4e9764ddc3433f400/tutorials/how_to/elmo.md#notes-on-statefulness-and-non-determinism. There are several other issues in the repo with more discussion on this; you can probably find them by searching for links to the note I linked to.
@matt-gardner does this model use ELMo? I gathered from the name that it didn't.
Hello, I have just encountered this problem. Given the sentence:
While the demo returns this beautiful result:
The Python API returns:
In other sentences, I got the same results. I am using this model
Thank you for your work! allennlp is really useful :) @matt-gardner
This is almost certainly due to a mismatch in spaCy models. We use spaCy to detect verbs, and different versions of the spaCy models detect verbs differently, especially with words like "was". In the demo, which uses an older version of spaCy, "was" is tagged as a verb, so a prediction is made for it. In newer versions of spaCy, I believe "was" in this sentence is tagged as AUX, so no prediction is made.
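To illustrate the effect described above, here is a rough sketch of how a change in part-of-speech tags changes which tokens are treated as predicates. It assumes predicates are selected by checking each token's coarse POS tag against `"VERB"`; the example sentence and both tag sequences are invented for illustration, and `predicate_indices` and `tag_with_spacy` are hypothetical helpers, not part of allennlp.

```python
def predicate_indices(tagged_tokens, verb_tags=("VERB",)):
    """Return the indices of tokens whose POS tag counts as a predicate.

    `tagged_tokens` is a list of (text, pos) pairs.
    """
    return [i for i, (_, pos) in enumerate(tagged_tokens) if pos in verb_tags]


def tag_with_spacy(sentence, model_name="en_core_web_sm"):
    """Tag a sentence with an installed spaCy model (model name is an assumption)."""
    import spacy  # imported lazily so the rest of the sketch runs without spaCy

    nlp = spacy.load(model_name)
    return [(token.text, token.pos_) for token in nlp(sentence)]


# Invented tag sequences for the same made-up sentence under two tagger versions.
older_tags = [("The", "DET"), ("report", "NOUN"), ("was", "VERB"), ("late", "ADJ")]
newer_tags = [("The", "DET"), ("report", "NOUN"), ("was", "AUX"), ("late", "ADJ")]

print(predicate_indices(older_tags))  # [2]: "was" is a predicate, so a frame is predicted
print(predicate_indices(newer_tags))  # []: no predicate found, so nothing is predicted
```

Running `tag_with_spacy` on the problem sentence in both environments and comparing the tags should show whether this is the cause in a given case.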
You are right. I have just downgraded spacy to 2.1.4 and now the behaviour is the same as in the demo. Thank you
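When comparing a local setup against the demo, it helps to record exactly which package versions are installed. A small sketch using only the standard library (Python 3.8+); `installed_versions` is a hypothetical helper, and the default package names are just the two discussed in this thread.

```python
from importlib import metadata


def installed_versions(packages=("spacy", "allennlp")):
    """Map each package name to its installed version, or None if it is missing."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'not installed'}")
```

spaCy models are installed as separate packages with their own versions, so their names can be passed in as well.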
Hello. As you can see, it classifies 'to review' as Purpose. But on the demo, it correctly says that this is not a Purpose. I have tried with newer as well as older spacy versions, specifically: 2.1.4, 2.1.9 and 2.2.4.
I have used the SRL model (and other models as well). The output generated on the demo screen seems to be more accurate than the results obtained locally. What is the difference between the models loaded for the demo and the ones loaded locally? Or are there any extra files that need to be added to get better predictions?