This repository has been archived by the owner on Dec 16, 2022. It is now read-only.

Adds the pretrained module from allennlp-hub. #42

Merged
merged 11 commits into master from Pretrained on May 8, 2020

Conversation

dirkgr
Member

@dirkgr dirkgr commented May 4, 2020

These tests might not succeed because of the disk space issue.

Finishes allenai/allennlp#3302.
Closes allenai/allennlp#3965.

@dirkgr dirkgr requested a review from epwalsh May 4, 2020 23:57
@dirkgr
Member Author

dirkgr commented May 5, 2020

Our organization of pre-trained models is still a total mess. We now keep the URLs in three places: here, and twice in allennlp-models. They don't all agree, and they don't all list the same models. We will have to fix this soon. It's confusing.

@epwalsh
Member

epwalsh commented May 8, 2020

Looks like the tests you wrote aren't found by pytest. I think by default pytest assumes test classes are named Test* instead of *Test. But if we just add this line to the pytest.ini file it will find them.
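The comment doesn't show the line itself. A plausible sketch, assuming the repo's pytest.ini uses the standard [pytest] section, is the python_classes option (whose default is Test*):

```ini
[pytest]
python_classes = Test* *Test
```

With both patterns listed, pytest collects classes named either Test* or *Test.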

@epwalsh
Member

epwalsh commented May 8, 2020

Another thing to consider is that we'll be re-downloading these models on each CI run, which will run up our cloud bill. If we used a self-hosted runner for the pretrained tests we wouldn't have this problem since the models would be cached locally.

return predictor


def named_entity_recognition_with_elmo_peters_2018() -> SentenceTaggerPredictor:
Member

I think we'll need to import the crf tagger module for this to work

# By default we don't run these tests
@pytest.mark.skipif(
not os.environ.get("ALLENNLP_MODELS_RUN_PRETRAINED_TEST"), reason="requires massive downloads"
)
Member Author

The GPU tests are selected in a different way, aren't they? Why isn't this done the same way?

Member

Oh yea, we should probably run those on the self-hosted runner as well

Member

@epwalsh epwalsh May 8, 2020

Sorry - I realize your question now. I felt like there should be an explicit skip condition here, because if you were running tests locally you might not want to download these massive files, and I didn't know what that condition could be other than an environment variable flag.

Member

But we could also add a custom mark like we do for GPU tests and then exclude those by default. That might be cleaner.
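As a sketch of that alternative (the marker name pretrained is an assumption, not something from this PR), the mark could be registered in pytest.ini and deselected by default:

```ini
[pytest]
markers =
    pretrained: tests that download large pretrained models
addopts = -m "not pretrained"
```

Tests decorated with @pytest.mark.pretrained would then be skipped in a plain pytest run, and selected explicitly with pytest -m pretrained, since a -m given on the command line overrides the one in addopts.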

Member Author

Looks like we're inconsistent about how we handle GPU tests. Some have @pytest.mark.gpu, others use skipif and detect how many GPUs there are. Still others have both.

Member

The skipif ensures they're never accidentally run if you don't have a GPU available. The mark.gpu is just so that we can select only the GPU tests to run.
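The belt-and-braces pattern described here can be sketched as follows. This is illustrative only: checking for nvidia-smi on PATH stands in for whatever device check the project actually uses (e.g. counting CUDA devices), and the test body is a placeholder.

```python
import shutil

import pytest

# skipif guard: the test is never run accidentally on a machine without a GPU.
# Detecting nvidia-smi on PATH is an illustrative stand-in for a real device check.
requires_gpu = pytest.mark.skipif(
    shutil.which("nvidia-smi") is None, reason="No GPU available"
)


# mark.gpu lets CI select exactly the GPU tests with `pytest -m gpu`;
# the skipif guard still applies even when the test is selected.
@pytest.mark.gpu
@requires_gpu
def test_runs_only_on_gpu():
    # Placeholder body; a real test would move a model to the GPU here.
    assert True
```

Both decorators attach marks to the test, so selection (-m gpu) and the safety skip work independently of each other.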

@dirkgr
Member Author

dirkgr commented May 8, 2020

I think this is good to go then?

Member

@epwalsh epwalsh left a comment


I think this is good to go. I'll follow up with a PR to add a GPU testing workflow.

@dirkgr dirkgr merged commit 96d623c into master May 8, 2020
@dirkgr dirkgr deleted the Pretrained branch May 8, 2020 20:52
Successfully merging this pull request may close these issues.

Move pretrained stuff from -hub to -models