Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Replication of monoBERT & monoT5 Baselines for MSMARCO Passage Ranking #128

Merged
merged 1 commit into from
Dec 11, 2020

Conversation

rakeeb-hossain
Copy link
Contributor

Modifying replication log to track that I was successfully able to replicate the monoBERT and monoT5 baseline for a subset of the MSMARCO Passage tasks (available here: https://www.dropbox.com/s/5xa5vjbjle0c8jv/msmarco_ans_small.zip).

I replicated both of these baselines in two environments:

  1. Ubuntu 20.04 using a GeForce 940MX, Python 3.8.5, Java 11, CUDA 11.0
  2. ComputeCanada supercomputer with Tesla V100, Python 3.8, Java 13, CUDA 10.1

In the second environment, there were no issues obtaining the results.

In the first environment, the following problems were encountered:

  1. When running monoBERT baseline, encountered ImportError: cannot import name 'AutoModel' from 'transformers'. See ImportError: cannot import name 'AutoModel' from 'transformers' huggingface/transformers#4172 for more details. This was fixed by running pip install torch.
  2. CUDA was not installed. Followed CUDA Toolkit installation steps to fix.
  3. PyTorch was not detecting libcudart.so.11.1, which was located at /usr/local/cuda-11.1/lib64. I had to append this path to LD_LIBRARY_PATH.
  4. Exception due to Line 149 of pygaggle/rerank/transformer.py during monoBERT baseline replication. Had to remove return_dict=False (this was not committed since the same change was not necessary in the Compute Canada environment).

@ronakice
Copy link
Member

Thanks for replicating on both environments! I believe the errors you pointed out are more personal in nature, so we can look into it if many others are having the same issues.

@ronakice ronakice merged commit 3b1f69c into castorini:master Dec 11, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants