-
Notifications
You must be signed in to change notification settings - Fork 101
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MonoBERT and MonoT5 defaults #86
Comments
@lintool it already defaults to as for regression scripts (if you mean the evaluation scripts), there shouldn't be any changes necessary because |
I see. Perhaps change to a comment? Something like: reranker = MonoT5()
# Initializes with defaults, model_name = 'castorini/monot5-base-msmarco', tokenizer_name = 't5-base'
# Same as: MonoT5('castorini/monot5-base-msmarco', 't5-base') README docs should be as simple as possible IMO. re: regression scripts, e.g., https://github.com/castorini/pygaggle/blob/master/pygaggle/run/evaluate_passage_ranker.py#L80 Wouldn't it make sense to use this new abstraction so we don't have multiple code paths (that might subtly diverge down the road...)? And same with MonoBERT? |
agree with changing it to a comment! will make a PR for it as for the regression scripts, I did think of changing those, but wasn't sure how because of Agreed that forcing the manual construction of models/tokenizers can cause divergence issues in the future, but I wouldn't particularly want to add more parameters to |
Follow up to #83, we have:
I think the
model_name
andtokenizer_name
should have defaults? So we can boil down to:Also, the regression scripts should be modified to use this new abstraction?
The text was updated successfully, but these errors were encountered: