MonoBERT and MonoT5 defaults #86

lintool · 2020-09-13T11:09:30Z

Follow up to #83, we have:

from pygaggle.rerank.base import Query, Text
from pygaggle.rerank.transformer import MonoT5

model_name = 'castorini/monot5-base-msmarco'
tokenizer_name = 't5-base'
reranker =  MonoT5(model_name, tokenizer_name)

I think the model_name and tokenizer_name should have defaults? So we can boil down to:

from pygaggle.rerank.base import Query, Text
from pygaggle.rerank.transformer import MonoT5

reranker =  MonoT5()

Also, the regression scripts should be modified to use this new abstraction?

The text was updated successfully, but these errors were encountered:

yuxuan-ji · 2020-09-13T16:32:42Z

@lintool it already defaults to 'castorini/monot5-base-msmarco' and 't5-base'! I included it in the example because I thought it'd be helpful to show the user that they can easily change model/tokenizer using pretrained stuff from huggingface., but maybe it's confusing?

as for regression scripts (if you mean the evaluation scripts), there shouldn't be any changes necessary because monoBERT/T5 also accepts instances of PretrainedModel or PretrainedTokenizer

lintool · 2020-09-13T16:55:18Z

I see. Perhaps change to a comment?

Something like:

reranker =  MonoT5()
# Initializes with defaults, model_name = 'castorini/monot5-base-msmarco', tokenizer_name = 't5-base'
# Same as: MonoT5('castorini/monot5-base-msmarco', 't5-base')

README docs should be as simple as possible IMO.

re: regression scripts, e.g., https://github.com/castorini/pygaggle/blob/master/pygaggle/run/evaluate_passage_ranker.py#L80

Wouldn't it make sense to use this new abstraction so we don't have multiple code paths (that might subtly diverge down the road...)? And same with MonoBERT?

yuxuan-ji · 2020-09-13T21:58:04Z

agree with changing it to a comment! will make a PR for it

as for the regression scripts, I did think of changing those, but wasn't sure how because of PassageRankingEvaluationOptions, since values like options.device, options.from_tf, options.batch_size don't fit in nicely in the current abstraction.

Agreed that forcing the manual construction of models/tokenizers can cause divergence issues in the future, but I wouldn't particularly want to add more parameters to MonoBERT/MonoT5's __init__ method for these options as it'd be imo an anti-pattern, maybe two separate model_options: dict/ tokenizer_options: dict parameters? or is that too dirty?

yuxuan-ji mentioned this issue Sep 17, 2020

Add constructor functions for model and tokenizer of MonoBERT/T5 #93

Merged

ronakice closed this as completed in #93 Oct 23, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MonoBERT and MonoT5 defaults #86

MonoBERT and MonoT5 defaults #86

lintool commented Sep 13, 2020

yuxuan-ji commented Sep 13, 2020 •

edited

Loading

lintool commented Sep 13, 2020

yuxuan-ji commented Sep 13, 2020 •

edited

Loading

MonoBERT and MonoT5 defaults #86

MonoBERT and MonoT5 defaults #86

Comments

lintool commented Sep 13, 2020

yuxuan-ji commented Sep 13, 2020 • edited Loading

lintool commented Sep 13, 2020

yuxuan-ji commented Sep 13, 2020 • edited Loading

yuxuan-ji commented Sep 13, 2020 •

edited

Loading

yuxuan-ji commented Sep 13, 2020 •

edited

Loading