Add comet #107

devrimcavusoglu · 2022-03-11T17:34:15Z

Closes #99.

- New Task CrossLingualEvaluation is defined. - requirements.txt and setup.py updated accordingly.

devrimcavusoglu · 2022-03-14T10:19:07Z

Currently CI is failing and one potential reason to that is not disabling the multiprocessing while scoring. There is a bug in the comet implementation, see this issue.

…ix for num_workers).

devrimcavusoglu · 2022-03-15T10:13:26Z

Any ideas for CI failing @Sophylax ? Some notes below:

Test preserves around 4.5 GB memory, this is a similar case with Prism
The PR addressing bug fix for the issue on unbabel/COMET is merged. Still with num_workers=0 it is failing.

Any similar situations experienced with your tests @ricardorei ?

ricardorei · 2022-03-15T10:45:19Z

Hi @devrimcavusoglu thanks for tagging me on this. Let's see if I can help you

I had some problems building the CI for COMET. My solution is to create tests using lightweight encoder models.

For example [wmt21-cometinho-da](https://github.com/Unbabel/COMET/blob/master/METRICS.md)?

ricardorei · 2022-03-15T10:45:48Z

which version of COMET are you currently using?

devrimcavusoglu · 2022-03-15T11:21:17Z

Thank you for the quick response @ricardorei . I missed the Metrics doc at first sight and only glanced to README, and didn't think there was a lighter model for COMET. Currently with your suggestion the CI problem is solved, thank you very much. One suggesstion from me could be to add/link METRICS.md in README.md.

ricardorei · 2022-03-15T11:27:58Z

Nice! I am happy to help, tag me if you need something else.

I'll do that. Actually, I need to update all the documentation, it is extremely outdated...

PS: cool repo, I think it's important to have tools to evaluate several metrics and stop people from reporting just BLEU :)

devrimcavusoglu · 2022-03-15T12:06:17Z

Thanks @ricardorei, appreciate it. Thanks for the comment about the repo as well, yes one of the milestone is that supplying a diverse set of metrics to people to make the evaluation in NLP more properly :).

# Conflicts: # jury/metrics/__init__.py

devrimcavusoglu added 2 commits March 11, 2022 18:56

Add COMET metric.

b54cc2b

- New Task CrossLingualEvaluation is defined. - requirements.txt and setup.py updated accordingly.

Multiple ref & multiple pred cases implemented.

304a68d

devrimcavusoglu requested review from Sophylax and cemilcengiz March 11, 2022 17:34

devrimcavusoglu added 3 commits March 11, 2022 20:40

Removed unused imports and redundant leftover code segments.

1c971a5

num_workers & batch_size is explicitly set for comet.

b09ccf5

Code formatting.

54e338d

Setup.py updated for unbabel-comet (from spesific commit addressing f…

7d2d393

…ix for num_workers).

Changed Comet model for tests as light model 'wmt21-cometinho-da'.

65d4aab

devrimcavusoglu added 2 commits March 15, 2022 14:35

2nd jury prism is removed from Prism tests.

373378f

Removed unused imports.

5d3c137

devrimcavusoglu and others added 4 commits March 15, 2022 17:02

Merge branch 'main' into add-comet

44c04fd

Merge branch 'main' into add-comet

072812f

# Conflicts: # jury/metrics/__init__.py

Merge remote-tracking branch 'origin/add-comet' into add-comet

cf95b30

Updated setup.py for unbabel-comet package as git+https (from git+git).

06df559

devrimcavusoglu merged commit 939c1b6 into main Mar 21, 2022

devrimcavusoglu deleted the add-comet branch March 21, 2022 07:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add comet #107

Add comet #107

devrimcavusoglu commented Mar 11, 2022

devrimcavusoglu commented Mar 14, 2022

devrimcavusoglu commented Mar 15, 2022

ricardorei commented Mar 15, 2022 •

edited

Loading

ricardorei commented Mar 15, 2022 •

edited

Loading

devrimcavusoglu commented Mar 15, 2022

ricardorei commented Mar 15, 2022 •

edited

Loading

devrimcavusoglu commented Mar 15, 2022

Add comet #107

Add comet #107

Conversation

devrimcavusoglu commented Mar 11, 2022

devrimcavusoglu commented Mar 14, 2022

devrimcavusoglu commented Mar 15, 2022

ricardorei commented Mar 15, 2022 • edited Loading

ricardorei commented Mar 15, 2022 • edited Loading

devrimcavusoglu commented Mar 15, 2022

ricardorei commented Mar 15, 2022 • edited Loading

devrimcavusoglu commented Mar 15, 2022

ricardorei commented Mar 15, 2022 •

edited

Loading

ricardorei commented Mar 15, 2022 •

edited

Loading

ricardorei commented Mar 15, 2022 •

edited

Loading