Features/context #109

steffencruz · 2024-02-19T15:59:39Z

Refactors dataset.py into a submodule with each dataset type having its' own file
Adds a base Dataset class which enables context generation via 3 methods
- get (specific context retrieval, fully deterministic)
- search (dataset-specific search algorithm, generally deterministic)
- random (produces a random context, can be seeded)
Removes existing requests based wikipedia dataclass in favor of the wiki python api. Closes Improve wikipedia retrieval mechanism #76
Adds Selector class and variants, which enable customizable selection from sets of link-like objects
Adds Context dataclass which all datasets now return.
- Enforces consistent schema.
- Makes development, logging and testing more robust and efficient.
Adds MaxRetryError exception class for handling api call errors

…apis

…odel aliases

…_is_not_null more explicit

steffencruz · 2024-02-19T19:41:07Z

625 tests passing for each python version :)

steffencruz · 2024-02-20T14:15:18Z

Tracked experiment

p-ferreira · 2024-02-20T17:43:36Z

prompting/tools/datasets/wiki.py

+from .base import Dataset
+from ..selector import Selector
+
+# DO WE NEED INTERNAL LINKS??


I see that you define the page sections as internal_links, I believe that would be more than enough for our use case. Do we still need this comment?

p-ferreira · 2024-02-20T17:53:11Z

prompting/tools/datasets/wiki.py

+            month = self.rng.randint(1, 12)
+
+        max_days = 31 if month in (1, 3, 5, 7, 8, 10, 12) else 30
+        max_days = max_days if month != 2 else 29


It seems that we are not handling possible leap year scenarios, which could break the code with ValueError: day is out of range for month, even though its probability of happening is small

p-ferreira

Formatting is a bit weird (which will be solved once black PR is merged) but overall this PR seems to add a very nice structure around the tasks and rewards organization, not to mention the dataset modularization + extra refactoring.

The only consideration I have is towards date qa not taking leap years in consideration, which could cause an exception but it wouldn't affect the flow that much considering that we have a task creation retry policy in place.

steffencruz added 17 commits February 19, 2024 09:57

Add base dataset

051ace1

Add selector class

94cda65

Add wiki datasets (date and normal)

5d40ff7

Add context class

41f1615

Add mock dataset

481632b

Add code dataset

17873e9

Add math dataset

a6e7b6a

Add init

b2c8425

Remove old monolothic dataset file

3768036

Update submodule init

9fcf494

Refactor QA task to use new context class, and cleanup

ae9769a

Refactor summarization task to use new context class, and cleanup

849ab97

Update base task so that context can be unpacked into state dict

7171a28

Refactor date QA task to use new context class, and cleanup

d5ceec4

Refactor math task to use new context class, and cleanup

9824cb5

Refactor debugging task to use new context class, and cleanup

edb0a39

Add TASKS list in submodule init

b0cc7da

steffencruz changed the base branch from features/crawler to pre-staging February 19, 2024 16:52

steffencruz requested a review from p-ferreira February 19, 2024 16:53

steffencruz marked this pull request as ready for review February 19, 2024 17:00

steffencruz added 10 commits February 19, 2024 11:04

Add MaxRetryError exception class

1f6b2af

Catch MaxRetryError and continue validation

b37b110

Update dependencies: synapse fork of mathegenerator and wiki sections

7682f76

Update fixtures for dataset tests to use updated dataset and context …

bfdf7b3

…apis

Update tests for dataset and context

73ed922

Update tests for tasks

5aa8673

Add pre-staging to workflows

5dca2dd

Fix dataset name typos

e282a27

Import REWARD_MODELS dict from pipeline for global access to reward m…

5543e90

…odel aliases

Import TASKS from tasks submodule

a57a491

steffencruz added 5 commits February 19, 2024 13:30

Remove redundant args

a58c973

Remove redundant args

9694848

Remove redundant args

9229eac

Add more task fields to tests

0dbd3f0

Add tests for reward and penalty definitions and make test_task_field…

77ee3ed

…_is_not_null more explicit

Remove hanging reference to score decay

5112b47

p-ferreira reviewed Feb 20, 2024

View reviewed changes

p-ferreira approved these changes Feb 20, 2024

View reviewed changes

p-ferreira added the v1.1.0 label Feb 20, 2024

p-ferreira mentioned this pull request Feb 20, 2024

Pre staging #112

Merged

steffencruz merged commit 3d7b577 into pre-staging Feb 20, 2024
3 checks passed

steffencruz mentioned this pull request Feb 21, 2024

Staging #110

Merged

steffencruz mentioned this pull request Mar 1, 2024

Improve wikipedia retrieval mechanism #76

Closed

mccrindlebrian deleted the features/context branch April 16, 2024 13:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Features/context #109

Features/context #109

steffencruz commented Feb 19, 2024 •

edited

Loading

steffencruz commented Feb 19, 2024

steffencruz commented Feb 20, 2024

p-ferreira Feb 20, 2024

p-ferreira Feb 20, 2024

p-ferreira left a comment •

edited

Loading

Features/context #109

Features/context #109

Conversation

steffencruz commented Feb 19, 2024 • edited Loading

steffencruz commented Feb 19, 2024

steffencruz commented Feb 20, 2024

p-ferreira Feb 20, 2024

Choose a reason for hiding this comment

p-ferreira Feb 20, 2024

Choose a reason for hiding this comment

p-ferreira left a comment • edited Loading

Choose a reason for hiding this comment

steffencruz commented Feb 19, 2024 •

edited

Loading

p-ferreira left a comment •

edited

Loading