forked from NVIDIA/NeMo-Curator
Commit
Fix bugs in retriever sdg notebook (NVIDIA#522)
* Signed-off by [email protected]
* Signed-off by [email protected]
* fixed qa bug 5008113
* bug fixes for generator
* fixed precommit
* fixed filters
* fixed all issues
* fixed bug with document id
* check if filtering pipeline is present
* fixed notebook
* added functionality to filter pre-generated datasets
* separated generation & filtering pipelines
* fixed pre-commit
* minor changes
* fixed Ryan Wolf's comments
* fixed minor bugs in configs
* removed commented code in main.py
* added CLI flags for generation & filtering removed code duplication
* minor fix to quickstart notebook
* removed filter.py & generate.py

---------

Signed-off-by: viraman <[email protected]>
Signed-off-by: Vinay Raman <[email protected]>
1 parent d8f99f9, commit a46fb87
Showing 9 changed files with 332 additions and 180 deletions.