diff --git a/scripts/tokenizers/README.md b/scripts/tokenizers/README.md index 7985eadc8f..bbcac18995 100644 --- a/scripts/tokenizers/README.md +++ b/scripts/tokenizers/README.md @@ -1,6 +1,7 @@ # Training WordPiece Vocabularies on Wikipedia This is unmaintained helper code for training the vocabularies on Wikipedia. +It is advised to run these scripts on GCS. ### Screens Use screens to continue the download even when the terminal is not open!