-
Notifications
You must be signed in to change notification settings - Fork 811
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
wikitext-2 is not available anymore #2247
Comments
Is there an alternate link we can get? The documentation here says:
... can we just find an alternate URL and change the function? |
Hi team, about this error, is there any solution now? We also encountered the same error. |
how can we change the URL |
I uploaded the wikitext-2-v1.zip file to my server and changed the source code lines in the URL = "https://s3.amazonaws.com/research.metamind.io/wikitext/wikitext-2-v1.zip"
MD5 = "542ccefacc6c27f945fb54453812b3cd" to URL = "http://la.ihainan.me/wikitext-2-v1.zip"
MD5 = "f6e734fc17885b364243f67b30385a3d" to temporarily solve this issue. |
🐛 Bug
Describe the bug
requests.exceptions.HTTPError: 403 Client Error: Forbidden for url: https://s3.amazonaws.com/research.metamind.io/wikitext/wikitext-2-v1.zip
This exception is thrown by iter of HTTPReaderIterDataPipe(skip_on_error=False, source_datapipe=OnDiskCacheHolderIterDataPipe, timeout=None)
To Reproduce Steps to reproduce the behavior:
from torchtext.datasets import WikiText2
from torchtext.data.utils import get_tokenizer
from torchtext.vocab import build_vocab_from_iterator
from torch.utils.data import DataLoader, Dataset
tokenizer = get_tokenizer("basic_english")
train_iter = WikiText2(split='train')
valid_iter = WikiText2(split='valid')
def yield_tokens(data_iter):
for item in data_iter:
yield tokenizer(item)
vocab = build_vocab_from_iterator(yield_tokens(train_iter),
specials=["", "", ""])
vocab.set_default_index(vocab[""])
Expected behavior A clear and concise description of what you expected to happen.
Screenshots If applicable, add screenshots to help explain your problem.
Environment
Please copy and paste the output from our
environment collection script (or
fill out the checklist below manually).
You can get the script and run it with:
conda
,pip
, source):Additional context Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: