-
-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error when creating embeddings - HEAD request to S3 bucket returns 404 #1831
Comments
Seeing the same behaviour for the Flair forward and backward news embeddings ('news-forward' and 'news-backward'):
|
Hm, it looks like the AWS bucket is down. I have to check what's going on! |
Thanks @alanakbik... I think we are all blocked by this, those of us who are using this on different machines / CI where we need this stuff to work. Your help is much appreciated!! |
Quick update: Unfortunately the entire AWS account was deleted yesterday by an internal process since I no longer work at Zalando (I'm now at university). That means that most models are currently not accessible. To fix this, I have to set up another file hosting solution and do a hotfix of Flair. If anybody knows a good way to host a large amount of large files (with high amounts of download traffic by a big community of users), please let me know! |
S3 is still your best bet. |
Is it resolved? We are blocked with this. |
@alanakbik is this repo still being supported by Zalando – financially or otherwise? My startup uses this package we could see if we can help with the hosting. |
Folks, I just nabbed the |
There is currently no financial support - the project is maintained only through code contributions by open source community and members of my group. We're now tentatively thinking of setting up a system for donations to cover costs (and maybe even to hire people to maintain the code). |
If a developer on my team ran our app locally <20 hours ago, will the files from Thanks for any guidance! |
Folks, to clarify #1831 (comment): I've reserved the bucket name but I do not have the files. The emails are flooding in, and I can't help you. |
Yes all files are cached in the |
Quick update: we are working on a fix. Will keep you posted. |
What about Zenodo as hoster? You also get a DOI for each model which would make it easier to cite models in papers. You can download models from Zenodo e.g. like this: import requests
import wget
def download(doi, filepath):
url = f"https://doi.org/{doi}"
r = requests.get(url)
record = r.url.split("/")[-1].strip()
url = f"https://zenodo.org/api/records/{record}"
r = requests.get(url)
if r.ok:
print("Downloading model from Zenodo...")
print(f"Target directory: {filepath}")
response = r.json()
files = response["files"]
total = sum(file["size"] for file in files)
for file in files:
link = file["links"]["self"]
size = file["size"] / 2 ** 20
print(f"Total size: {size:.1f} MB")
fname = file["key"]
checksum = file["checksum"]
filename = wget.download(link, filepath)
return filename
else:
raise Excpetion("Unable to download model from Zenodo.") |
i need en-ner-conll03-v0.4.pt file . its stored in .flair/ directory default . |
What's the approximate costs / month or year for hosting? We and a few others rely heavily on this project so can discuss with our investors make a meaningful donation. |
@RXminuS that would be great and very much appredicated! I am currently looking into options for hosting and donation models! @severinsimmler thanks for the pointer to Zenodo. Looks interesting - do you have experience wrt download speeds? Some of the models are pretty big and there is a good amount of traffic. As a first fix, I've moved all models to our university server. Download speeds are slower than before and I worry that the server will have problems with the traffic (so please don't all try at the same time ;)), but at least everything should run again. Will be merged and released soon. |
Thank you @alanakbik so how to use it. as i am getting the below error OSError Traceback (most recent call last) 2 frames OSError: HEAD request failed for url https://s3.eu-central-1.amazonaws.com/alan-nlp/resources/embeddings-v0.4/ar-wiki-fasttext-300d-1M.vectors.npy with status code 301. |
The fix is merged to master and can be used by installing Flair with:
I'll also push this to pip later, but first let's test if this works. |
Thanks a bunch, @alanakbik. My stack uses Flair 0.4.2. How do I install that specific version? |
@Masum06 I'm afraid the model download in older versions will remain broken. If updating is not possible you could use the new version to download the models manually and then run the old version. |
Just wanted to say fantastic work @alanakbik. Not just for the library but keeping your cool during a situation like this. And your dedication to finding a way around so quickly. I have the deepest respect for open-source maintainers like yourself 🙇♂️🙇♂️🙇♂️ |
@alanakbik You are the best! Thank you very much! |
@RXminuS @VigneshBaskar Thanks a lot! We just pushed the new version to pip so you can do a regular update with:
|
Thank you so much for your reactivity @alanakbik ! It all works fine for us. |
@alanakbik |
@djstrong can you try again? Should work now |
Thank you. It works. |
Hello, I also ran the upgrade flair command that was mentioned in earlier posts. code to reproduce the errorfrom flair.embeddings import WordEmbeddings, FlairEmbeddings |
@truptikirve26 - I was able to solve the issue with v0.61. Looks like the older version is still cached in your IDE/venv. |
Hello, I have a problem loading Word/FlairEmbeddings for English and German languages located at the urls: "https://s3.eu-central-1.amazonaws.com/alan-nlp/resources/embeddings/xxxxxxxxx.pt".
When following Tutorial 3, trying to create these embeddings gives the following error:
OSError: HEAD request failed for url https://s3.eu-central-1.amazonaws.com/alan-nlp/resources/embeddings/glove.gensim.vectors.npy with status code 404
Making a simple HEAD request to that url outside of flair returns 404, so it looks like the embeddings are not located there anymore ?
To Reproduce
from flair.embeddings import WordEmbeddings
glove_embedding = WordEmbeddings('glove')
I would appreciate your help,
Thank you !
The text was updated successfully, but these errors were encountered: