UnicodeDecodeError: Invalid continuation byte, when running pytest on new dataset created with tfds new on Windows #57
Comments
Looked up how to diff two files in Windows, got this:
foo is the file that causes the crashes; new is the one that does not. It looks like my Windows setup, or my version of epath, can't handle some of the symbols, whereas VS Code (or whatever editor) will just change them when resaving.
Tried on an Ubuntu machine: no issue at all. Runs fine.
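To narrow down which symbols are the problem, the raw bytes of the file can be scanned for positions where strict UTF-8 decoding fails. This is a minimal sketch, not anything taken from tfds or epath: the helper name `find_invalid_utf8` is hypothetical, and it assumes the file is small enough to read into memory.

```python
def find_invalid_utf8(data: bytes) -> list[tuple[int, str]]:
    """Return (byte offset, reason) for every spot where strict UTF-8 decoding fails.

    Hypothetical helper for illustration; it re-slices the buffer after each
    failure, which is fine for small text files but slow for large ones.
    """
    problems = []
    pos = 0
    while pos < len(data):
        try:
            data[pos:].decode("utf-8")
            break  # the remainder decodes cleanly
        except UnicodeDecodeError as err:
            # err.start is relative to the slice, so shift it by pos
            problems.append((pos + err.start, err.reason))
            pos += err.start + 1  # skip the offending byte and keep scanning
    return problems


# Example: 0xE9 is "é" in a single-byte Windows code page, but in UTF-8 it
# starts a multi-byte sequence, so the following space is rejected.
print(find_invalid_utf8(b"caf\xe9 ok"))
```

Reading the file with `open(path, "rb").read()` and passing the result to this function would show the byte offsets to inspect, without depending on the platform's default text encoding.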
Followed steps from #56 to get pytest running, and then used instructions from https://tensorflow.google.cn/datasets/add_dataset?hl=en#test_your_dataset to create a new dataset.
Then I get errors like this:
with tracebacks going to abstract_path.py.
I am on Windows, and this may be a Windows-only issue, because when I follow the same steps on Colab the error does not occur: https://colab.research.google.com/drive/1X9sem_qFHNHgpRl-IqkHN0Mft8CBCp_O?usp=sharing
I went to abstract_path.py and manually edited it to dump to a .txt file
original:
edited:
output:
This led me to finally realize that what it actually wanted me to do, I think, was remove invalid tags.
So I opened Tags.txt to have a look, closed it, and then ran pytest again... and got a new error:
Apparently opening and closing Tags.txt made the original error go away? My theory is that it has something to do with the formatting of the .txt file on Windows.
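One way to test that theory without relying on an editor is to re-encode the file explicitly. The sketch below is only an illustration under assumptions: the issue does not say what encoding Tags.txt was actually saved in, so `cp1252` (a common Windows code page) is a guess, and `reencode_to_utf8` is a hypothetical helper name.

```python
def reencode_to_utf8(raw: bytes, assumed_encoding: str = "cp1252") -> bytes:
    """Return raw as UTF-8 bytes.

    If raw is already valid UTF-8 it is returned unchanged. Otherwise it is
    decoded with assumed_encoding (an assumption, not confirmed by the issue)
    and re-encoded as UTF-8. Bytes undefined in the assumed encoding will
    still raise UnicodeDecodeError.
    """
    try:
        raw.decode("utf-8")
        return raw
    except UnicodeDecodeError:
        return raw.decode(assumed_encoding).encode("utf-8")


# Example: a cp1252-encoded "café" becomes valid UTF-8.
fixed = reencode_to_utf8(b"caf\xe9")
print(fixed.decode("utf-8"))

# Applying it to a file on disk might look like:
#   from pathlib import Path
#   p = Path("Tags.txt")
#   p.write_bytes(reencode_to_utf8(p.read_bytes()))
```

If pytest stops raising UnicodeDecodeError after this, that would support the idea that the editor's resave changed the file's encoding rather than its contents.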