You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is expected behavior, which was chosen to mimic the samtools faidx behavior of splitting deflines on whitespace. You can specify that the .fai index should not be used as a key name by passing the read_long_names argument:
Since duplicate keys are detected during index reading in pyfaidx, the .fai will appear to contain duplicate sequences (which may be a problem for other tools) but will not contain whitespace, and so I think is "more correct" with respect to samtools behavior. See #111 for more information about how I arrived at this behavior.
The documentation should definitely be clearer about this, as well as other Fasta and Faidx arguments, so feel free to submit a PR if you want to add something to the README.
I was using some relatively simple code I wrote to generate FASTA files containing random sequences:
The file ends up looking like this:
However, I am getting the following error:
It seems that it's only picking up on the
seq
in the description line, not on the identifier.The text was updated successfully, but these errors were encountered: