You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you very much for developing this tool.
I had been using it for some time when I realised that the number of reads that the tool is generating does not correspond to the number of reads I specify with n_reads.
I did realised about this issue using some complex designs, but I still see the same behaviour in the simpler cases.
For instance, let's use this reference genome as an example: https://www.ncbi.nlm.nih.gov/assembly/GCF_005845145.1
Here we have multiple contigs, so in principle I would like to use the "--draft" command, but we can also assume that each of the contigs is in fact a complete assembly, and provide the "--genome" command. The result is the same.
This is an example in which there were not that many reads missing, but I have run this over multiple files and I find a bigger percentage of missing reads.
Any clue of what is going on? Also tried with --gc_bias and there is no difference.
Here there is the list of installed python packages, which should be the only dependence. Python itself is v3.7.7
I haven't been able to reproduce the bug yet. Is the number of reads you get wrong every time?
Can you re-run your example with the --debug flag and post the output?
EDIT: This is the code I ran:
iss generate --n_reads 1000 --draft Bacteroides_ovatus.fa --model hiseq --output TEST_sim_R1.fastq --seed 10 --cpus 1 --debug 2> DEBUG_info.txt
Dear Hadrien,
Thank you very much for developing this tool.
I had been using it for some time when I realised that the number of reads that the tool is generating does not correspond to the number of reads I specify with n_reads.
I did realised about this issue using some complex designs, but I still see the same behaviour in the simpler cases.
For instance, let's use this reference genome as an example: https://www.ncbi.nlm.nih.gov/assembly/GCF_005845145.1
Here we have multiple contigs, so in principle I would like to use the "--draft" command, but we can also assume that each of the contigs is in fact a complete assembly, and provide the "--genome" command. The result is the same.
Here there are some examples:
Using the bioconda installation (v1.5.1)
(assuming that the 1,000 reads are divided between the two files, I would expect to get 2,000 lines)
If I repeat the process with the --genome command:
I wondered if it was due to the conda version, so I tried directly with the Git repository.
This is an example in which there were not that many reads missing, but I have run this over multiple files and I find a bigger percentage of missing reads.
Any clue of what is going on? Also tried with --gc_bias and there is no difference.
Here there is the list of installed python packages, which should be the only dependence. Python itself is v3.7.7
pip list:
Thank you very much for your help,
Cheers,
Sergio.
The text was updated successfully, but these errors were encountered: