-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Raw SINE results not found #540
Comments
Hi Shujun, It appears I am having the same issues as well (same errors). It appears as the .fasta.mod.SINE.raw. and.fa directory/files are missing. I have tried this both with the full pipeline and the SINE only pipe. In both cases I cant find the fasta.mod or the fasta.mod.EDTA.raw directory being written to the working directory. I am proceeding to take a divide and rule approach to check the other parts of the pipeline. The LTR discovery has worked (as for #bbista) |
Hi Again, Have tried the ITR module and the same issue appears. See below the error log: Thu Jan 30 02:25:03 UTC 2025 EDTA_raw: Check dependencies, prepare working directories. Thu Jan 30 02:25:36 UTC 2025 Start to find TIR candidates. Thu Jan 30 02:25:36 UTC 2025 Identify TIR candidates from scratch. Species: others Error: Error while loading sequence mv: cannot stat 'NbLx03.shortheader.fasta.mod.TIR.ext30.fa.pass.fa.dusted.cln.cln': No such file or directory Appears writing of the inital raw data files/directory for modules other than LTR are not being written |
Hi, This error still persist. Running directly TIR-Learner.py also gives the same problem. (EDTA) @farm:~$ ./TIR-Learner.py -h Any update would be great. |
I received several of these error reports, but could not reproduce them on my end. Can anyone provide a reproducible case? Thanks! |
Hi Shujun, I have repeatedly had these issues. I am wondering (from your comment), whether the issue may be a bad install. I will clean the current instance off my account and reinstall and get back to you ASAP to see if this is the issue persists. Since you aren't seeing any issues on your end with the current versions I am wondering if this may be the issue. Another thing that pops into mind is whether the raw data needs to be within the EDTA folder. In my case, I call EDTA from my working folder where the raw genome file is housed. All the EDTA folders and associated mod.raw files are generated within the EDTA folder (i.e. where the EDTA.pl files are installed). This doesn't appear to be a problem for LTR-finder but might be causing a problem for the other parts of the pipeline? Regards |
Hi Chris, The issue is with the TIR-Learner not the LTR-finder. Thanks |
Yes you are right - but initiating the SINE/LINE and Helitron pipelines independently via EDTA also causes the same crash (but for each of the appropriate sub-pipelines) - basically as far as I can see the folders and mod.raw files for these individual packages are not being written - the programmes initiate and fail as they cannot find the correct input files and folders |
Another recent issue indicates that you may need to install the cpu version
of tensorflow. Not sure if it’s related.
EDTA does not require input being in the program folder.
You may need to test with different servers/platforms to rule out corner
cases. Are you using conda or apptainer?
Shujun
…On Mon, Feb 10, 2025 at 10:47 PM Chris Winefield ***@***.***> wrote:
Yes you are right - but initiating the SINE/LINE and Helitron pipelines
independently via EDTA also causes the same crash (but for each of the
appropriate sub-pipelines) - basically as far as I can see the folders and
mod.raw files for these individual packages are not being written - the
programmes initiate and fail as they cannot find the correct input files
and folders
—
Reply to this email directly, view it on GitHub
<#540 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABNX4NEP6JRAKYBVFMTFNPD2PFXFJAVCNFSM6AAAAABV6NWCJCVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDMNBZG42DCNZVGY>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
Interesting - I have been using the Conda install rather than the apptainer. I'll ask our HPC bods if we can get an apptainer instance installed. Also I'll see if we are installing the CPU version of tensorflow as you suggest. I think you are correct in that there may be an interaction with the HPC/server setup causing the issues. I'll keep the thread informed as we work through the issues with our setup. Thanks for the help |
Hi Shujun, Do you think it is problem of the TIR-Learner itself? Installed with conda (including the beta version). TIR-Learner -h Traceback (most recent call last): Thanks |
Hi Chris, I download EDTA locally on my comp. I still get the same error so may be not due to the HPC/server setup. EDTA.pl --genome genome.fa --cds genome.cds.fa --curatedlib rice7.0.0.liban --exclude genome.exclude.bed --overwrite 1 --sensitive 1 --anno 1 --threads 10 > EDTA.test Wed Feb 12 16:35:13 EST 2025 EDTA_raw: Check dependencies, prepare working directories. Species: others Error: Error while loading sequence mv: cannot stat 'genome.fa.mod.TIR.ext30.fa.pass.fa.dusted.cln.cln': No such file or directory ERROR: Raw TIR results not found in genome.fa.mod.EDTA.raw/genome.fa.mod.TIR.intact.raw.fa Error: Error while loading sequence mv: cannot stat 'genome.fa.mod.TIR.ext30.fa.pass.fa.dusted.cln.cln': No such file or directory ERROR: Raw TIR results not found in genome.fa.mod.EDTA.raw/genome.fa.mod.TIR.intact.raw.fa |
I will see if I can get the Aptainer install working as this might relate to something that has broken in the the generation of the Conda environment. |
@B10inform The current TIR-Learner recipe uses pytorch-cuda, so if your machine does not have a Nvidia GPU it will run into error. Please try the updated yml file for installation of EDTA, which uses pytorch-cpu. Thanks! |
Hi Shujin, I am still getting the same problem. Species: others conda installation: Linking annosine2-2.0.8-pyh7e72e81_0 git installation. perl ./EDTA/EDTA.pl --check_dependencies Thanks |
Hi Shujin, It is working now. I had to completely(clean) remove conda environment (all traces of the configuration files). Thanks |
Hello,
I was running EDTA on a house fly genome and I get this error. It is a housefly genome.
I installed EDTA using conda.
#########################################################
Extensive de-novo TE Annotator (EDTA) v2.2.2
Shujun Ou ([email protected])
#########################################################
Parameters: --genome /project/meisel/users/bbista/genomes/USDA_David/3MV3/Mdom_3M-v3b_clean.fasta -t 36
Fri Jan 17 11:11:59 AM CST 2025 Dependency checking:
All passed!
Fri Jan 17 11:12:16 AM CST 2025 Obtain raw TE libraries using various structure-based programs:
Fri Jan 17 11:12:16 AM CST 2025 EDTA_raw: Check dependencies, prepare working directories.
Fri Jan 17 11:12:26 AM CST 2025 Start to find LTR candidates.
Fri Jan 17 11:12:26 AM CST 2025 Identify LTR retrotransposon candidates from scratch.
Fri Jan 17 01:00:22 PM CST 2025 Finish finding LTR candidates.
Fri Jan 17 01:00:22 PM CST 2025 Start to find SINE candidates.
cp: cannot stat 'Mdom_3M-v3b_clean.fasta.mod.SINE.raw.fa': No such file or directory
Error: SINE results not found!
ERROR: Raw SINE results not found in Mdom_3M-v3b_clean.fasta.mod.EDTA.raw/Mdom_3M-v3b_clean.fasta.mod.SINE.raw.fa
If you believe the program is working properly, this may be caused by the lack of SINEs in your genome.
slurmstepd: error: Detected 1 oom_kill event in StepId=4398198.batch. Some of the step tasks have been OOM Killed.
The text was updated successfully, but these errors were encountered: