How to speed up for big genome? #38

Closed
zyworship opened this issue Jan 4, 2020 · 1 comment
Labels
question Further information is requested

Comments

@zyworship
Dear Shujun,
I have set up the environment for running EDTA.
I work on amphibian genomes, which are larger than those of most other animals. I have been running EDTA_raw.pl for the TIR, LTR, and Helitron steps, with this command: EDTA_raw.pl -genome frog1_genome.chromosome.fa -type tir -threads 16. It has been running for 48 hours and has not finished yet.
Is there any way to speed this up for big genomes?

Thank you for your attention and reply.

Zhangyi

@oushujun
Owner

oushujun commented Jan 4, 2020

Dear @zyworship,

48 hours is a VERY short time for a big genome. You may want to give it a week or more. Think about how long gene prediction may take on your genome; EDTA is no simpler or faster at the moment. Splitting the run with EDTA_raw.pl, as you are doing, is the recommended way to speed up EDTA. While waiting for TIR to finish, you can also run -type ltr and -type helitron at the same time, in the same folder, as separate jobs.

Best,
Shujun
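
The suggestion above, running the three -type values as separate jobs in the same folder, could be sketched roughly as follows. This is only an illustration, not part of EDTA: the variable names, the log file names, and the DRY_RUN switch are hypothetical, and the flag spelling simply follows the command in the question (check the help output of your EDTA version, since the option style may differ). On a cluster you would more likely submit each command as its own scheduler job.

```shell
#!/bin/sh
# Sketch: start the LTR, TIR and Helitron scans of EDTA_raw.pl as separate jobs.
# All variable names below are illustrative assumptions, not EDTA settings.
GENOME="${GENOME:-frog1_genome.chromosome.fa}"   # genome file from the question
THREADS="${THREADS:-16}"                         # threads given to each job
EDTA_RAW="${EDTA_RAW:-EDTA_raw.pl}"              # assumed to be on PATH; adjust as needed
DRY_RUN="${DRY_RUN:-1}"                          # 1 = only print the commands

run_type() {
    # Build the command for one structural type (ltr, tir or helitron).
    cmd="$EDTA_RAW -genome $GENOME -type $1 -threads $THREADS"
    if [ "$DRY_RUN" = "1" ]; then
        echo "$cmd"
    else
        # Run in the background with one log per type.
        # $cmd is left unquoted on purpose, so paths must not contain spaces.
        nohup $cmd > "edta_raw_$1.log" 2>&1 &
    fi
}

for type in ltr tir helitron; do
    run_type "$type"
done

# When jobs were really started, block until all of them have finished.
[ "$DRY_RUN" = "1" ] || wait
```

With the default DRY_RUN=1 the script only prints the three commands so they can be checked first; setting DRY_RUN=0 starts them. Three jobs at 16 threads each need about 48 cores in total, so THREADS may need to be lowered on a smaller machine.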

@oushujun oushujun added the question Further information is requested label Jan 4, 2020