Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unclear about raw and TE fasta files #536

Open
katieemelianova opened this issue Jan 10, 2025 · 1 comment
Open

Unclear about raw and TE fasta files #536

katieemelianova opened this issue Jan 10, 2025 · 1 comment
Labels
question Further information is requested

Comments

@katieemelianova
Copy link

Hi there,

I wanted to clarify the status of an output file in the .final folder of EDTA, I have a file titled

species.fasta.mod.EDTA.final/species.fasta.mod.EDTA.raw.fa

From the schematic in Figure 1. of "A Tutorial of EDTA: Extensive De Novo TE Annotator" it seems as though raw files have not gone through CDS filtering, however this file is in the .final folder - is this just a copy of the output from the .raw folder? My aim is to get a fasta file of intact, full length and fragmented TEs which have gone through EDTA and CDS filters. I thought perhaps that the file that I would need might be:

species.fasta.mod.EDTA.anno/species.fasta.mod.EDTA.TE.fa

But I could not yet figure out what this file is. Could you clarify what these two files are please? Any advice would be greatly appreciated :)

Best,

Katie

@oushujun
Copy link
Owner

Hi Katie,

You are correct, the raw.fa file has been copied to different folders. You should use the final TEanno.gff3 file to extract intact and fragmented TEs.

THanks!
Shujun

@oushujun oushujun added the question Further information is requested label Feb 13, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants