Comparative analysis of tick cementome composition

Comparison of tick cementome composition was performed by comparing published sialome and cementome datasets of a variety of species.
Due to the fact that the datasets analysed were originated from various published research, there is a wide range of differences and uneveness in the data. To create some consistency across the numerous datasets, only proteins with an existing Uniprot ID were included in the data analysis process.

Uniprot IDs to FASTA files

A template script was written to search for fasta files matching each Uniprot ID stored within an input file.
Once a sequence for each protein was identified, this would be appended to a new fasta file, which would then be used in Orthofinder.

Identifying proteins in shared orthogroups

Obtain a list of shared orthogroups by removing rows with empty values from the file "Orthogroups.tsv" (generated by Orthofinder). Each row is filled with names of the proteins stored into the orthogroups, separated in columns by species name.

Contribution

All scripts and data wrangling was written and executed by Areda Elezi.
QMUL MSc Bioinformatics 2020/21

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
data		data
renv		renv
.Rprofile		.Rprofile
.gitignore		.gitignore
Comparative-analysis-of-tick-cementome.Rproj		Comparative-analysis-of-tick-cementome.Rproj
README.md		README.md
comparison statsAll.csv		comparison statsAll.csv
microplusUNIPROTlist.py		microplusUNIPROTlist.py
obtain_shared_orthogroups.R		obtain_shared_orthogroups.R
renv.lock		renv.lock
templateFASTAfile.py		templateFASTAfile.py
templateNCBIUniprotFASTAfile.py		templateNCBIUniprotFASTAfile.py
templateNCBIfasta.py		templateNCBIfasta.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Comparative analysis of tick cementome composition

Uniprot IDs to FASTA files

Identifying proteins in shared orthogroups

Contribution

About

Releases

Packages

Languages

aelezi01/Comparative-analysis-of-tick-cementome

Folders and files

Latest commit

History

Repository files navigation

Comparative analysis of tick cementome composition

Uniprot IDs to FASTA files

Identifying proteins in shared orthogroups

Contribution

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages