318 Sites only filtered vcf then annotate wdl #7305

Merged
73 commits
2b7fef2
init wdl to make sites only vcfs from extracted cohort vcfs
RoriCremer Jun 10, 2021
41d3ef8
init schema
RoriCremer Jun 10, 2021
4d07f83
just get the wdl working
RoriCremer Jun 10, 2021
2753382
update schema to have all NIRVANA selected annotations
RoriCremer Jun 11, 2021
ca058c5
add tarball of sources to example input
RoriCremer Jun 11, 2021
4ec5e2b
add annotations
RoriCremer Jun 12, 2021
7444ae6
update the schema
RoriCremer Jun 16, 2021
11766e1
clean up test files
RoriCremer Jun 16, 2021
fd9b3ce
add project id to inputs
RoriCremer Jun 16, 2021
c3c46a6
add jq and query
RoriCremer Jun 16, 2021
eef77b5
prob not to keep--maybe for testing
RoriCremer Jun 16, 2021
b6df3f9
fixup on schema
RoriCremer Jun 16, 2021
58676c8
add dataset
RoriCremer Jun 16, 2021
7ee16b7
add dataset param
RoriCremer Jun 16, 2021
ee59062
short term fix
RoriCremer Jun 17, 2021
bbcf684
update for nirvana results?
RoriCremer Jun 22, 2021
5b510f7
remove test files
RoriCremer Jun 23, 2021
39df108
schema update
RoriCremer Jun 23, 2021
47cf857
omg so many vcfs!
RoriCremer Jun 23, 2021
efb488a
genes schema for a join later on
RoriCremer Jun 23, 2021
7de9531
this vat schema works for the csv upload
RoriCremer Jun 23, 2021
aca070d
schema update
RoriCremer Jun 23, 2021
b350152
genes table schema
RoriCremer Jun 23, 2021
b9ce0cd
more wdl drama
RoriCremer Jun 23, 2021
b73b42a
this needs to be json-ified
RoriCremer Jun 23, 2021
ad7a78d
wdl idea
RoriCremer Jun 24, 2021
6aa56f0
update vat schema for json upload
RoriCremer Jun 24, 2021
15eabc6
genes json is right, positions json needs help
RoriCremer Jun 24, 2021
7e69606
updating the query in the WDL
RoriCremer Jun 24, 2021
d31678e
clean up python script
RoriCremer Jun 24, 2021
47f3b30
python cleanup
RoriCremer Jun 25, 2021
2294c52
more vars for the wdl
RoriCremer Jun 29, 2021
68846ae
still need to get the right array info
RoriCremer Jun 29, 2021
876716e
vat should be all the cols from Lee
RoriCremer Jun 29, 2021
deb9c3b
add initial validation to wdl
RoriCremer Jun 29, 2021
cd2eb2f
testing ideas
kcibul Jun 29, 2021
5ced764
add new python script to dockerfile
RoriCremer Jun 29, 2021
4a1e79f
spit on compressed json files
RoriCremer Jun 29, 2021
57fc083
add my wdl to dockstore!?!!?
RoriCremer Jun 29, 2021
bb5374e
take into account multiple phenotypes
RoriCremer Jun 30, 2021
660119a
add branch to dockstore
RoriCremer Jun 30, 2021
df8bcda
update dockstore again
RoriCremer Jun 30, 2021
8e15114
cleanup
RoriCremer Jun 30, 2021
f30cc7e
more schema updates
RoriCremer Jul 1, 2021
46441e0
update python script
RoriCremer Jul 1, 2021
dddfc85
update wdl
RoriCremer Jul 1, 2021
9014a1e
add loading data back
RoriCremer Jul 2, 2021
d965f3f
typo in clinvar_id
RoriCremer Jul 2, 2021
ff4d123
add updated docker
RoriCremer Jul 2, 2021
d8a8020
add done flags
RoriCremer Jul 3, 2021
cdedc8a
update vat schema
RoriCremer Jul 3, 2021
898e756
eventually this will need to be all defaults
RoriCremer Jul 6, 2021
a2b4b98
add a table identifier
RoriCremer Jul 6, 2021
dc64fd9
fix python typo
RoriCremer Jul 6, 2021
73d8e5e
update python docker
RoriCremer Jul 6, 2021
54f7412
table id
RoriCremer Jul 6, 2021
0b2eab4
lowercase clinvar
RoriCremer Jul 6, 2021
ea8b1e6
update docker
RoriCremer Jul 6, 2021
01578c3
typo
RoriCremer Jul 6, 2021
caef686
update wdl
RoriCremer Jul 6, 2021
2bbb869
more validation
RoriCremer Jul 8, 2021
efe7267
fix python typo
RoriCremer Jul 8, 2021
049d586
update the query to avoid duplicate genes
RoriCremer Jul 8, 2021
4482863
clean up validation queries
RoriCremer Jul 8, 2021
0056648
update docker image with omim friendly python
RoriCremer Jul 9, 2021
63018ae
remove cols from python that we are pushing to august p0
RoriCremer Jul 9, 2021
c129dcb
python now omim multi value friendly
RoriCremer Jul 9, 2021
eb35986
VAT changes for alpha1 (#7345)
kcibul Jul 10, 2021
816d363
optimize wdl
RoriCremer Jul 12, 2021
56eaa65
pr review w bec
RoriCremer Jul 13, 2021
4656c46
pr review part 2
RoriCremer Jul 14, 2021
6828829
typo
RoriCremer Jul 14, 2021
83a5858
smoketest tune up
RoriCremer Jul 15, 2021
8 changes: 8 additions & 0 deletions .dockstore.yml
@@ -93,6 +93,14 @@ workflows:
      branches:
        - master
        - ah_var_store
  - name: GvsSitesOnlyVCF
    subclass: WDL
    primaryDescriptorPath: /scripts/variantstore/wdl/GvsSitesOnlyVCF.wdl
    testParameterFiles:
      - /scripts/variantstore/wdl/GvsSitesOnlyVCF.example.inputs.json
    filters:
      branches:
        - ah_var_store
  - name: MitochondriaPipeline
    subclass: WDL
    primaryDescriptorPath: /scripts/mitochondria_m2_wdl/MitochondriaPipeline.wdl
14 changes: 14 additions & 0 deletions scripts/variantstore/wdl/GvsSitesOnlyVCF.example.inputs.json
@@ -0,0 +1,14 @@
{
  "GvsSitesOnlyVCF.gvs_extract_cohort_filtered_vcfs": ["gs://broad-dsp-spec-ops/kcibul/ft/gvs.chr20.vcf.gz", "gs://broad-dsp-spec-ops/scratch/rcremer/Nirvana/Data/gvs_features_0.vcf.gz"],
  "GvsSitesOnlyVCF.gvs_extract_cohort_filtered_vcf_indices": ["gs://broad-dsp-spec-ops/kcibul/ft/gvs.chr20.vcf.gz.tbi", "gs://broad-dsp-spec-ops/scratch/rcremer/Nirvana/Data/gvs_features_0.vcf.gz.tbi"],
  "GvsSitesOnlyVCF.output_sites_only_file_name": "hello_did_I_sites_only",
  "GvsSitesOnlyVCF.output_annotated_file_name": "hello_did_I_annotate",
  "GvsSitesOnlyVCF.nirvana_data_directory": "gs://broad-dsp-spec-ops/scratch/rcremer/Nirvana/NirvanaData.tar.gz",
  "GvsSitesOnlyVCF.vat_schema_json_file": "gs://broad-dsp-spec-ops/scratch/rcremer/Nirvana/schemas/vat_schema.json",
  "GvsSitesOnlyVCF.variant_transcript_schema_json_file": "gs://broad-dsp-spec-ops/scratch/rcremer/Nirvana/schemas/vt_schema.json",
  "GvsSitesOnlyVCF.genes_schema_json_file": "gs://broad-dsp-spec-ops/scratch/rcremer/Nirvana/schemas/genes_schema.json",
  "GvsSitesOnlyVCF.output_path": "gs://broad-dsp-spec-ops/scratch/rcremer/Nirvana/output/jul13/",
  "GvsSitesOnlyVCF.table_suffix": "jul13",
  "GvsSitesOnlyVCF.project_id": "spec-ops-aou",
  "GvsSitesOnlyVCF.dataset_name": "anvil_100_for_testing"
}
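
The example inputs above pass the VCFs and their indices as two parallel arrays. A minimal sketch of a pre-submission sanity check follows. It assumes (this is inferred from the example file, not stated by the workflow) that entry i of the index array belongs to entry i of the VCF array and is named by appending `.tbi`. The helper name `check_vcf_index_pairing` is hypothetical and is not part of the PR.

```python
import json

# Workflow name used as the key prefix in the example inputs file.
PREFIX = "GvsSitesOnlyVCF."


def check_vcf_index_pairing(inputs, prefix=PREFIX):
    """Return a list of problems; an empty list means the two arrays line up.

    Assumption: index i pairs with VCF i and is named "<vcf path>.tbi".
    """
    vcfs = inputs.get(prefix + "gvs_extract_cohort_filtered_vcfs", [])
    indices = inputs.get(prefix + "gvs_extract_cohort_filtered_vcf_indices", [])

    problems = []
    if len(vcfs) != len(indices):
        problems.append(f"{len(vcfs)} VCFs but {len(indices)} indices")

    # zip stops at the shorter array, so a length mismatch is reported only once above.
    for vcf, index in zip(vcfs, indices):
        if index != vcf + ".tbi":
            problems.append(f"index {index} does not match VCF {vcf}")
    return problems


# Trimmed copy of the two array inputs from the example file.
example = json.loads("""
{
  "GvsSitesOnlyVCF.gvs_extract_cohort_filtered_vcfs": [
    "gs://broad-dsp-spec-ops/kcibul/ft/gvs.chr20.vcf.gz",
    "gs://broad-dsp-spec-ops/scratch/rcremer/Nirvana/Data/gvs_features_0.vcf.gz"
  ],
  "GvsSitesOnlyVCF.gvs_extract_cohort_filtered_vcf_indices": [
    "gs://broad-dsp-spec-ops/kcibul/ft/gvs.chr20.vcf.gz.tbi",
    "gs://broad-dsp-spec-ops/scratch/rcremer/Nirvana/Data/gvs_features_0.vcf.gz.tbi"
  ]
}
""")

print(check_vcf_index_pairing(example))
```

The check only compares path strings; it does not confirm that the objects exist in the bucket or that an index was actually built from its VCF.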