-
Notifications
You must be signed in to change notification settings - Fork 596
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
VS-694 - Extract Callset for VQSR Lite #8182
Merged
Merged
Changes from 13 commits
Commits
Show all changes
18 commits
Select commit
Hold shift + click to select a range
9035af3
Initial work on GvsExtractCallset.wdl
gbggrant 7f5cdce
Update dockers
gbggrant a651ed7
Fix refactoring error
gbggrant 5ca681a
empty commit
gbggrant 1a6daf5
Fix the branch name in .dockstore.yml
gbggrant ff2dfbf
Name vqsr lite version of filter_set_info table consistently between …
gbggrant 38fb966
Clean up
gbggrant 3db5aaf
Fixed boolean usage in bash?
gbggrant df96761
Updated docker
gbggrant 680020d
CalibrationSensitivity -> Sensitivity in ExtractCohortLite
gbggrant c55c463
Update GvsCreateFilterSet.wdl to handle error in JointVcfFiltering an…
gbggrant 6e30a77
Resolve VS-818.
gbggrant b552d51
Merge remote-tracking branch 'origin/ah_var_store' into gg_VS-694_VQS…
gbggrant cd66405
Merge remote-tracking branch 'origin/ah_var_store' into gg_VS-694_VQS…
gbggrant fd19cba
Trying to standardize on 'project_id'
gbggrant e0091f9
Missed one.
gbggrant 15677b3
Merge remote-tracking branch 'origin/ah_var_store' into gg_VS-694_VQS…
gbggrant 8da8284
Cleanups, update the docker
gbggrant File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -56,7 +56,7 @@ workflow GvsCreateFilterSet { | |
String fq_sample_table = "~{project_id}.~{dataset_name}.sample_info" | ||
String fq_alt_allele_table = "~{project_id}.~{dataset_name}.alt_allele" | ||
String fq_info_destination_table = "~{project_id}.~{dataset_name}.filter_set_info" | ||
String fq_info_destination_table_vqsr_lite = "~{project_id}.~{dataset_name}.vqsr_lite_filter_set_info" | ||
String fq_info_destination_table_vqsr_lite = "~{project_id}.~{dataset_name}.filter_set_info_vqsr_lite" | ||
String fq_tranches_destination_table = "~{project_id}.~{dataset_name}.filter_set_tranches" | ||
String fq_filter_sites_destination_table = "~{project_id}.~{dataset_name}.filter_set_sites" | ||
|
||
|
@@ -158,14 +158,36 @@ workflow GvsCreateFilterSet { | |
preemptible_tries = 3, | ||
} | ||
|
||
call PopulateFilterSetInfo { | ||
# These calls to SelectVariants are being added for two reasons | ||
# 1) The snps_variant_scored_vcf and indels_variant_scored_vcf output by JointVcfFiltering contains ALL variants, | ||
# but are currently ONLY annotating SNPs and INDELs respectively. | ||
# 2) Those output VCFs also contain filtered sites which we don't want to put into the filter_set_info_vqsr_lite table. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. what does "filtered sites" here mean? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Sites in the VCF that have the filter field set to anything other than '.' or 'PASS'. |
||
call Utils.SelectVariants as CreateFilteredScoredSNPsVCF { | ||
input: | ||
input_vcf = MergeSNPScoredVCFs.output_vcf, | ||
input_vcf_index = MergeSNPScoredVCFs.output_vcf_index, | ||
type_to_include = "SNP", | ||
exclude_filtered = true, | ||
output_basename = "${filter_set_name}.filtered.scored.snps" | ||
} | ||
|
||
call Utils.SelectVariants as CreateFilteredScoredINDELsVCF { | ||
input: | ||
input_vcf = MergeINDELScoredVCFs.output_vcf, | ||
input_vcf_index = MergeINDELScoredVCFs.output_vcf_index, | ||
type_to_include = "INDEL", | ||
exclude_filtered = true, | ||
output_basename = "${filter_set_name}.filtered.scored.indels" | ||
} | ||
|
||
call PopulateFilterSetInfo { | ||
input: | ||
gatk_override = gatk_override, | ||
filter_set_name = filter_set_name, | ||
snp_recal_file = MergeSNPScoredVCFs.output_vcf, | ||
snp_recal_file_index = MergeSNPScoredVCFs.output_vcf_index, | ||
indel_recal_file = MergeINDELScoredVCFs.output_vcf, | ||
indel_recal_file_index = MergeINDELScoredVCFs.output_vcf_index, | ||
snp_recal_file = CreateFilteredScoredSNPsVCF.output_vcf, | ||
snp_recal_file_index = CreateFilteredScoredSNPsVCF.output_vcf_index, | ||
indel_recal_file = CreateFilteredScoredINDELsVCF.output_vcf, | ||
indel_recal_file_index = CreateFilteredScoredINDELsVCF.output_vcf_index, | ||
fq_info_destination_table = fq_info_destination_table_vqsr_lite, | ||
filter_schema = fq_info_destination_table_vqsr_lite_schema, | ||
query_project = project_id, | ||
|
@@ -402,7 +424,7 @@ task ExtractFilterTask { | |
>>> | ||
|
||
runtime { | ||
docker: "us.gcr.io/broad-dsde-methods/broad-gatk-snapshots:varstore_2022_10_17_2a8c210ac35094997603259fa1cd784486b92e42" | ||
docker: "us.gcr.io/broad-dsde-methods/broad-gatk-snapshots:varstore_2023_02_07_38fb966b84d138680e9e46992ae60feb28127d41" | ||
memory: "7 GB" | ||
disks: "local-disk 10 HDD" | ||
bootDiskSizeGb: 15 | ||
|
@@ -445,7 +467,7 @@ task PopulateFilterSetInfo { | |
|
||
export GATK_LOCAL_JAR=~{default="/root/gatk.jar" gatk_override} | ||
|
||
echo "Creating SNPs reacalibration file" | ||
echo "Creating SNPs recalibration file" | ||
gatk --java-options "-Xmx1g" \ | ||
CreateFilteringFiles \ | ||
--ref-version 38 \ | ||
|
@@ -455,7 +477,7 @@ task PopulateFilterSetInfo { | |
-V ~{snp_recal_file} \ | ||
-O ~{filter_set_name}.snps.recal.tsv | ||
|
||
echo "Creating INDELs reacalibration file" | ||
echo "Creating INDELs racalibration file" | ||
gatk --java-options "-Xmx1g" \ | ||
CreateFilteringFiles \ | ||
--ref-version 38 \ | ||
|
@@ -482,7 +504,7 @@ task PopulateFilterSetInfo { | |
>>> | ||
|
||
runtime { | ||
docker: "us.gcr.io/broad-dsde-methods/broad-gatk-snapshots:varstore_2022_10_17_2a8c210ac35094997603259fa1cd784486b92e42" | ||
docker: "us.gcr.io/broad-dsde-methods/broad-gatk-snapshots:varstore_2023_02_07_38fb966b84d138680e9e46992ae60feb28127d41" | ||
memory: "3500 MB" | ||
disks: "local-disk 250 HDD" | ||
bootDiskSizeGb: 15 | ||
|
@@ -538,7 +560,7 @@ task PopulateFilterSetSites { | |
>>> | ||
|
||
runtime { | ||
docker: "us.gcr.io/broad-dsde-methods/broad-gatk-snapshots:varstore_2022_10_17_2a8c210ac35094997603259fa1cd784486b92e42" | ||
docker: "us.gcr.io/broad-dsde-methods/broad-gatk-snapshots:varstore_2023_02_07_38fb966b84d138680e9e46992ae60feb28127d41" | ||
memory: "3500 MB" | ||
disks: "local-disk 200 HDD" | ||
bootDiskSizeGb: 15 | ||
|
@@ -585,7 +607,7 @@ task PopulateFilterSetTranches { | |
>>> | ||
|
||
runtime { | ||
docker: "us.gcr.io/broad-dsde-methods/broad-gatk-snapshots:varstore_2022_10_17_2a8c210ac35094997603259fa1cd784486b92e42" | ||
docker: "us.gcr.io/broad-dsde-methods/broad-gatk-snapshots:varstore_2023_02_07_38fb966b84d138680e9e46992ae60feb28127d41" | ||
memory: "3500 MB" | ||
disks: "local-disk 200 HDD" | ||
bootDiskSizeGb: 15 | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
are we going back to having the query proj being diff?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh shoot!