-
Notifications
You must be signed in to change notification settings - Fork 596
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bcftools to the rescue #7456
bcftools to the rescue #7456
Conversation
a1f7a59
to
0588913
Compare
0588913
to
991ffd3
Compare
7a64735
to
ef42f88
Compare
56f41e4
to
bda3960
Compare
7db583f
to
c63904b
Compare
c63904b
to
a6f668e
Compare
e334c96
to
9dd5fd4
Compare
37fcca1
to
753dc8e
Compare
753dc8e
to
d2b03d9
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
a couple small changes, otherwise 👍🏻
WHERE | ||
gvs_all_ac != gvs_afr_an + gvs_amr_an + gvs_eas_an + gvs_eur_an + gvs_mid_an + gvs_oth_an + gvs_sas_an' | ||
|
||
# if the result of the query has any rows, that means gvs_all_an has not been calulated correctly/ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit
# if the result of the query has any rows, that means gvs_all_an has not been calulated correctly/ | |
# if the result of the query has any rows, that means gvs_all_an has not been calculated correctly |
# ------------------------------------------------ | ||
# Output: {"Name of validation rule": "PASS/FAIL plus additional validation results"} | ||
output { | ||
Map[String, String] result = {"SubpopulationAlleleCount": read_string('validation_results.txt')} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Map[String, String] result = {"SubpopulationAlleleCount": read_string('validation_results.txt')} | |
Map[String, String] result = {"SubpopulationAlleleNumber": read_string('validation_results.txt')} |
Keeping the dockstore as is for now because I may want to run this on a few shards from the 30k while it's still in review
This pr adds a fair amount of work to the bcftools task (ExtractAnAcAfFromVCF) and adds a significant number of columns to the schema: the sample count for all of the samples, as well as for each subpopulation.
Note that AC_hemi will be added in a follow on pr
Note that additional validation tests will be added in a follow on pr