Releases: ACEnglish/kanpig
Releases · ACEnglish/kanpig
v1.0.2
Minor patch
- UI tweaks, including cleaner gt parameters
- Slight htslib bam record and other minor efficiency improvement
- Fix bug arising from ploidy==1 regions sometimes.
- Fix minor bugs around PS and HP tag counting/record keeping
Full Changelog: v1.0.1...v1.0.2
v1.0.1
Faster, Smaller, More Accurate Genotyping
Kanpig now has sub-commands gt
and plup
. The new plup
command will extract reads and their SVs from a bam file into a small file that's useful for long-term storage of reads.
gt
improvements:- can parse plup files much more quickly (up to 8x) than parsing a bam. Though bam parsing is also now ~2x faster.
- parse PS and HP information from haplotagged reads to increase genotyping accuracy as well as record long-range phasing information in the output VCF.
- now uses kmedoid clustering instead of kmeans, resulting in a modest improvement to genotyping accuracy.
Full Updates: https://github.com/ACEnglish/kanpig/wiki/Updates#v101
Full Changelog: v0.3.1...v1.0.1
v0.3.1
Consistency improvements
- New filtering of haplotypes without paths increases accuracy
- New path scoring improves accuracy and consistency
- ZS and SS FORMAT fields replaced by KS reporting the score
- Requiring reads to span the full variant graph window including --chunksize buffer increases accuracy
- Exhaustive search of partial haplotypes
- Slight runtime reduction from avoidance of redundant path searches
Full Changelog: v0.3.0...v0.3.1
v0.3.0
General improvements
- ~8% speed increase from less work in the path-searching
- Partial haplotypes bug fix increases accuracy
- Fixed SQ and FT fields
- Dedicated writing thread helps reduce memory usage by preventing a backlog of completed variants while reading
- Default
--out
is stdout to allow easier compression/indexing (e.g.kanpig .. | bcftools sort -O z -o out.vcf.gz
) - IUPAC codes are fixed by kanpig according to vcf specifications (Issue #1)
- Fixed filtering of symbolic alts and BNDs
- Argument validation
Full Changelog: v0.2.0...v0.3.0
v0.2.0
- Up to 40% reduction in runtime
- Hemizygous and sex chromosome aware genotyping with new
--ploidy-bed
- Variants with alternate alleles of stars, monozygotic reference, and BNDs are filtered out
- Comparing PathScores by their average size and sequence similarity increases accuracy
Full Changelog: v0.1.2...v0.2.0
v0.1.2
- New optional hompolymer filter doesn't kmerize long homopolymers
- Improved logging info
- Correcting GQ field
- Correcting kmer counting
- Small speed/memory/io improvements
- Off-loaded annotation work from the single writer thread to the worker threads and using a large
multiple of page size for the BufWriter capacity - Fewer bam file opens
- Fewer clone operations
- Off-loaded annotation work from the single writer thread to the worker threads and using a large
Full Changelog: v0.1.0...v0.1.2