Skip to content

Releases: ACEnglish/kanpig

v1.0.2

19 Jan 19:37
Compare
Choose a tag to compare

Minor patch

  • UI tweaks, including cleaner gt parameters
  • Slight htslib bam record and other minor efficiency improvement
  • Fix bug arising from ploidy==1 regions sometimes.
  • Fix minor bugs around PS and HP tag counting/record keeping

Full Changelog: v1.0.1...v1.0.2

v1.0.1

10 Dec 16:47
Compare
Choose a tag to compare

Faster, Smaller, More Accurate Genotyping

Kanpig now has sub-commands gt and plup. The new plup command will extract reads and their SVs from a bam file into a small file that's useful for long-term storage of reads.

  • gt improvements:
    • can parse plup files much more quickly (up to 8x) than parsing a bam. Though bam parsing is also now ~2x faster.
    • parse PS and HP information from haplotagged reads to increase genotyping accuracy as well as record long-range phasing information in the output VCF.
    • now uses kmedoid clustering instead of kmeans, resulting in a modest improvement to genotyping accuracy.

Full Updates: https://github.com/ACEnglish/kanpig/wiki/Updates#v101
Full Changelog: v0.3.1...v1.0.1

v0.3.1

25 Jun 15:00
Compare
Choose a tag to compare

Consistency improvements

  • New filtering of haplotypes without paths increases accuracy
  • New path scoring improves accuracy and consistency
  • ZS and SS FORMAT fields replaced by KS reporting the score
  • Requiring reads to span the full variant graph window including --chunksize buffer increases accuracy
  • Exhaustive search of partial haplotypes
  • Slight runtime reduction from avoidance of redundant path searches

Full Changelog: v0.3.0...v0.3.1

v0.3.0

11 Jun 20:06
Compare
Choose a tag to compare

General improvements

  • ~8% speed increase from less work in the path-searching
  • Partial haplotypes bug fix increases accuracy
  • Fixed SQ and FT fields
  • Dedicated writing thread helps reduce memory usage by preventing a backlog of completed variants while reading
  • Default --out is stdout to allow easier compression/indexing (e.g. kanpig .. | bcftools sort -O z -o out.vcf.gz)
  • IUPAC codes are fixed by kanpig according to vcf specifications (Issue #1)
  • Fixed filtering of symbolic alts and BNDs
  • Argument validation

Full Changelog: v0.2.0...v0.3.0

v0.2.0

21 May 21:24
Compare
Choose a tag to compare
  • Up to 40% reduction in runtime
  • Hemizygous and sex chromosome aware genotyping with new --ploidy-bed
  • Variants with alternate alleles of stars, monozygotic reference, and BNDs are filtered out
  • Comparing PathScores by their average size and sequence similarity increases accuracy

Full Changelog: v0.1.2...v0.2.0

v0.1.2

05 May 05:35
Compare
Choose a tag to compare
  • New optional hompolymer filter doesn't kmerize long homopolymers
  • Improved logging info
  • Correcting GQ field
  • Correcting kmer counting
  • Small speed/memory/io improvements
    • Off-loaded annotation work from the single writer thread to the worker threads and using a large
      multiple of page size for the BufWriter capacity
    • Fewer bam file opens
    • Fewer clone operations

Full Changelog: v0.1.0...v0.1.2