GA4GH WGS Quality Control Standards

References

ICGC ARGO

dnaalnqc: https://github.com/icgc-argo-workflows/dnaalnqc

vcfqc: https://github.com/icgc-argo-workflows/vcfqc

DRAGEN

Illumina DRAGEN Bio-IT Platform 3.7 User Guide

Bedtools subtract

https://bedtools.readthedocs.io/en/latest/content/tools/subtract.html

NPM-sample-qc

https://github.com/c-BIG/NPM-sample-qc

Samtools stats

http://www.htslib.org/doc/samtools-stats.html

Samtools view

http://www.htslib.org/doc/samtools-view.html

VCF Format

https://samtools.github.io/hts-specs/VCFv4.2.pdf

VerifyBamID2

https://github.com/Griffan/VerifyBamID

VerifyBamID reference panel

Pre-calculated reference panel from the 1000 Genomes Project phase 3 dataset: 100,000 sites mapped to GRCh38

  • UDPath: https://raw.githubusercontent.com/Griffan/VerifyBamID/master/resource/1000g.phase3.100k.b38.vcf.gz.dat.UD
  • BedPath: https://raw.githubusercontent.com/Griffan/VerifyBamID/master/resource/1000g.phase3.100k.b38.vcf.gz.dat.bed
  • MeanPath: https://raw.githubusercontent.com/Griffan/VerifyBamID/master/resource/1000g.phase3.100k.b38.vcf.gz.dat.mu
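
The three panel files listed above share one prefix and differ only in their suffix (`.UD`, `.bed`, `.mu`). As a minimal sketch, assuming you want to derive all three URLs from that shared prefix rather than copy each one, the helper below rebuilds them. The names `PANEL_PREFIX`, `PANEL_SUFFIXES`, and `panel_urls` are hypothetical and are not part of VerifyBamID2; the code only builds strings and does not download anything.

```python
# Shared prefix of the three reference panel files listed on this page.
PANEL_PREFIX = (
    "https://raw.githubusercontent.com/Griffan/VerifyBamID/master/"
    "resource/1000g.phase3.100k.b38.vcf.gz.dat"
)

# Label used on this page -> file suffix appended to the shared prefix.
PANEL_SUFFIXES = {"UDPath": ".UD", "BedPath": ".bed", "MeanPath": ".mu"}


def panel_urls(prefix: str = PANEL_PREFIX) -> dict[str, str]:
    """Hypothetical helper: map each label to its full panel file URL."""
    return {label: prefix + suffix for label, suffix in PANEL_SUFFIXES.items()}


if __name__ == "__main__":
    for label, url in panel_urls().items():
        print(f"{label}: {url}")
```

Passing a different `prefix` (for example, a local directory holding copies of the same three files) yields the matching local paths, assuming the file names are kept unchanged.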

GA4GH Large Scale Genomics Workstream, WGS-QC Standards team
