Copy number variations are a significant source of genetic diversity in humans. The platform of choice to detect genome-wide CNVs has traditionally been microarray, including SNP arrays which can also detect copy neutral LOH regions. Next-generation sequencing (NGS) has advanced rapidly and is the platform of choice for sequence variants. Decreasing costs have made it much more affordable to perform sequencing and the need for deriving copy number from NGS data has been rising. There are several tools available (CoNIFER, xHMM, CNVkit, QDNA) for this but most are not user-friendly; they require strong bioinformatics expertise and use of the command line. Often each tool is adept in only one arena (e.g. cancer or constitutional samples, data from WGS or targeted panels, …).
A new algorithm incorporated into Nexus Copy Number 9.0 is the BAM (multiscale reference) method. BAM (multiscale reference) functions well with both shallow/targeted sequencing data and WGS/WES with normal depth of coverage. We compare the ability to detect CNVs between algorithms and platforms.
NGS methods compared:
- CNVkit
- BAM ngCGH (matched)
- BAM (multiscale reference)
Data sets used:
- Large-cell Lung Carcinoma (LCC) panel data with eight normals including pair matched normals
- Shallow (low pass) PGD (Preimplantation Genetic Diagnosis) WGS data (<1x)
- Constitutional (DiGeorge Syndrome) WES with normal depth of coverage (30x) and microarray