cn.mops
cn.mops estimates copy number variations (CNVs) and copy number aberrations (CNAs) from next-generation sequencing (NGS) data by modeling read depth across samples to enable accurate CNV detection for genomic analysis and association studies.
Key Features:
- Data processing: Converts BAM files into read count matrices or genomic ranges objects for per-position coverage analysis.
- Depth-of-coverage modeling: Models depths of coverage across samples at each genomic position to mitigate technological and genomic coverage biases.
- Bayesian mixture modeling: Uses a Bayesian framework with mixture components of Poisson distributions to decompose coverage variation into integer copy numbers and noise.
- Noise estimation and FDR control: Estimates noise levels and filters high-noise detections to reduce the false discovery rate (FDR).
Scientific Applications:
- CNV/CNA detection: Detection and analysis of CNVs and CNAs from NGS datasets.
- Association studies: Association analyses between CNVs and disease or phenotypic traits, with reduced false positives improving statistical power.
Methodology:
Converts BAM files to read count matrices or genomic ranges objects; models per-position coverage across samples; applies a Bayesian mixture of Poisson distributions to infer integer copy numbers and noise; estimates and filters high-noise detections to reduce FDR; benchmarked against five CNV detection methods on four datasets (simulated data; NGS data from a male HapMap individual with implanted CNVs on the X chromosome; HapMap individuals with known CNVs; high-coverage 1000 Genomes Project data) with evaluation by precision (1−FDR) and recall for gains and losses.
Topics
Collections
Details
- License:
- GPL-2.0
- Tool Type:
- command-line tool, library
- Operating Systems:
- Linux, Windows, Mac
- Programming Languages:
- R
- Added:
- 1/17/2017
- Last Updated:
- 12/30/2018
Operations
Publications
Klambauer G, Schwarzbauer K, Mayr A, Clevert D, Mitterecker A, Bodenhofer U, Hochreiter S. cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate. Nucleic Acids Research. 2012;40(9):e69-e69. doi:10.1093/nar/gks003. PMID:22302147. PMCID:PMC3351174.