selscan
selscan implements haplotype-based scans to detect recent or ongoing positive selection in genomic datasets.
Key Features:
- Implemented statistics: Computes Extended Haplotype Homozygosity (EHH), Integrated Haplotype Score (iHS), and Cross-population EHH (XPEHH).
- Multithreaded performance: Uses multithreading to accelerate computation on large simulated and real datasets.
- Input formats: Accepts phased genotypes in formats including TPED.
- Performance benchmarks: iHS on chromosome 22 (22,147 loci) with 204 CEU haplotypes: 353 seconds (1 thread) and 33 seconds (16 threads); XPEHH relative to 210 YRI haplotypes: 578 seconds (1 thread) and 52 seconds (16 threads).
- Scalability: Designed to process extensive genomic datasets efficiently for high-throughput analyses.
Scientific Applications:
- Detection of positive selection: Identifies loci with extended haplotypes indicative of recent or ongoing selective sweeps within populations.
- Comparative population genomics: Detects differential selection between populations by comparing haplotype homozygosity.
- Evolutionary biology and population genetics: Supports inference of adaptive traits and evolutionary pressures from large-scale genotype data.
Methodology:
Implements EHH, iHS, and XPEHH: EHH measures decay of haplotype homozygosity with distance from a focal SNP; iHS quantifies integrated haplotype homozygosity within a population; XPEHH compares haplotype homozygosity between populations.
Topics
Collections
Details
- License:
- GPL-3.0
- Tool Type:
- command-line tool
- Operating Systems:
- Linux
- Programming Languages:
- R, Shell, C++
- Added:
- 8/20/2017
- Last Updated:
- 11/25/2024
Operations
Publications
Szpiech ZA, Hernandez RD. selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection. Molecular Biology and Evolution. 2014;31(10):2824-2827. doi:10.1093/molbev/msu211. PMID:25015648. PMCID:PMC4166924.