Run genome-wide association tests and plot Manhattan plots—adjust sample size to see power increase
Genome-Wide Association Studies (GWAS) test hundreds of thousands to millions of genetic variants (SNPs) for association with a phenotype. GWAS has identified thousands of loci influencing complex traits like height, disease risk, and metabolic traits, revolutionizing our understanding of genetic architecture.
The Manhattan plot displays association test results across the genome:
The probability of detecting an association depends on:
For each SNP, we test association using linear regression:
Testing millions of SNPs requires strict significance thresholds:
GWAS power calculations show:
In this simulation, SNPs are independent. In real GWAS:
GWAS emerged in the mid-2000s with the advent of SNP microarrays enabling affordable genome-wide genotyping. The Wellcome Trust Case Control Consortium (2007) demonstrated GWAS feasibility with the first large-scale studies of common diseases. Since then, GWAS meta-analyses have grown to hundreds of thousands of participants, identifying thousands of trait-associated loci and enabling polygenic risk scores for disease prediction.