Self-reported ethnicity, genetic structure and the impact of population stratification in a multiethnic study

Hum Genet. 2010 Aug;128(2):165-77. doi: 10.1007/s00439-010-0841-4. Epub 2010 May 25.

Abstract

It is well-known that population substructure may lead to confounding in case-control association studies. Here, we examined genetic structure in a large racially and ethnically diverse sample consisting of five ethnic groups of the Multiethnic Cohort study (African Americans, Japanese Americans, Latinos, European Americans and Native Hawaiians) using 2,509 SNPs distributed across the genome. Principal component analysis on 6,213 study participants, 18 Native Americans and 11 HapMap III populations revealed four important principal components (PCs): the first two separated Asians, Europeans and Africans, and the third and fourth corresponded to Native American and Native Hawaiian (Polynesian) ancestry, respectively. Individual ethnic composition derived from self-reported parental information matched well to genetic ancestry for Japanese and European Americans. STRUCTURE-estimated individual ancestral proportions for African Americans and Latinos are consistent with previous reports. We quantified the East Asian (mean 27%), European (mean 27%) and Polynesian (mean 46%) ancestral proportions for the first time, to our knowledge, for Native Hawaiians. Simulations based on realistic settings of case-control studies nested in the Multiethnic Cohort found that the effect of population stratification was modest and readily corrected by adjusting for race/ethnicity or by adjusting for top PCs derived from all SNPs or from ancestry informative markers; the power of these approaches was similar when averaged across causal variants simulated based on allele frequencies of the 2,509 genotyped markers. The bias may be large in case-only analysis of gene by gene interactions but it can be corrected by top PCs derived from all SNPs.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Asian / ethnology
  • Asian / genetics
  • Asian People / ethnology
  • Asian People / genetics
  • Black People / ethnology
  • Black People / genetics
  • Black or African American / ethnology
  • Black or African American / genetics
  • Case-Control Studies
  • Cohort Studies
  • Data Collection
  • Ethnicity / ethnology
  • Ethnicity / genetics*
  • Gene Frequency
  • Genetic Structures
  • Genotype
  • Hispanic or Latino / ethnology
  • Hispanic or Latino / genetics
  • Humans
  • Male
  • Native Hawaiian or Other Pacific Islander / ethnology
  • Native Hawaiian or Other Pacific Islander / genetics
  • Polymorphism, Single Nucleotide
  • Population Groups / ethnology
  • Population Groups / genetics
  • White People / ethnology
  • White People / genetics