Please use this identifier to cite or link to this item: http://repositorio.ufla.br/jspui/handle/1/13980
Title: On the equivalence of methods for population stratification and their application in genetic association studies
Other Titles: Sobre a equivalência de métodos para populações estratificadas e sua aplicação em estudos de associação genética
Authors: Duarte, Nubia Esteban
Giolo, Suely Ruiz
Andrade, Mariza de
Soler, Júlia Pavan Pavan
Keywords: Principal components
Singular value decomposition
Duality in genetic matrix
Componentes principais - Análise
Marcadores genéticos
Decomposição em valores singulares
Dualidade da matriz genética
Issue Date: 1-Aug-2017
Publisher: Universidade Federal de Lavras
Citation: DUARTE, N. E. et al. On the equivalence of methods for population stratification and their application in genetic association studies. Revista Brasileira de Biometria, Lavras, v. 33, n. 4, p.494-507, dez. 2015.
Description: Population-based association studies with unrelated individuals have been used in the mapping of genes involved in the regulation of complex diseases. However, when subjects are from dierent ethnic ancestries, these studies may yield spurious associations due to population stratication, with an excess of false positive or negative results. Principal components analysis based either on genotype values from known genetic markers (columns of the matrix) or on individuals (rows of the matrix) are the most common approaches used for correction of the confounding effect due the population stratication in genetic association studies. In this paper, results from the singular value decomposition theory of matrices are used to show the analytical equivalence between these approaches, focusing mainly in their relevant role in population stratication analysis. It is also shown the importance of using the biplot as a visualization tool not only to explain the joint information of samples and genetic markers but also to detect informative markers. Although both procedures can be used to correct for population stratication, principal components analysis based on samples is more computationally feasible due to the large number of genetic markers (n << p problem). As an application, it is used genotype data from four HapMap populations.
URI: http://repositorio.ufla.br/jspui/handle/1/13980
Appears in Collections:Revista Brasileira de Biometria



This item is licensed under a Creative Commons License Creative Commons