Finding haplotype tagging SNPs by use of principal components analysis

American Journal of Human Genetics
Zhen Lin, Russ B Altman

Abstract

The immense volume and rapid growth of human genomic data, especially single nucleotide polymorphisms (SNPs), present special challenges for both biomedical researchers and automatic algorithms. One such challenge is to select an optimal subset of SNPs, commonly referred as "haplotype tagging SNPs" (htSNPs), to capture most of the haplotype diversity of each haplotype block or gene-specific region. This information-reduction process facilitates cost-effective genotyping and, subsequently, genotype-phenotype association studies. It also has implications for assessing the risk of identifying research subjects on the basis of SNP information deposited in public domain databases. We have investigated methods for selecting htSNPs by use of principal components analysis (PCA). These methods first identify eigenSNPs and then map them to actual SNPs. We evaluated two mapping strategies, greedy discard and varimax rotation, by assessing the ability of the selected htSNPs to reconstruct genotypes of non-htSNPs. We also compared these methods with two other htSNP finders, one of which is PCA based. We applied these methods to three experimental data sets and found that the PCA-based methods tend to select the smallest set of htSNPs to ach...Continue Reading

References

Dec 31, 1997·Science·F S CollinsA Charkravarti
Mar 20, 2001·American Journal of Human Genetics·M StephensP Donnelly
Jun 8, 2001·Bioinformatics·O TroyanskayaR B Altman
Jul 14, 2001·Science·J C StephensG F Vovis
May 25, 2002·Science·Stacey B GabrielDavid Altshuler
Jun 8, 2002·Pharmacogenomics·Richard JudsonJ Claiborne Stephens
Jan 10, 2003·Nucleic Acids Research·David L WheelerLukas Wagner
Jun 11, 2003·American Journal of Human Genetics·Zhaoling MengMargaret G Ehm
Jul 15, 2003·American Journal of Human Genetics·Eric C Anderson, John Novembre
Aug 6, 2003·Proceedings of the National Academy of Sciences of the United States of America·Paola SebastianiMarco F Ramoni
Sep 25, 2003·Human Genetics·Jochen HampeMichael Krawczak
Oct 24, 2003·American Journal of Human Genetics·Matthew Stephens, Peter Donnelly
Dec 19, 2003·American Journal of Human Genetics·Christopher S CarlsonDeborah A Nickerson
Jul 13, 2004·Science·Zhen LinRuss B Altman

❮ Previous
Next ❯

Citations

May 4, 2007·International Archives of Occupational and Environmental Health·Alexis DescathaAnnette Leclerc
Oct 31, 2012·Cancer Metastasis Reviews·David S GutteryJacqueline A Shaw
Mar 24, 2011·European Journal of Human Genetics : EJHG·Wei YangC Charles Gu
Apr 21, 2001·European Journal of Human Genetics : EJHG·J AkeyM Xiong
Dec 14, 2006·European Journal of Human Genetics : EJHG·Keyue Ding, Iftikhar J Kullo
Jan 19, 2010·Journal of Computational Biology : a Journal of Computational Molecular Cell Biology·Lan LiuTao Jiang
Sep 17, 2013·Journal of Computational Biology : a Journal of Computational Molecular Cell Biology·Emrah Kostem, Eleazar Eskin
Jun 14, 2013·Omics : a Journal of Integrative Biology·Ilhan Ilhan, Gülay Tezel
Apr 1, 2011·Briefings in Bioinformatics·Raphaël MouradPhilippe Leray
Dec 8, 2006·Genome Research·Peristera PaschouPetros Drineas
Sep 21, 2006·Annals of Human Genetics·C Charles GuEric Boerwinkle
Aug 16, 2008·Annals of Human Genetics·B HanE Eskin
Sep 10, 2011·Annals of Human Genetics·Asif JavedPeristera Paschou
Jul 5, 2006·Journal of Bioinformatics and Computational Biology·Tu Minh PhuongRuss B Altman
Apr 7, 2005·Human Heredity·Bjarni V HalldórssonFrancisco M De La Vega
Feb 28, 2009·BMC Bioinformatics·Jun WangChun-yu Wang
Oct 21, 2005·BMC Bioinformatics·Joshua J FormanStephen J Haggarty
May 10, 2008·BMC Proceedings·Sohee Oh, Taesung Park
Sep 26, 2007·PLoS Genetics·Peristera PaschouPetros Drineas
Oct 10, 2006·Genetics·Mikko J Sillanpää, Madhuchhanda Bhattacharjee
Apr 6, 2007·Genetic Epidemiology·W James GaudermanDavid V Conti
Jun 15, 2011·Statistical Analysis and Data Mining·Yulan Liang, Arpad Kelemen
Feb 19, 2010·Molecular Ecology·Joost Van HeerwaardenLuis E Eguiarte
Jun 27, 2009·IEEE Transactions on Information Technology in Biomedicine : a Publication of the IEEE Engineering in Medicine and Biology Society·Arpad KelemenYulan Liang
Jul 14, 2010·NeuroImage·Maria VounouUNKNOWN Alzheimer's Disease Neuroimaging Initiative
Feb 13, 2008·Genomics, Proteomics & Bioinformatics·Nina Zhou, Lipo Wang
Aug 5, 2005·American Journal of Human Genetics·Duncan C ThomasDavid Duggan
Oct 28, 2016·Molecular Ecology·Jaime A ChavesJ Albert C Uy
Oct 16, 2016·Biometrical Journal. Biometrische Zeitschrift·Juha Karvanen, Mikko J Sillanpää
Mar 20, 2010·The Journal of Biological Chemistry·Elizabeth D KimSunyoung Kim
Jul 22, 2006·Genetic Epidemiology·Yan V SunSharon L R Kardia
Sep 27, 2007·Genetic Epidemiology·C Charles GuD C Rao
Sep 21, 2005·Genetic Epidemiology·Zhenqiu Liu, Shili Lin

❮ Previous
Next ❯

Related Concepts

Related Feeds

BioHub - Researcher Network

The Chan-Zuckerberg Biohub aims to support the fundamental research and develop the technologies that will enable physicians to cure, prevent, or manage all diseases in our childrens' lifetimes. The CZ Biohub brings together researchers from UC Berkeley, Stanford, and UCSF. Find the latest research from the CZ Biohub researcher network here.

Related Papers

Proceedings of the National Academy of Sciences of the United States of America
Paola SebastianiMarco F Ramoni
© 2021 Meta ULC. All rights reserved