Abstract
This paper is about an algorithm, FlexTree, for general supervised learning. It extends the binary tree-structured approach (Classification and Regression Trees, CART) although it differs greatly in its selection and combination of predictors. It is particularly applicable to assessing interactions: gene by gene and gene by environment as they bear on complex disease. One model for predisposition to complex disease involves many genes. Of them, most are pure noise; each of the values that is not the prevalent genotype for the minority of genes that contribute to the signal carries a "score." Scores add. Individuals with scores above an unknown threshold are predisposed to the disease. For the additive score problem and simulated data, FlexTree has cross-validated risk better than many cutting-edge technologies to which it was compared when small fractions of candidate genes carry the signal. For the model where only a precise list of aberrant genotypes is predisposing, there is not a systematic pattern of absolute superiority; however, overall, FlexTree seems better than the other technologies. We tried the algorithm on data from 563 Chinese women, 206 hypotensive, 357 hypertensive, with information on ethnicity, menopausal sta...Continue Reading
References
Jan 16, 1992·Nature·R P LiftonJ M Lalouel
Jul 1, 1994·Hypertension·A BonnardeauxF Soubrier
May 3, 1996·Science·R P Lifton
Mar 5, 1999·Science·M ElcheblyB P Kennedy
Dec 7, 2000·Genetic Epidemiology·H Zhang, G Bonney
Feb 24, 2001·Genome Biology·T HastieP Brown
Feb 22, 2001·Annual Review of Physiology·J P Morello, D G Bichet
Oct 9, 2001·Human Molecular Genetics·K RanadeD Botstein
Nov 21, 2001·Journal of Molecular Medicine : Official Organ of the Gesellschaft Deutscher Naturforscher Und Ärzte·L M ChuangT Y Tai
Feb 22, 2002·European Journal of Biochemistry·Stephanie BechtelRita Bernhardt
Feb 28, 2002·Biochemical and Biophysical Research Communications·Claes-Göran OstensonSuad Efendic
Jun 12, 2003·Metabolism: Clinical and Experimental·Xiangdong WuBarry J Goldstein
Aug 13, 2003·Current Atherosclerosis Reports·Gerald M Reaven
Citations
Oct 29, 2010·Human Genetics·Jiang GuiAngeline S Andrew
Jul 26, 2011·Journal of Autism and Developmental Disorders·Yun JiaoEdward H Herskovits
Jan 17, 2008·European Journal of Human Genetics : EJHG·Yan V Sun, Sharon Lr Kardia
Jun 12, 2010·Bioinformatics·Xing HuaAnthony Y C Kuk
Jun 15, 2006·Biostatistics·Zhi Wei, Hongzhe Li
Apr 13, 2007·Biostatistics·Mee Young Park, Trevor Hastie
Jan 18, 2007·Annals of Human Genetics·K YuQ Lan
Sep 18, 2010·Annals of Human Genetics·Nengjun YiBoris Pasche
Sep 22, 2011·Human Heredity·Annette M MolinaroNilanjan Chatterjee
Dec 8, 2009·BMC Medical Genetics·Hua HeSaonli Basu
Mar 17, 2009·PloS One·Richard A MushlinTimothy R Rebbeck
Dec 6, 2011·PloS One·Joong-Ho WonRichard A Olshen
Apr 27, 2012·PloS One·Gang FangVipin Kumar
Jun 28, 2013·PloS One·Jiang GuiDiane Gilbert-Diamond
Mar 22, 2014·PloS One·Sait Can Yücebaş, Yeşim Aydın Son
Jun 3, 2014·PloS One·Sangho YoonRichard A Olshen
Apr 17, 2007·Genetics·Anders AlbrechtsenRasmus Nielsen
Dec 26, 2006·Pharmacogenomics·Eugene LinEllson Y Chen
Oct 11, 2007·Pharmacogenomics·Alison A MotsingerDavid M Reif
Nov 1, 2011·Advances in Medical Sciences·Y JiaoE H Herskovits
Oct 26, 2016·Journal of Human Genetics·Dong-Hao JinJoobae Park
Jan 30, 2019·Archives of Toxicology·Tobias TietzHolger Schwender
Aug 12, 2009·European Journal of Epidemiology·Ronja Foraita
Jan 27, 2006·Genetic Epidemiology·Wen-Harn PanHsing-Yi Chang
Jan 1, 2009·Statistical Science : a Review Journal of the Institute of Mathematical Statistics·Charles KooperbergIndika Rajapakse