Missing data imputation using statistical and machine learning methods in a real breast cancer problem

Artificial Intelligence in Medicine
José M JerezLeonardo Franco

Abstract

Missing data imputation is an important task in cases where it is crucial to use all available data and not discard records with missing values. This work evaluates the performance of several statistical and machine learning imputation methods that were used to predict recurrence in patients in an extensive real breast cancer data set. Imputation methods based on statistical techniques, e.g., mean, hot-deck and multiple imputation, and machine learning techniques, e.g., multi-layer perceptron (MLP), self-organisation maps (SOM) and k-nearest neighbour (KNN), were applied to data collected through the "El Álamo-I" project, and the results were then compared to those obtained from the listwise deletion (LD) imputation method. The database includes demographic, therapeutic and recurrence-survival information from 3679 women with operable invasive breast cancer diagnosed in 32 different hospitals belonging to the Spanish Breast Cancer Research Group (GEICAM). The accuracies of predictions on early cancer relapse were measured using artificial neural networks (ANNs), in which different ANNs were estimated using the data sets with imputed missing values. The imputation methods based on machine learning algorithms outperformed imputat...Continue Reading

References

Feb 1, 1997·Artificial Intelligence in Medicine·G F CooperP Spirtes
Jun 8, 2001·Bioinformatics·O TroyanskayaR B Altman
Nov 14, 2001·Neural Computation·R Setiono
Dec 11, 2002·Artificial Intelligence in Medicine·José M Jerez-AragonésEmilio Alba-Conejo
Mar 8, 2003·International Journal of Urology : Official Journal of the Japanese Urological Association·Keita FujikawaTatsushiro Okabe
Jan 22, 2004·Medicina clínica·Miguel MartínUNKNOWN Grupo GEICAM
Jul 6, 2005·The Australian and New Zealand Journal of Psychiatry·Graeme Hawthorne, Peter Elliott
Oct 29, 2005·Breast Cancer Research and Treatment·J M JerezM Martín
Feb 18, 2006·Neural Networks : the Official Journal of the International Neural Network Society·Paulo J Lisboa, Azzam F G Taktak
Jul 11, 2007·Statistical Methods in Medical Research·Michael G Kenward, James Carpenter
Jul 11, 2007·Statistical Methods in Medical Research·Gareth AmblerPatrick Royston
Jul 20, 2007·Statistics in Medicine·Juned Siddique, Thomas R Belin
Feb 6, 2008·IEEE Transactions on Neural Networks·S KaskiJ Peltonen

❮ Previous
Next ❯

Citations

Feb 23, 2013·Artificial Intelligence in Medicine·Federico CismondiStan N Finkelstein
Mar 10, 2012·Neural Networks : the Official Journal of the International Neural Network Society·E J PalomoT Watson
Dec 23, 2011·Artificial Intelligence in Medicine·Loris NanniSheryl Brahnam
Nov 6, 2014·BMC Bioinformatics·Serena G LiaoGeorge C Tseng
Oct 2, 2015·Journal of Biomedical Informatics·Miriam Seoane SantosArmando Carvalho
Jun 22, 2014·IEEE Journal of Biomedical and Health Informatics·Darwin TayRichard Kitney
Mar 1, 2015·Computers in Biology and Medicine·Pedro J García-LaencinaNoémia Afonoso
Jan 3, 2015·Journal of Biomedical Informatics·Robert R KelleyJulio Ramirez
Sep 17, 2013·Journal of Biomedical Informatics·Darwin TayRichard I Kitney
Sep 29, 2012·Computer Methods and Programs in Biomedicine·D UrdaJ M Jerez
Nov 1, 2016·Journal of Medical Systems·Shalini GambhirYugal Kumar
Dec 8, 2016·Applied Clinical Informatics·José Carlos FerrãoHenrique M G Martins
Sep 13, 2014·Proceedings of the Institution of Mechanical Engineers. Part H, Journal of Engineering in Medicine·Omneya Attallah, Xianghong Ma
Jan 1, 2015·Bioinformatics and Biology Insights·William SeffensHerman Taylor
Jan 21, 2020·Biometrical Journal. Biometrische Zeitschrift·Ralph C WardMulugeta Gebregziabher
Mar 5, 2020·Clinical Pharmacology and Therapeutics·Solveig BadilloJitao David Zhang
Nov 5, 2019·Pain Practice : the Official Journal of World Institute of Pain·Francisco J Pérez-BenitoCésar Fernández-de-Las-Peñas
Oct 28, 2019·Health Information Science and Systems·Xuetong WuMichelle Peate
Feb 16, 2019·Scientific Reports·Peilin LiYang Liu
Mar 27, 2020·Sensors·Xianglin ZhuMuhammad Shahzad
May 23, 2020·Pain Reports·Steven Z GeorgeJeff Boissoneault
Oct 1, 2020·BMC Medical Informatics and Decision Making·Wei Tse LiWeg M Ongkeko
Nov 11, 2019·Journal of Clinical Medicine·Jau-Woei PerngChih-Min Su
Jul 15, 2020·The Journal of Knee Surgery·Emily LearyJames L Cook
Feb 28, 2019·Entropy·Jaime Salvador-MenesesJose Garcia-Rodriguez
Oct 17, 2020·International Journal of General Medicine·Fatima AlshakhsMohamed Elasheri
Nov 7, 2020·PLoS Computational Biology·Pei-Yau LungJinfeng Zhang
Feb 25, 2021·Applied Clinical Informatics·Joanna AbrahamAlicia Meng
Mar 10, 2021·The Journal of Clinical Endocrinology and Metabolism·Luiz Eduardo WildembergMônica Gadelha
May 6, 2021·Journal of Sports Science & Medicine·Lauren C BensonCarolyn A Emery
Aug 21, 2021·Frontiers in Neurology·Gerome VivarSeyed-Ahmad Ahmadi
Aug 24, 2021·Burns and Trauma·Francisco Serra E MouraChidi Ekwobi
Sep 15, 2021·Circulation. Cardiovascular Quality and Outcomes·Byron C JaegerRamaraju Rudraraju
Nov 5, 2021·Skeletal Radiology·Gabby B JosephThomas M Link
Nov 6, 2020··Hanan Hammad Alharbi, Masaomi Kimura
Nov 14, 2017··Ahmad AhmadovRobert Wrembel

❮ Previous
Next ❯

Related Concepts

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Blastomycosis

Blastomycosis fungal infections spread through inhaling Blastomyces dermatitidis spores. Discover the latest research on blastomycosis fungal infections here.

Nuclear Pore Complex in ALS/FTD

Alterations in nucleocytoplasmic transport, controlled by the nuclear pore complex, may be involved in the pathomechanism underlying multiple neurodegenerative diseases including Amyotrophic Lateral Sclerosis and Frontotemporal Dementia. Here is the latest research on the nuclear pore complex in ALS and FTD.

Applications of Molecular Barcoding

The concept of molecular barcoding is that each original DNA or RNA molecule is attached to a unique sequence barcode. Sequence reads having different barcodes represent different original molecules, while sequence reads having the same barcode are results of PCR duplication from one original molecule. Discover the latest research on molecular barcoding here.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Evolution of Pluripotency

Pluripotency refers to the ability of a cell to develop into three primary germ cell layers of the embryo. This feed focuses on the mechanisms that underlie the evolution of pluripotency. Here is the latest research.

Position Effect Variegation

Position Effect Variagation occurs when a gene is inactivated due to its positioning near heterochromatic regions within a chromosome. Discover the latest research on Position Effect Variagation here.

STING Receptor Agonists

Stimulator of IFN genes (STING) are a group of transmembrane proteins that are involved in the induction of type I interferon that is important in the innate immune response. The stimulation of STING has been an active area of research in the treatment of cancer and infectious diseases. Here is the latest research on STING receptor agonists.

Microbicide

Microbicides are products that can be applied to vaginal or rectal mucosal surfaces with the goal of preventing, or at least significantly reducing, the transmission of sexually transmitted infections. Here is the latest research on microbicides.