How should variable selection be performed with multiply imputed data?

Statistics in Medicine
Angela M WoodPatrick Royston

Abstract

Multiple imputation is a popular technique for analysing incomplete data. Given the imputed data and a particular model, Rubin's rules (RR) for estimating parameters and standard errors are well established. However, there are currently no guidelines for variable selection in multiply imputed data sets. The usual practice is to perform variable selection amongst the complete cases, a simple but inefficient and potentially biased procedure. Alternatively, variable selection can be performed by repeated use of RR, which is more computationally demanding. An approximation can be obtained by a simple 'stacked' method that combines the multiply imputed data sets into one and uses a weighting scheme to account for the fraction of missing data in each covariate. We compare these and other approaches using simulations based around a trial in community psychiatry. Most methods improve on the naïve complete-case analysis for variable selection, but importantly the type 1 error is only preserved if selection is based on RR, which is our recommended approach.

Associated Clinical Trials

References

Dec 1, 1991·Statistics in Medicine·D G Altman
Jul 1, 1989·Statistics in Medicine·D G Altman, P K Andersen
Apr 16, 1999·Statistics in Medicine·S van BuurenD L Knook
Nov 11, 2005·Clinical Trials : Journal of the Society for Clinical Trials·Angela M WoodSimon G Thompson
Jul 11, 2007·Statistical Methods in Medical Research·Michael G Kenward, James Carpenter
Jul 11, 2007·Statistical Methods in Medical Research·L-M YuOliver Rivero-Arias
Jul 11, 2007·Statistical Methods in Medical Research·James R CarpenterIan R White

❮ Previous
Next ❯

Citations

Mar 27, 2013·Journal of Community Health·Electra D PaskettMira L Katz
Nov 26, 2009·Journal of Cancer Survivorship : Research and Practice·Anne C KirchhoffKaren L Syrjala
Sep 23, 2008·Journal of Laparoendoscopic & Advanced Surgical Techniques. Part a·Jennifer M J SchreinemakersInne H M Borel Rinkes
Apr 19, 2012·Critical Care Medicine·Sabina HunzikerMichael D Howell
Oct 14, 2009·Journal of Public Health Management and Practice : JPHMP·Zhiwei Zeng
Nov 12, 2013·BMJ : British Medical Journal·Matthieu HanfMichel Chavance
Sep 18, 2010·BMC Medical Research Methodology·David VergouwDaniëlle A W M van der Windt
Mar 9, 2012·American Journal of Public Health·Stefan WalterHenning Tiemeier
May 30, 2013·Annals of the Rheumatic Diseases·Jasmijn F M HollaJoost Dekker
Dec 26, 2009·European Spine Journal : Official Publication of the European Spine Society, the European Spinal Deformity Society, and the European Section of the Cervical Spine Research Society·Pieter H HelmhoutRob A de Bie
Sep 27, 2014·Trials·Anna Thit JohnsenMogens Groenvold
Oct 18, 2014·BMC Medical Research Methodology·Jammbe Z MusoroRonald B Geskus
Aug 11, 2012·Health Services & Outcomes Research Methodology·Ashok Chaurasia, Ofer Harel
Dec 22, 2015·Biology of Blood and Marrow Transplantation : Journal of the American Society for Blood and Marrow Transplantation·Annemarie M J BraamseKaren L Syrjala
Jan 27, 2016·BMC Research Notes·Catrin O PlumptonIan R White
Nov 18, 2015·The Breast : Official Journal of the European Society of Mastology·Sophie FrankCaroline Cuvier
May 3, 2011·Revista española de cardiología·Eduardo NúñezJulio Núñez
Apr 13, 2010·The Journal of Heart and Lung Transplantation : the Official Publication of the International Society for Heart Transplantation·Cynthia J GriesChristopher H Goss
Nov 14, 2015·Statistics in Medicine·Xiaoyan YinMartin G Larson
Aug 1, 2015·Annals of Surgical Oncology·Michelle G RathJoachim Rom
Jul 15, 2009·Journal of Clinical Epidemiology·Yvonne VergouweDouglas G Altman
Mar 29, 2016·HPB : the Official Journal of the International Hepato Pancreato Biliary Association·Samer TohmeAllan Tsung
Feb 19, 2016·Neurology·Daan C VelseboerCaroline H Williams-Gray
Oct 29, 2011·International Journal of Cancer. Journal International Du Cancer·Rebecca GilbertRichard M Martin
Jan 13, 2011·Statistics in Medicine·Ian R WhiteAngela M Wood
Jul 17, 2013·Journal of Thrombosis and Haemostasis : JTH·J M T HendriksenJ A H de Groot
Aug 24, 2013·Biometrical Journal. Biometrische Zeitschrift·Chung-Wei Shen, Yi-Hau Chen
Mar 26, 2013·Statistics in Medicine·Qixuan Chen, Sijian Wang
Nov 3, 2010·Academic Emergency Medicine : Official Journal of the Society for Academic Emergency Medicine·Gerard M O'ReillyBelinda Gabbe
Mar 30, 2016·European Journal of Clinical Pharmacology·Andreas D MeidWalter E Haefeli
Jun 23, 2015·Statistics in Medicine·Tim P MorrisPatrick Royston
Jan 30, 2015·Statistics in Medicine·Debashis GhoshDonna L Coffman

❮ Previous
Next ❯

Related Concepts

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Blastomycosis

Blastomycosis fungal infections spread through inhaling Blastomyces dermatitidis spores. Discover the latest research on blastomycosis fungal infections here.

Nuclear Pore Complex in ALS/FTD

Alterations in nucleocytoplasmic transport, controlled by the nuclear pore complex, may be involved in the pathomechanism underlying multiple neurodegenerative diseases including Amyotrophic Lateral Sclerosis and Frontotemporal Dementia. Here is the latest research on the nuclear pore complex in ALS and FTD.

Applications of Molecular Barcoding

The concept of molecular barcoding is that each original DNA or RNA molecule is attached to a unique sequence barcode. Sequence reads having different barcodes represent different original molecules, while sequence reads having the same barcode are results of PCR duplication from one original molecule. Discover the latest research on molecular barcoding here.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Evolution of Pluripotency

Pluripotency refers to the ability of a cell to develop into three primary germ cell layers of the embryo. This feed focuses on the mechanisms that underlie the evolution of pluripotency. Here is the latest research.

Position Effect Variegation

Position Effect Variagation occurs when a gene is inactivated due to its positioning near heterochromatic regions within a chromosome. Discover the latest research on Position Effect Variagation here.

STING Receptor Agonists

Stimulator of IFN genes (STING) are a group of transmembrane proteins that are involved in the induction of type I interferon that is important in the innate immune response. The stimulation of STING has been an active area of research in the treatment of cancer and infectious diseases. Here is the latest research on STING receptor agonists.

Microbicide

Microbicides are products that can be applied to vaginal or rectal mucosal surfaces with the goal of preventing, or at least significantly reducing, the transmission of sexually transmitted infections. Here is the latest research on microbicides.