Local-learning-based feature selection for high-dimensional data analysis.

IEEE Transactions on Pattern Analysis and Machine Intelligence
Yijun Sun, Steve Goodison

Abstract

This paper considers feature selection for data classification in the presence of a huge number of irrelevant features. We propose a new feature-selection algorithm that addresses several major issues with prior work, including problems with algorithm implementation, computational complexity, and solution accuracy. The key idea is to decompose an arbitrarily complex nonlinear problem into a set of locally linear ones through local learning, and then learn feature relevance globally within the large margin framework. The proposed algorithm is based on well-established machine learning and numerical analysis techniques, without making any assumptions about the underlying data distribution. It is capable of processing many thousands of features within minutes on a personal computer while maintaining a very high accuracy that is nearly insensitive to a growing number of irrelevant features. Theoretical analyses of the algorithm's sample complexity suggest that the algorithm has a logarithmical sample complexity with respect to the number of features. Experiments on 11 synthetic and real-world data sets demonstrate the viability of our formulation of the feature-selection problem for supervised learning and the effectiveness of our ...
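
The abstract's key idea (local linearization through nearest neighbors, followed by a global large-margin fit of feature weights) can be illustrated with a minimal sketch. The abstract gives no equations, so everything specific here is an assumption: the RELIEF-style margin (distance to nearby other-class samples minus distance to nearby same-class samples), the exponential kernel with width sigma, the logistic loss with a sparsity penalty optimized through w = v**2, and all function names, parameters, and the toy data. It assumes every class has at least two samples.

```python
import numpy as np

def local_margin_vectors(X, y, w, sigma=1.0):
    """Per-sample margin vectors (assumed form): expected |x - miss| minus expected |x - hit|."""
    diffs = np.abs(X[:, None, :] - X[None, :, :])   # (n, n, d) per-feature distances
    dist = diffs @ w                                # (n, n) weighted L1 distances
    np.fill_diagonal(dist, np.inf)                  # a sample is not its own neighbor
    # Shift by the row minimum for numerical stability; it cancels on normalization.
    kernel = np.exp(-(dist - dist.min(axis=1, keepdims=True)) / sigma)
    same = y[:, None] == y[None, :]
    hit = kernel * same                             # soft nearest same-class neighbors
    miss = kernel * ~same                           # soft nearest other-class neighbors
    hit /= hit.sum(axis=1, keepdims=True)
    miss /= miss.sum(axis=1, keepdims=True)
    return np.einsum("ij,ijd->id", miss, diffs) - np.einsum("ij,ijd->id", hit, diffs)

def update_weights(z, w, lam=0.1, lr=0.1, steps=200):
    """Gradient descent on mean logistic loss of the margins plus lam * sum(v**2), with w = v**2."""
    v = np.sqrt(w)
    for _ in range(steps):
        margin = z @ (v ** 2)
        s = 1.0 / (1.0 + np.exp(np.clip(margin, -50.0, 50.0)))  # sigmoid(-margin)
        grad = 2.0 * v * (lam - (s @ z) / len(z))
        v = v - lr * grad
    return v ** 2                                   # nonnegative by construction

def local_learning_feature_weights(X, y, sigma=1.0, lam=0.1, outer_iters=5):
    """Alternate between recomputing local margins and refitting global feature weights."""
    w = np.ones(X.shape[1])
    for _ in range(outer_iters):
        z = local_margin_vectors(X, y, w, sigma)
        w = update_weights(z, w, lam)
    return w

def make_toy_data(n=100, n_noise=20, seed=0):
    """Hypothetical toy set: features 0 and 1 separate the classes, the rest are pure noise."""
    rng = np.random.default_rng(seed)
    y = np.repeat([0, 1], n // 2)
    X = rng.normal(size=(n, 2 + n_noise))
    X[:, :2] += 4.0 * y[:, None]
    return X, y
```

Calling `local_learning_feature_weights(*make_toy_data())` returns one nonnegative weight per feature; ranking features by weight gives the selection. The alternation is the point of the sketch: neighbors are found under the current weights (the local step), and the weights are then refit against all margin vectors at once (the global step).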
