Mixture Model Tests Of Hierarchical Clustering Algorithms: The Problem Of Classifying Everybody

Multivariate Behavioral Research
C Edelbrock

Abstract

Due to the effects of outliers, mixture model tests that require all objects to be classified can severely underestimate the accuracy of hierarchical clustering algorithms. More valid and relevant comparisons between algorithms can be made by calculating accuracy at several levels in the hierarchical tree and considering accuracy as a function of the coverage of the classification. Using this procedure, several algorithms were compared on their ability to resolve ten multivariate normal mixtures. All of the algorithms were significantly more accurate than a random linkage algorithm, and accuracy was inversely related to coverage. Algorithms using correlation as the similarity measure were significantly more accurate than those using Euclidean distance (p < .001). A subset of high accuracy algorithms, including single, average, and centroid linkage using correlation, and Ward's minimum variance technique, was identified.

Citations

May 1, 1987·Psychological Medicine·W M GroveT Reich
Jan 13, 2009·Public Health Nutrition·Jane A Pryer, Stephen Rogers
Apr 1, 1988·Multivariate Behavioral Research·R M DregerR L Lemoine
Oct 1, 1983·Multivariate Behavioral Research·F H Borgen
Jun 27, 1998·The American Journal of Drug and Alcohol Abuse·M L WilliamsN L Weatherby
Jan 2, 2016·Behavior Research Methods·Emilie ShiremanMichael J Brusco
Jul 1, 1991·Multivariate Behavioral Research·K Schweizer
Jul 1, 1992·Multivariate Behavioral Research·L BelbinG W Milligan
Jun 1, 2006·Multivariate Behavioral Research·Saskia de CraenWillem J Heiser
Oct 1, 1988·Multivariate Behavioral Research·L R Bergman
Oct 1, 1986·Multivariate Behavioral Research·G W Milligan, M C Cooper
Apr 1, 1997·Multivariate Behavioral Research·C J HubertyR W Kamphaus
Oct 22, 2005·British Journal of Health Psychology·Jane ClatworthyRobert Horne
Jul 15, 2005·Scandinavian Journal of Psychology·Jolanta Sondaite, Rita Zukauskiene
Oct 1, 1988·British Journal of Addiction·D C Hodgins, L O Lightfoot
Jan 23, 2002·American Journal of Medical Genetics·L E ScuttA S Bassett
Nov 1, 1994·Journal of Clinical Psychology·G D ZimetT Zimmerman
Dec 1, 1988·Journal of Autism and Developmental Disorders·L Rescorla
Sep 1, 1982·British Journal of Addiction·H A Skinner
Aug 13, 1999·Breast Cancer Research and Treatment·T F Hack, L F Degner
Jul 1, 1983·Multivariate Behavioral Research·L C MoreyH A Skinner
Mar 11, 2011·Perspectives on Sexual and Reproductive Health·Angela R DempseyCarolyn L Westhoff

❮ Previous
Next ❯

Related Concepts

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Blastomycosis

Blastomycosis fungal infections spread through inhaling Blastomyces dermatitidis spores. Discover the latest research on blastomycosis fungal infections here.

Nuclear Pore Complex in ALS/FTD

Alterations in nucleocytoplasmic transport, controlled by the nuclear pore complex, may be involved in the pathomechanism underlying multiple neurodegenerative diseases including Amyotrophic Lateral Sclerosis and Frontotemporal Dementia. Here is the latest research on the nuclear pore complex in ALS and FTD.

Applications of Molecular Barcoding

The concept of molecular barcoding is that each original DNA or RNA molecule is attached to a unique sequence barcode. Sequence reads having different barcodes represent different original molecules, while sequence reads having the same barcode are results of PCR duplication from one original molecule. Discover the latest research on molecular barcoding here.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Evolution of Pluripotency

Pluripotency refers to the ability of a cell to develop into three primary germ cell layers of the embryo. This feed focuses on the mechanisms that underlie the evolution of pluripotency. Here is the latest research.

Position Effect Variegation

Position Effect Variagation occurs when a gene is inactivated due to its positioning near heterochromatic regions within a chromosome. Discover the latest research on Position Effect Variagation here.

STING Receptor Agonists

Stimulator of IFN genes (STING) are a group of transmembrane proteins that are involved in the induction of type I interferon that is important in the innate immune response. The stimulation of STING has been an active area of research in the treatment of cancer and infectious diseases. Here is the latest research on STING receptor agonists.

Microbicide

Microbicides are products that can be applied to vaginal or rectal mucosal surfaces with the goal of preventing, or at least significantly reducing, the transmission of sexually transmitted infections. Here is the latest research on microbicides.