Deep Learning for Voice Gender Identification: Proof-of-concept for Gender-Affirming Voice Care.

The Laryngoscope
Yael BensoussanMichael Johns

Abstract

The need for gender-affirming voice care has been increasing in the transgender population in the last decade. Currently, objective treatment outcome measurements are lacking to assess the success of these interventions. This study uses neural network models to predict binary gender from short audio samples of "male" and "female" voices. This preliminary work is a proof-of-concept for further work to develop an AI-assisted treatment outcome measure for gender-affirming voice care. Retrospective cohort study. Two hundred seventy-eight voices from male and female speakers from the Perceptual Voice Qualities Database were used to train a deep neural network to classify voices as male or female. Each audio sample was mapped to the frequency domain using Mel spectrograms. To optimize model performance, we performed 10-fold cross validation of the entire dataset. The dataset was split into 80% training, 10% validation, and 10% test. Overall accuracy of 92% was obtained, both when considering the accuracy per spectrum and per patient metric. The accuracy of the model was higher for recognizing female voices (F1 score of 0.94) compared to male voices (F1 score of 0.87). This proof of concept study shows promising performance for furthe...Continue Reading

References

Oct 1, 1991·The Journal of the Acoustical Society of America·D G Childers, K Wu
Aug 11, 2004·Trends in Cognitive Sciences·Pascal BelinCatherine Bédard
Jul 13, 2006·The Journal of Laryngology and Otology·E J M McNeill
Nov 1, 2006·Journal of Voice : Official Journal of the Voice Foundation·Roland LinderRainer Schönweiler
Feb 22, 2012·Frontiers in Psychology·Cyril R Pernet, Pascal Belin
Oct 8, 2013·Journal of Voice : Official Journal of the Voice Foundation·Adrienne HancockFiacre Douglas
Aug 29, 2017·Journal of Voice : Official Journal of the Voice Foundation·Sally J K GallenaEmily Stickels
Oct 27, 2017·Archives of Sexual Behavior·Seth O WattNicholas O Rule
Mar 24, 2018·Journal of Voice : Official Journal of the Voice Foundation·Shih-Hau FangChi-Te Wang
May 5, 2018·Artificial Intelligence in Medicine·Dalal BardouSayed Mohammad Ahmad
Nov 18, 2018·Conference Proceedings : ... Annual International Conference of the IEEE Engineering in Medicine and Biology Society·Huiyi WuGaetano Di Caterina
Feb 2, 2019·The Laryngoscope·Matthew G CrowsonTimothy C Y Chan
Feb 6, 2019·Otolaryngology--head and Neck Surgery : Official Journal of American Academy of Otolaryngology-Head and Neck Surgery·Andrés M BurJacob New
Jun 27, 2019·Laryngoscope Investigative Otolaryngology·Maria E PowellAlexander Gelbard

❮ Previous
Next ❯

Related Concepts

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Blastomycosis

Blastomycosis fungal infections spread through inhaling Blastomyces dermatitidis spores. Discover the latest research on blastomycosis fungal infections here.

Nuclear Pore Complex in ALS/FTD

Alterations in nucleocytoplasmic transport, controlled by the nuclear pore complex, may be involved in the pathomechanism underlying multiple neurodegenerative diseases including Amyotrophic Lateral Sclerosis and Frontotemporal Dementia. Here is the latest research on the nuclear pore complex in ALS and FTD.

Applications of Molecular Barcoding

The concept of molecular barcoding is that each original DNA or RNA molecule is attached to a unique sequence barcode. Sequence reads having different barcodes represent different original molecules, while sequence reads having the same barcode are results of PCR duplication from one original molecule. Discover the latest research on molecular barcoding here.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Evolution of Pluripotency

Pluripotency refers to the ability of a cell to develop into three primary germ cell layers of the embryo. This feed focuses on the mechanisms that underlie the evolution of pluripotency. Here is the latest research.

Position Effect Variegation

Position Effect Variagation occurs when a gene is inactivated due to its positioning near heterochromatic regions within a chromosome. Discover the latest research on Position Effect Variagation here.

STING Receptor Agonists

Stimulator of IFN genes (STING) are a group of transmembrane proteins that are involved in the induction of type I interferon that is important in the innate immune response. The stimulation of STING has been an active area of research in the treatment of cancer and infectious diseases. Here is the latest research on STING receptor agonists.

Microbicide

Microbicides are products that can be applied to vaginal or rectal mucosal surfaces with the goal of preventing, or at least significantly reducing, the transmission of sexually transmitted infections. Here is the latest research on microbicides.

Related Papers

Seminars in Speech and Language
Anita McAllister, Peta Sjölander
Archives of Women's Mental Health
Stefanie Suessenbacher-KesslerMichaela Amering
© 2021 Meta ULC. All rights reserved