Extracting cancer mortality statistics from death certificates: A hybrid machine learning and rule-based approach for common and rare cancers

Artificial Intelligence in Medicine
Bevan KoopmanNarelle Grayson

Abstract

Death certificates are an invaluable source of cancer mortality statistics. However, this value can only be realised if accurate, quantitative data can be extracted from certificates-an aim hampered by both the volume and variable quality of certificates written in natural language. This paper proposes an automatic classification system for identifying all cancer related causes of death from death certificates. Detailed features, including terms, n-grams and SNOMED CT concepts were extracted from a collection of 447,336 death certificates. The features were used as input to two different classification sub-systems: a machine learning sub-system using Support Vector Machines (SVMs) and a rule-based sub-system. A fusion sub-system then combines the results from SVMs and rules into a single final classification. A held-out test set was used to evaluate the effectiveness of the classifiers according to precision, recall and F-measure. The system was highly effective at determining the type of cancers for both common cancers (F-measure of 0.85) and rare cancers (F-measure of 0.7). In general, rules performed superior to SVMs; however, the fusion method that combined the two was the most effective. The system proposed in this study p...Continue Reading

Citations

Jun 22, 2021·Journal of the American Medical Informatics Association : JAMIA·Eunsuk Chang, Javed Mostafa

❮ Previous
Next ❯

Related Concepts

Related Feeds

Bioinformatics in Biomedicine

Bioinformatics in biomedicine incorporates computer science, biology, chemistry, medicine, mathematics and statistics. Discover the latest research on bioinformatics in biomedicine here.

Related Papers

International Journal of Medical Informatics
Bevan KoopmanNarelle Grayson
Computerized Medical Imaging and Graphics : the Official Journal of the Computerized Medical Imaging Society
Md Mahmudur RahmanPrabir Bhattacharya
Neural Networks : the Official Journal of the International Neural Network Society
Sangwook KimMinho Lee
© 2022 Meta ULC. All rights reserved