Semantic Classification of Diseases in Discharge Summaries Using a Context-aware Rule-based Classifier

dc.contributor.authorSolt, Illes
dc.contributor.authorTikk, Domonkos
dc.contributor.authorGal, Viktor
dc.contributor.authorKardkovacs, Zsolt T.
dc.date.accessioned2015-12-10T22:55:17Z
dc.date.available2015-12-10T22:55:17Z
dc.date.issued2009
dc.date.updated2016-02-24T10:36:07Z
dc.description.abstractObjective: Automated and disease-specific classification of textual clinical discharge summaries is of great importance in human life science, as it helps physicians to make medical studies by providing statistically relevant data for analysis. This can be further facilitated if, at the labeling of discharge summaries, semantic labels are also extracted from text, such as whether a given disease is present, absent, questionable in a patient, or is unmentioned in the document. The authors present a classification technique that successfully solves the semantic classification task. Design: The authors introduce a context-aware rule-based semantic classification technique for use on clinical discharge summaries. The classification is performed in subsequent steps. First, some misleading parts are removed from the text; then the text is partitioned into positive, negative, and uncertain context segments, then a sequence of binary classifiers is applied to assign the appropriate semantic labels. Measurement: For evaluation the authors used the documents of the i2b2 Obesity Challenge and adopted its evaluation measures: F1-macro and F1-micro for measurements. Results: On the two subtasks of the Obesity Challenge (textual and intuitive classification) the system performed very well, and achieved a F1-macro = 0.80 for the textual and F1-macro = 0.67 for the intuitive tasks, and obtained second place at the textual and first place at the intuitive subtasks of the challenge. Conclusions: The authors show in the paper that a simple rule-based classifier can tackle the semantic classification task more successfully than machine learning techniques, if the training data are limited and some semantic labels are very sparse.
dc.identifier.issn1527-974X
dc.identifier.urihttp://hdl.handle.net/1885/60037
dc.publisherAmerican Medical Informatics Association
dc.sourceJournal of the American Medical Informatics Association
dc.subjectKeywords: article; automation; comorbidity; controlled study; disease classification; hospital discharge; hospital information system; obesity; semantics; Artificial Intelligence; Classification; Comorbidity; Disease; Humans; Natural Language Processing; Obesity; P
dc.titleSemantic Classification of Diseases in Discharge Summaries Using a Context-aware Rule-based Classifier
dc.typeJournal article
local.bibliographicCitation.issue4
local.contributor.affiliationSolt, Illes, Budapest University of Technology and Economics
local.contributor.affiliationTikk, Domonkos, Budapest University of Technology and Economics
local.contributor.affiliationGal, Viktor, College of Engineering and Computer Science, ANU
local.contributor.affiliationKardkovacs, Zsolt T., Budapest University of Technology and Economics
local.contributor.authoremailrepository.admin@anu.edu.au
local.contributor.authoruidGal, Viktor, u4603344
local.description.notesImported from ARIES
local.identifier.absfor110399 - Clinical Sciences not elsewhere classified
local.identifier.ariespublicationU4105084xPUB519
local.identifier.citationvolume16
local.identifier.doi10.1197/jamia.M3087
local.identifier.scopusID2-s2.0-67649359351
local.identifier.uidSubmittedByU4105084
local.type.statusPublished Version

Downloads