Adapting state-of-the-art deep language models to clinical information extraction systems: Potentials, challenges, and solutions

Zhou, Liyuan; Suominen, Hanna; Gedeon, Tom

dc.contributor.author	Zhou, Liyuan
dc.contributor.author	Suominen, Hanna
dc.contributor.author	Gedeon, Tom
dc.date.accessioned	2023-12-11T00:09:21Z
dc.date.available	2023-12-11T00:09:21Z
dc.date.issued	2019
dc.date.updated	2022-09-04T08:17:16Z
dc.description.abstract	Background: Deep learning (DL) has been widely used to solve problems with success in speech recognition, visual object recognition, and object detection for drug discovery and genomics. Natural language processing has achieved noticeable progress in artificial intelligence. This gives an opportunity to improve on the accuracy and human-computer interaction of clinical informatics. However, due to difference of vocabularies and context between a clinical environment and generic English, transplanting language models directly from up-to-date methods to real-world health care settings is not always satisfactory. Moreover, the legal restriction on using privacy-sensitive patient records hinders the progress in applying machine learning (ML) to clinical language processing. Objective: The aim of this study was to investigate 2 ways to adapt state-of-the-art language models to extracting patient information from free-form clinical narratives to populate a handover form at a nursing shift change automatically for proofing and revising by hand: first, by using domain-specific word representations and second, by using transfer learning models to adapt knowledge from general to clinical English. We have described the practical problem, composed it as an ML task known as information extraction, proposed methods for solving the task, and evaluated their performance. Methods: First, word representations trained from different domains served as the input of a DL system for information extraction. Second, the transfer learning model was applied as a way to adapt the knowledge learned from general text sources to the task domain. The goal was to gain improvements in the extraction performance, especially for the classes that were topically related but did not have a sufficient amount of model solutions available for ML directly from the target domain. 
A total of 3 independent datasets were generated for this task, and they were used as the training (101 patient reports), validation (100 patient reports), and test (100 patient reports) sets in our experiments. Results: Our system is now the state-of-the-art in this task. Domain-specific word representations improved the macroaveraged F1 by 3.4%. Transferring the knowledge from general English corpora to the task-specific domain contributed a further 7.1% improvement. The best performance in populating the handover form with 37 headings was the macroaveraged F1 of 41.6% and F1 of 81.1% for filtering out irrelevant information. Performance differences between this system and its baseline were statistically significant (P<.001; Wilcoxon test). Conclusions: To our knowledge, our study is the first attempt to transfer models from general deep models to specific tasks in health care and gain a significant improvement. As transfer learning shows its advantage over other methods, especially on classes with a limited amount of training data, less experts’ time is needed to annotate data for ML, which may enable good results even in resource-poor domains.	en_AU
dc.description.sponsorship	This work was supported by the Commonwealth Department of Education and Training (The Australian National University Australian Postgraduate Award).	en_AU
dc.format.mimetype	application/pdf	en_AU
dc.identifier.issn	2291-9694	en_AU
dc.identifier.uri	http://hdl.handle.net/1885/309733
dc.language.iso	en_AU	en_AU
dc.provenance	This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.	en_AU
dc.publisher	JMIR Publications Inc	en_AU
dc.rights	©Liyuan Zhou, Hanna Suominen, Tom Gedeon. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 25.04.2019.	en_AU
dc.rights.license	Creative Commons Attribution 4.0 International License	en_AU
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/	en_AU
dc.source	JMIR Medical Informatics	en_AU
dc.subject	computer systems	en_AU
dc.subject	artificial intelligence	en_AU
dc.subject	deep learning	en_AU
dc.subject	information storage and retrieval	en_AU
dc.subject	medical informatics	en_AU
dc.subject	nursing records	en_AU
dc.subject	patient handoff	en_AU
dc.title	Adapting state-of-the-art deep language models to clinical information extraction systems: Potentials, challenges, and solutions	en_AU
dc.type	Journal article	en_AU
dcterms.accessRights	Open Access	en_AU
local.bibliographicCitation.issue	2	en_AU
local.bibliographicCitation.lastpage	15	en_AU
local.bibliographicCitation.startpage	1	en_AU
local.contributor.affiliation	Zhou, Liyuan, College of Engineering and Computer Science, ANU	en_AU
local.contributor.affiliation	Suominen, Hanna, College of Engineering and Computer Science, ANU	en_AU
local.contributor.affiliation	Gedeon, Tom, College of Engineering and Computer Science, ANU	en_AU
local.contributor.authoruid	Zhou, Liyuan, u4978108	en_AU
local.contributor.authoruid	Suominen, Hanna, u4872279	en_AU
local.contributor.authoruid	Gedeon, Tom, u4088783	en_AU
local.description.notes	Imported from ARIES	en_AU
local.identifier.absfor	460208 - Natural language processing	en_AU
local.identifier.absfor	460102 - Applications in health	en_AU
local.identifier.absfor	461103 - Deep learning	en_AU
local.identifier.ariespublication	u3102795xPUB3513	en_AU
local.identifier.citationvolume	7	en_AU
local.identifier.doi	10.2196/11499	en_AU
local.identifier.scopusID	2-s2.0-85067314500
local.identifier.thomsonID	WOS:000473777800006
local.publisher.url	https://medinform.jmir.org/	en_AU
local.type.status	Published Version	en_AU

Downloads

Original bundle

Name: Adapting State-of-the-Art Deep Language Models.pdf
Size: 635.37 KB
Format: Adobe Portable Document Format

Collections

ANU Research Publications