Clinical Named Entity Recognition from Chinese Electronic Medical Records Based on Deep Learning Pretraining

Joint Authors

Zhang, Zhifei
Chen, Shiqi
Gong, Lejun

Source

Journal of Healthcare Engineering

Issue

Vol. 2020, Issue 2020 (31 Dec. 2020), pp.1-8, 8 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2020-11-24

Country of Publication

Egypt

No. of Pages

8

Main Subjects

Public Health
Medicine

Abstract EN

Background.

Clinical named entity recognition is a basic task in mining the text of electronic medical records. Chinese electronic medical records pose particular challenges: the text contains many compound entities, sentence components are frequently missing, and entity boundaries are unclear.

Moreover, the corpus of Chinese electronic medical records is difficult to obtain.

Methods.

To address these characteristics of Chinese electronic medical records, this study proposed a Chinese clinical entity recognition model based on deep learning pretraining.

The model used word embeddings trained on a domain corpus and fine-tuned an entity recognition model that had been pretrained on a related corpus.

BiLSTM and Transformer were then each used as the feature extractor to identify four types of clinical entities (diseases, symptoms, drugs, and operations) in the text of Chinese electronic medical records.
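
As an illustration of what identifying the four entity types produces, the following minimal sketch decodes a per-character tag sequence into entity spans. It assumes a character-level BIO tagging scheme, which the abstract does not state; the tag names (DIS, SYM, DRUG, OP), the function name, and the example sentence are hypothetical and are not taken from the paper.

```python
# Hypothetical label names for the four entity types named in the abstract.
ENTITY_LABELS = ("DIS", "SYM", "DRUG", "OP")


def extract_entities(chars, tags):
    """Decode BIO tags into (text, label, start, end) tuples; end is exclusive."""
    entities = []
    start, label = None, None
    # A trailing "O" sentinel closes any entity still open at the end.
    for i, tag in enumerate(list(tags) + ["O"]):
        continues = tag.startswith("I-") and tag[2:] == label
        if start is not None and not continues:
            entities.append(("".join(chars[start:i]), label, start, i))
            start, label = None, None
        if tag.startswith("B-"):
            start, label = i, tag[2:]
    return entities


# Illustrative sentence (not from the paper's corpus):
# "The patient has diabetes and takes metformin."
chars = list("患者有糖尿病，服用二甲双胍")
tags = ["O", "O", "O", "B-DIS", "I-DIS", "I-DIS", "O",
        "O", "O", "B-DRUG", "I-DRUG", "I-DRUG", "I-DRUG"]
entities = extract_entities(chars, tags)
# entities == [("糖尿病", "DIS", 3, 6), ("二甲双胍", "DRUG", 9, 13)]
```

In this sketch an I- tag with no preceding B- tag of the same type is ignored, which is one common convention and not necessarily the one the authors used.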

Results.

On the test dataset, the model achieved 75.06% Macro-P, 76.40% Macro-R, and 75.72% Macro-F1.

These experiments show that the Chinese clinical entity recognition model based on deep learning pretraining can effectively improve the recognition effect.
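
For readers unfamiliar with the reported metrics, the following is a minimal sketch of macro-averaged precision, recall, and F1 over entity types. The abstract does not say how Macro-F1 was computed; this sketch assumes exact-span matching and takes Macro-F1 as the harmonic mean of Macro-P and Macro-R, a definition with which the reported figures happen to be consistent. The function names and type labels are hypothetical.

```python
from collections import Counter

# Hypothetical labels for the four entity types named in the abstract.
ENTITY_TYPES = ("disease", "symptom", "drug", "operation")


def harmonic_mean(p, r):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    return 2 * p * r / (p + r) if p + r else 0.0


def macro_scores(gold, pred, types=ENTITY_TYPES):
    """gold and pred are sets of (start, end, type) spans.

    A predicted span counts as correct only if it matches a gold span exactly.
    """
    n_gold, n_pred, n_correct = Counter(), Counter(), Counter()
    for span in gold:
        n_gold[span[2]] += 1
    for span in pred:
        n_pred[span[2]] += 1
        if span in gold:
            n_correct[span[2]] += 1
    # Per-type precision and recall, then an unweighted mean over types.
    precisions = [n_correct[t] / n_pred[t] if n_pred[t] else 0.0 for t in types]
    recalls = [n_correct[t] / n_gold[t] if n_gold[t] else 0.0 for t in types]
    macro_p = sum(precisions) / len(types)
    macro_r = sum(recalls) / len(types)
    return macro_p, macro_r, harmonic_mean(macro_p, macro_r)


# Consistency check against the figures reported in the abstract.
reported_f1 = round(harmonic_mean(75.06, 76.40), 2)  # 75.72
```

An alternative definition averages the per-type F1 scores instead; the two generally give different values, so the choice should be checked against the full paper.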

Conclusions.

These experiments show that the proposed Chinese clinical entity recognition model based on deep learning pretraining can effectively improve the recognition performance.

American Psychological Association (APA)

Gong, Lejun & Zhang, Zhifei & Chen, Shiqi. 2020. Clinical Named Entity Recognition from Chinese Electronic Medical Records Based on Deep Learning Pretraining. Journal of Healthcare Engineering, Vol. 2020, no. 2020, pp. 1-8.
https://search.emarefa.net/detail/BIM-1186481

Modern Language Association (MLA)

Gong, Lejun…[et al.]. Clinical Named Entity Recognition from Chinese Electronic Medical Records Based on Deep Learning Pretraining. Journal of Healthcare Engineering No. 2020 (2020), pp.1-8.
https://search.emarefa.net/detail/BIM-1186481

American Medical Association (AMA)

Gong, Lejun & Zhang, Zhifei & Chen, Shiqi. Clinical Named Entity Recognition from Chinese Electronic Medical Records Based on Deep Learning Pretraining. Journal of Healthcare Engineering. 2020. Vol. 2020, no. 2020, pp. 1-8.
https://search.emarefa.net/detail/BIM-1186481

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1186481