Feature Engineering for Drug Name Recognition in Biomedical Texts: Feature Conjunction and Feature Selection

المؤلفون المشاركون

Wang, Xiaolong
Liu, Shengyu
Chen, Qingcai
Fan, Xiaoming
Tang, Buzhou

المصدر

Computational and Mathematical Methods in Medicine

العدد

المجلد 2015، العدد 2015 (31 ديسمبر/كانون الأول 2015)، ص ص. 1-9، 9ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2015-03-12

دولة النشر

مصر

عدد الصفحات

9

التخصصات الرئيسية

الطب البشري

الملخص EN

Drug name recognition (DNR) is a critical step for drug information extraction.

Machine learning-based methods have been widely used for DNR with various types of features such as part-of-speech, word shape, and dictionary feature.

Features used in current machine learning-based methods are usually singleton features which may be due to explosive features and a large number of noisy features when singleton features are combined into conjunction features.

However, singleton features that can only capture one linguistic characteristic of a word are not sufficient to describe the information for DNR when multiple characteristics should be considered.

In this study, we explore feature conjunction and feature selection for DNR, which have never been reported.

We intuitively select 8 types of singleton features and combine them into conjunction features in two ways.

Then, Chi-square, mutual information, and information gain are used to mine effective features.

Experimental results show that feature conjunction and feature selection can improve the performance of the DNR system with a moderate number of features and our DNR system significantly outperforms the best system in the DDIExtraction 2013 challenge.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Liu, Shengyu& Tang, Buzhou& Chen, Qingcai& Wang, Xiaolong& Fan, Xiaoming. 2015. Feature Engineering for Drug Name Recognition in Biomedical Texts: Feature Conjunction and Feature Selection. Computational and Mathematical Methods in Medicine،Vol. 2015, no. 2015, pp.1-9.
https://search.emarefa.net/detail/BIM-1058036

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Liu, Shengyu…[et al.]. Feature Engineering for Drug Name Recognition in Biomedical Texts: Feature Conjunction and Feature Selection. Computational and Mathematical Methods in Medicine No. 2015 (2015), pp.1-9.
https://search.emarefa.net/detail/BIM-1058036

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Liu, Shengyu& Tang, Buzhou& Chen, Qingcai& Wang, Xiaolong& Fan, Xiaoming. Feature Engineering for Drug Name Recognition in Biomedical Texts: Feature Conjunction and Feature Selection. Computational and Mathematical Methods in Medicine. 2015. Vol. 2015, no. 2015, pp.1-9.
https://search.emarefa.net/detail/BIM-1058036

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1058036