Improving Classification of Protein Interaction Articles Using Context Similarity-Based Feature Selection

Joint Authors

Han, Bing Qing
Chen, Yifei
Sun, Yuxing

Source

BioMed Research International

Issue

Vol. 2015, Issue 2015 (31 Dec. 2015), pp. 1-10, 10 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2015-08-03

Country of Publication

Egypt

No. of Pages

10

Main Subjects

Medicine

Abstract EN

Protein interaction article classification is a text classification task in the biological domain to determine which articles describe protein-protein interactions.

Since the feature space in text classification is high-dimensional, feature selection is widely used for reducing the dimensionality of features to speed up computation without sacrificing classification performance.

Many existing feature selection methods are based on statistical measures of document frequency and term frequency.

One potential drawback of these methods is that they treat features separately.
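As a rough illustration of the frequency-based approach described above, the following minimal sketch ranks terms by document frequency alone. The function names and the toy corpus are hypothetical and are not taken from the article; the point is only that each term is scored independently of the words around it.

```python
from collections import Counter


def document_frequency(docs):
    """Count, for each term, the number of documents that contain it."""
    df = Counter()
    for tokens in docs:
        # set() ensures a term is counted once per document
        df.update(set(tokens))
    return df


def select_top_k(docs, k):
    """Keep the k terms with the highest document frequency.

    Ties are broken alphabetically so the result is deterministic.
    Each term is scored on its own; its context plays no role.
    """
    df = document_frequency(docs)
    ranked = sorted(df.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


# Hypothetical toy corpus of tokenized documents
docs = [
    ["protein", "binds", "kinase"],
    ["protein", "interacts", "kinase"],
    ["gene", "expression", "protein"],
]
selected = select_top_k(docs, 2)
```

Here `selected` holds the two most widespread terms, regardless of whether they appear in contexts that actually signal a protein-protein interaction.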

To address this, we first design a similarity measure over context information that takes word co-occurrences and phrase chunks around the features into account.

We then incorporate this context similarity into the importance measure of the features, in place of document and term frequency.

The result is a set of new context similarity-based feature selection methods.
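The abstract does not specify how context similarity is computed, so the sketch below is only one plausible reading and not the authors' measure. It assumes a fixed-size word window around each occurrence of a term and compares two contexts by cosine similarity over co-occurrence counts; phrase chunks are not modeled. All names (`context_vector`, `cosine`, the `window` parameter) and the example sentences are hypothetical.

```python
import math
from collections import Counter


def context_vector(tokens, term, window=2):
    """Count the words within `window` positions of each occurrence of `term`."""
    ctx = Counter()
    for i, tok in enumerate(tokens):
        if tok == term:
            start = max(0, i - window)
            ctx.update(tokens[start:i])               # left context
            ctx.update(tokens[i + 1:i + 1 + window])  # right context
    return ctx


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical tokenized sentences containing the candidate feature "binds"
s1 = ["protein", "a", "binds", "protein", "b"]
s2 = ["protein", "c", "binds", "protein", "d"]
s3 = ["the", "book", "binds", "loose", "pages"]

sim_related = cosine(context_vector(s1, "binds"), context_vector(s2, "binds"))
sim_unrelated = cosine(context_vector(s1, "binds"), context_vector(s3, "binds"))
```

Under these assumptions, the two interaction-like sentences share context words and receive a higher similarity than the unrelated one, which is the kind of signal a plain frequency count cannot provide.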

Their performance is evaluated on two protein interaction article collections and compared against the frequency-based methods.

The experimental results reveal that the context similarity-based methods perform better in terms of the F1 measure and the dimension reduction rate.
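For readers unfamiliar with the two evaluation criteria, here is a small sketch. The F1 measure is the standard harmonic mean of precision and recall. The abstract does not define the dimension reduction rate, so the formula used here (the fraction of original features discarded) is an assumption, and the numbers are made up for illustration.

```python
def f1_score(tp, fp, fn):
    """Harmonic mean of precision and recall from raw counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def reduction_rate(n_original, n_selected):
    """Assumed definition: share of the original features that were removed."""
    return 1 - n_selected / n_original


# Hypothetical counts, not results from the article
f1 = f1_score(tp=80, fp=20, fn=20)
rate = reduction_rate(n_original=10000, n_selected=500)
```

A method is preferable on these criteria when it keeps F1 high while pushing the reduction rate up, that is, when it reaches comparable accuracy with fewer features.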

Benefiting from the context information surrounding the features, the proposed methods can select distinctive features effectively for protein interaction article classification.

American Psychological Association (APA)

Chen, Yifei & Sun, Yuxing & Han, Bing Qing. 2015. Improving Classification of Protein Interaction Articles Using Context Similarity-Based Feature Selection. BioMed Research International, Vol. 2015, no. 2015, pp. 1-10.
https://search.emarefa.net/detail/BIM-1056623

Modern Language Association (MLA)

Chen, Yifei, et al. Improving Classification of Protein Interaction Articles Using Context Similarity-Based Feature Selection. BioMed Research International No. 2015 (2015), pp. 1-10.
https://search.emarefa.net/detail/BIM-1056623

American Medical Association (AMA)

Chen, Yifei & Sun, Yuxing & Han, Bing Qing. Improving Classification of Protein Interaction Articles Using Context Similarity-Based Feature Selection. BioMed Research International. 2015. Vol. 2015, no. 2015, pp. 1-10.
https://search.emarefa.net/detail/BIM-1056623

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1056623