Text Matching and Categorization: Mining Implicit Semantic Knowledge from Tree-Shape Structures

Joint Authors

Guo, Lin
Zuo, Wanli
Peng, Tao
Yue, Lin

Source

Mathematical Problems in Engineering

Issue

Vol. 2015, Issue 2015 (31 Dec. 2015), pp.1-9, 9 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2015-09-28

Country of Publication

Egypt

No. of Pages

9

Main Subjects

Civil Engineering

Abstract EN

The diversities of large-scale semistructured data make the extraction of implicit semantic information have enormous difficulties.

This paper proposes an automatic and unsupervised method of text categorization, in which tree-shape structures are used to represent semantic knowledge and to explore implicit information by mining hidden structures without cumbersome lexical analysis.

Mining implicit frequent structures in trees can discover both direct and indirect semantic relations, which largely enhances the accuracy of matching and classifying texts.

The experimental results show that the proposed algorithm remarkably reduces the time and effort spent in training and classifying, which outperforms established competitors in correctness and effectiveness.

American Psychological Association (APA)

Guo, Lin& Zuo, Wanli& Peng, Tao& Yue, Lin. 2015. Text Matching and Categorization: Mining Implicit Semantic Knowledge from Tree-Shape Structures. Mathematical Problems in Engineering،Vol. 2015, no. 2015, pp.1-9.
https://search.emarefa.net/detail/BIM-1074581

Modern Language Association (MLA)

Guo, Lin…[et al.]. Text Matching and Categorization: Mining Implicit Semantic Knowledge from Tree-Shape Structures. Mathematical Problems in Engineering No. 2015 (2015), pp.1-9.
https://search.emarefa.net/detail/BIM-1074581

American Medical Association (AMA)

Guo, Lin& Zuo, Wanli& Peng, Tao& Yue, Lin. Text Matching and Categorization: Mining Implicit Semantic Knowledge from Tree-Shape Structures. Mathematical Problems in Engineering. 2015. Vol. 2015, no. 2015, pp.1-9.
https://search.emarefa.net/detail/BIM-1074581

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1074581