Validation of Text Data Preprocessing Using a Neural Network Model

Joint Authors

Lee, WonGyu
Woo, HoSung
Kim, JaMee

Source

Mathematical Problems in Engineering

Issue

Vol. 2020, Issue 2020 (31 Dec. 2020), pp. 1-9, 9 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2020-05-14

Country of Publication

Egypt

No. of Pages

9

Main Subjects

Civil Engineering

Abstract EN

Many artificial intelligence studies focus on designing new neural network models or optimizing hyperparameters to improve model accuracy.

To develop a reliable model, appropriate data are required, and data preprocessing is an essential part of acquiring the data.

Although various studies treat data preprocessing as part of the data exploration process, they show little awareness of the need for separate technologies and solutions for preprocessing.

Therefore, this study evaluated combinations of preprocessing types in a text-processing neural network model.

Performance was better when two preprocessing types were combined than when three or more were used for data purification.

More specifically, three pairings had a positive effect on accuracy: lemmatization with punctuation splitting, lemmatization with lowering (lowercasing), and lowering with punctuation splitting.

This study is significant because the results allow better decisions to be made about the selection of the preprocessing types in various research fields, including neural network research.
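
The pairwise combinations named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy lemma table, the whitespace tokenization, and the fixed order in which steps are applied are all assumptions made for illustration, and a real pipeline would use a proper lemmatizer.

```python
import re
from itertools import combinations

# Hypothetical toy lemma table (assumption); stands in for a real lemmatizer.
TOY_LEMMAS = {"models": "model", "studies": "study", "were": "be"}


def punctuation_splitting(tokens):
    """Split punctuation marks off into their own tokens."""
    out = []
    for token in tokens:
        out.extend(re.findall(r"\w+|[^\w\s]", token))
    return out


def lowering(tokens):
    """Convert every token to lowercase."""
    return [token.lower() for token in tokens]


def lemmatization(tokens):
    """Map each token to its lemma using the toy lookup table."""
    return [TOY_LEMMAS.get(token, token) for token in tokens]


# Assumed application order: split first, then lowercase, then lemmatize.
STEP_ORDER = ["punctuation_splitting", "lowering", "lemmatization"]
STEPS = {
    "punctuation_splitting": punctuation_splitting,
    "lowering": lowering,
    "lemmatization": lemmatization,
}


def preprocess(text, step_names):
    """Apply the selected preprocessing types to whitespace-split text."""
    tokens = text.split()
    for name in STEP_ORDER:
        if name in step_names:
            tokens = STEPS[name](tokens)
    return tokens


def pairwise_combinations():
    """Enumerate every combination of exactly two preprocessing types."""
    return list(combinations(sorted(STEPS), 2))


if __name__ == "__main__":
    sample = "The Models were good, Studies say."
    for pair in pairwise_combinations():
        print(pair, preprocess(sample, pair))
```

Each pair printed by the script corresponds to one of the three two-type combinations the abstract reports; comparing model accuracy across them would require the training setup described in the paper itself.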

American Psychological Association (APA)

Woo, HoSung & Kim, JaMee & Lee, WonGyu. 2020. Validation of Text Data Preprocessing Using a Neural Network Model. Mathematical Problems in Engineering, Vol. 2020, no. 2020, pp. 1-9.
https://search.emarefa.net/detail/BIM-1193630

Modern Language Association (MLA)

Woo, HoSung…[et al.]. Validation of Text Data Preprocessing Using a Neural Network Model. Mathematical Problems in Engineering No. 2020 (2020), pp. 1-9.
https://search.emarefa.net/detail/BIM-1193630

American Medical Association (AMA)

Woo, HoSung & Kim, JaMee & Lee, WonGyu. Validation of Text Data Preprocessing Using a Neural Network Model. Mathematical Problems in Engineering. 2020. Vol. 2020, no. 2020, pp. 1-9.
https://search.emarefa.net/detail/BIM-1193630

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1193630