A mapreduce-based quick search approach on large files

Joint Authors

Li, Ye feng
Le, Jia jin
Wang, Mei

Source

The International Arab Journal of Information Technology

Issue

Vol. 16, Issue 5 (30 Sep. 2019), pp.791-797, 7 p.

Publisher

Zarqa University

Publication Date

2019-09-30

Country of Publication

Jordan

No. of Pages

7

Main Subjects

Information Technology and Computer Science

Abstract EN

String search is an important branch of pattern matching for information retrieval in various fields.

Over the past four decades, research has focused on skipping more unnecessary characters to improve search performance, and has not taken large-scale data into consideration.

This paper makes two main contributions.

First, we propose a Quick Search algorithm for data Stream (QSS) on a single machine to support string search in a large text file, as opposed to previous research, which is limited to bounded memory.

Second, we implement the search algorithm on the MapReduce framework to speed up retrieval of the search results.

The experiments demonstrate that our approach is fast and effective for large files.
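
The abstract does not spell out how QSS works, so the following is only a minimal sketch of the general idea it names: the classic Quick Search bad-character rule (shift by the character just past the current window) applied to input read in chunks, so that the whole file never has to sit in memory. The function names `build_shift` and `quick_search_stream` and the `chunk_size` parameter are hypothetical, chosen for illustration; this is not the authors' implementation and it does not cover the MapReduce stage.

```python
import io


def build_shift(pattern):
    """Quick Search shift table: for each pattern character, the distance
    from its last occurrence to one position past the end of the pattern."""
    m = len(pattern)
    return {ch: m - i for i, ch in enumerate(pattern)}


def quick_search_stream(stream, pattern, chunk_size=4096):
    """Yield the global offsets of every occurrence of `pattern` in a text
    stream, reading `chunk_size` characters at a time (illustrative sketch)."""
    m = len(pattern)
    if m == 0:
        raise ValueError("pattern must be non-empty")
    shift = build_shift(pattern)
    buf = ""   # unread tail carried over from earlier chunks
    base = 0   # global offset of buf[0]
    eof = False
    while not eof:
        chunk = stream.read(chunk_size)
        eof = not chunk
        buf += chunk
        # Mid-stream, only test windows whose following character is already
        # buffered, because that character decides the shift. At end of
        # stream, the final window is tested as well.
        limit = len(buf) - m if eof else len(buf) - m - 1
        i = 0
        while i <= limit:
            if buf[i:i + m] == pattern:
                yield base + i
            if i + m >= len(buf):
                break  # last window of the stream, nothing left to shift by
            # Characters absent from the pattern allow a full jump of m + 1.
            i += shift.get(buf[i + m], m + 1)
        # Drop the consumed prefix; the remainder may hold a match that
        # spans the boundary with the next chunk.
        base += i
        buf = buf[i:]


text = "abracadabra abracadabra"
print(list(quick_search_stream(io.StringIO(text), "abra", chunk_size=5)))
# prints [0, 7, 12, 19]
```

Deferring a window until its following character has arrived keeps the shift rule unchanged across chunk boundaries, and the buffer never grows much beyond one chunk plus the pattern length.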

American Psychological Association (APA)

Li, Ye feng & Le, Jia jin & Wang, Mei. 2019. A mapreduce-based quick search approach on large files. The International Arab Journal of Information Technology, Vol. 16, no. 5, pp. 791-797.
https://search.emarefa.net/detail/BIM-895062

Modern Language Association (MLA)

Li, Ye feng…[et al.]. A mapreduce-based quick search approach on large files. The International Arab Journal of Information Technology Vol. 16, no. 5 (Sep. 2019), pp. 791-797.
https://search.emarefa.net/detail/BIM-895062

American Medical Association (AMA)

Li, Ye feng & Le, Jia jin & Wang, Mei. A mapreduce-based quick search approach on large files. The International Arab Journal of Information Technology. 2019. Vol. 16, no. 5, pp. 791-797.
https://search.emarefa.net/detail/BIM-895062

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references : p. 796-797

Record ID

BIM-895062