Lexical noise analysis and removal in intelligent search engines

Other Title(s)

تحليل الضوضاء النصية و إزالتها في محركات البحث الذكية

Author

Jabr, Tariq Jabr Ibrahim

Source

Journal of King Abdulaziz University : Computing and Information Technology Sciences

Issue

Vol. 1, Issue 2 (31 Dec. 2012), pp.69-103, 35 p.

Publisher

King Abdul Aziz University Faculty of Computing and Information Technology

Publication Date

2012-12-31

Country of Publication

Saudi Arabia

No. of Pages

35

Main Subjects

Electronic engineering

Abstract EN

In the field of intelligent information retrieval (IR), latent semantic indexing (LSI) is a popular technique for retrieving information by similarity of meaning rather than by exact lexical matching.

This technique overcomes the problems associated with synonymy and polysemy (common causes of inaccuracy in matching algorithms).

A core component of the process is the singular value decomposition (SVD), which acts as a mathematical model for the lexical noise in the term-document matrix (TDM).

This paper investigates various aspects of LSI from the viewpoint of noise modeling and removal in image processing.

The mathematical modeling of lexical noise in the TDM is discussed and investigated.

The work addresses the definition of noise in text processing and seeks to determine the best structure of the TDM, that is, the structure that would facilitate efficient searching within LSI.
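
The role of the SVD described in the abstract can be illustrated with a minimal sketch of the standard LSI recipe: factor the TDM, keep only the k largest singular values (treating the discarded ones as lexical noise), and compare queries with documents in the reduced space. This is not taken from the paper itself. The toy matrix, the term list, the choice k = 2, and the helper names `lsi_fit`, `fold_in`, and `cosine` are all assumptions made for illustration, and NumPy is assumed to be available.

```python
import numpy as np

# Hypothetical toy term-document matrix: rows are terms, columns are documents.
# The counts are invented for illustration only.
terms = ["car", "automobile", "engine", "flower", "petal"]
tdm = np.array([
    [1, 0, 1, 0],   # car
    [0, 1, 1, 0],   # automobile
    [1, 1, 1, 0],   # engine
    [0, 0, 0, 1],   # flower
    [0, 0, 0, 1],   # petal
], dtype=float)


def lsi_fit(matrix, k):
    """Return the rank-k factors (U_k, s_k, Vt_k) of the truncated SVD."""
    u, s, vt = np.linalg.svd(matrix, full_matrices=False)
    # Singular values come back sorted in descending order, so slicing
    # keeps the k strongest components and drops the rest.
    return u[:, :k], s[:k], vt[:k, :]


def fold_in(query_vec, u_k, s_k):
    """Project a term-space query vector into the k-dimensional latent space."""
    return (query_vec @ u_k) / s_k


def cosine(a, b):
    """Cosine similarity between two latent-space vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


u_k, s_k, vt_k = lsi_fit(tdm, k=2)
doc_vecs = vt_k.T  # one row per document in the latent space

# Query containing only the term "car".
query = np.array([1, 0, 0, 0, 0], dtype=float)
q_hat = fold_in(query, u_k, s_k)
scores = [cosine(q_hat, d) for d in doc_vecs]

for j, score in enumerate(scores):
    print(f"document {j}: similarity {score:.3f}")
```

In this toy setup the second document never contains the word "car", yet it scores highly against the query because it shares the latent "vehicle" component with the documents that do, while the "flower" document scores near zero. That is the synonymy effect the abstract refers to; how many singular values to keep is exactly the kind of noise-modeling question the paper studies, and k = 2 here is an arbitrary choice.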

American Psychological Association (APA)

Jabr, Tariq Jabr Ibrahim. 2012. Lexical noise analysis and removal in intelligent search engines. Journal of King Abdulaziz University : Computing and Information Technology Sciences, Vol. 1, no. 2, pp.69-103.
https://search.emarefa.net/detail/BIM-700870

Modern Language Association (MLA)

Jabr, Tariq Jabr Ibrahim. Lexical noise analysis and removal in intelligent search engines. Journal of King Abdulaziz University : Computing and Information Technology Sciences Vol. 1, no. 2 (2012), pp.69-103.
https://search.emarefa.net/detail/BIM-700870

American Medical Association (AMA)

Jabr, Tariq Jabr Ibrahim. Lexical noise analysis and removal in intelligent search engines. Journal of King Abdulaziz University : Computing and Information Technology Sciences. 2012. Vol. 1, no. 2, pp.69-103.
https://search.emarefa.net/detail/BIM-700870

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references : p. 101-102

Record ID

BIM-700870