On Detecting and Removing Superficial Redundancy in Vector Databases

Joint Authors

DeCastro-García, Noemí
Muñoz Castañeda, Ángel Luis
Carriegos, Miguel V.
Fernández Rodríguez, Mario

Source

Mathematical Problems in Engineering

Issue

Vol. 2018, Issue 2018 (31 Dec. 2018), pp.1-14, 14 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2018-05-24

Country of Publication

Egypt

No. of Pages

14

Main Subjects

Civil Engineering

Abstract EN

A mathematical model is proposed in order to obtain an automatized tool to remove any unnecessary data, to compute the level of the redundancy, and to recover the original and filtered database, at any time of the process, in a vector database.

This type of database can be modeled as an oriented directed graph.

Thus, the database is characterized by an adjacency matrix.

Therefore, a record is no longer a row but a matrix.

Then, the problem of cleaning redundancies is addressed from a theoretical point of view.

Superficial redundancy is measured and filtered by using the 1-norm of a matrix.

Algorithms are presented by Python and MapReduce, and a case study of a real cybersecurity database is performed.

American Psychological Association (APA)

DeCastro-García, Noemí& Muñoz Castañeda, Ángel Luis& Fernández Rodríguez, Mario& Carriegos, Miguel V.. 2018. On Detecting and Removing Superficial Redundancy in Vector Databases. Mathematical Problems in Engineering،Vol. 2018, no. 2018, pp.1-14.
https://search.emarefa.net/detail/BIM-1207019

Modern Language Association (MLA)

DeCastro-García, Noemí…[et al.]. On Detecting and Removing Superficial Redundancy in Vector Databases. Mathematical Problems in Engineering No. 2018 (2018), pp.1-14.
https://search.emarefa.net/detail/BIM-1207019

American Medical Association (AMA)

DeCastro-García, Noemí& Muñoz Castañeda, Ángel Luis& Fernández Rodríguez, Mario& Carriegos, Miguel V.. On Detecting and Removing Superficial Redundancy in Vector Databases. Mathematical Problems in Engineering. 2018. Vol. 2018, no. 2018, pp.1-14.
https://search.emarefa.net/detail/BIM-1207019

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1207019