A schema-free instance matching algorithm based on virtual document similarity

Joint Authors

Mustafa, Siham
Amrush, Siham

Source

The International Arab Journal of Information Technology

Issue

Vol. 19, Issue 3A (s) (31 May. 2022), pp.432-441, 10 p.

Publisher

Zarqa University Deanship of Scientific Research

Publication Date

2022-05-31

Country of Publication

Jordan

No. of Pages

10

Main Subjects

Information Technology and Computer Science

Abstract EN

With the continuous development of semantic web, especially of the web of data, several knowledge bases expressed by ontologies are independently created and added to the Linked Open Data (LOD) cloud, on a daily basis.

A major challenge for the LOD paradigm is to discover resources that refer to the same real-world object, in order to interlink web resources and hold large scale data integration and sharing.

In this context, instance matching is a promising solution.

It aims to link co-referent instances belonging to heterogeneous knowledge bases with owl: same as links.

Several state-of-the-art existing approaches addressing this issue are based on the prior schema-level matching's, which does not avoid the limitation of heterogeneity at the property-level.

In this paper, we propose a schema-free, scalable and efficient instance matching approach that is independent from matching results at the schema-level.

We transform the instance matching problem to a document similarity problem and we solve it by a Clustering technique that uses an Ascendant Hierarchical Clustering algorithm to group similar instances in the same clusters.

Furthermore, we design multiple validating patterns that use some structural information to validate obtained mappings and eliminate wrong ones.

Experiments on instance matching track from Ontology Alignment Evaluation Initiative (OAEI) show that our approach gets prominent results compared to several participating systems in OAEI’2019, OAEI’2020 and OAEI’2021.

American Psychological Association (APA)

Amrush, Siham& Mustafa, Siham. 2022. A schema-free instance matching algorithm based on virtual document similarity. The International Arab Journal of Information Technology،Vol. 19, no. 3A (s), pp.432-441.
https://search.emarefa.net/detail/BIM-1437108

Modern Language Association (MLA)

Amrush, Siham& Mustafa, Siham. A schema-free instance matching algorithm based on virtual document similarity. The International Arab Journal of Information Technology Vol. 19, no. 3A (Special issue) (2022), pp.432-441.
https://search.emarefa.net/detail/BIM-1437108

American Medical Association (AMA)

Amrush, Siham& Mustafa, Siham. A schema-free instance matching algorithm based on virtual document similarity. The International Arab Journal of Information Technology. 2022. Vol. 19, no. 3A (s), pp.432-441.
https://search.emarefa.net/detail/BIM-1437108

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references : p. 439-440

Record ID

BIM-1437108