A framework of summarizing XML documents with schemas

Joint Authors

Lv, Teng
Yan, Ping

Source

The International Arab Journal of Information Technology

Issue

Vol. 10, Issue 1 (31 Jan. 2013)9 p.

Publisher

Zarqa University

Publication Date

2013-01-31

Country of Publication

Jordan

No. of Pages

9

Main Subjects

Media and Communication

Topics

Abstract EN

XML has become one of the de facto standards of data exchange and representation in many applications.

An XML document is usually too complex and large to understand and use for a human being.

A summarized XML document of the original document is useful in such cases.

Three standards are given to evaluate the final summarized XML document : document size, information content, and information importance.

A framework of summarizing an XML document based both on the document itself and the schema is given, which applies schema to summarize XML documents because there are many important semantic and structural information implied by the schema.

In our framework, redundant data are first removed by abnormal functional dependencies and schema structure.

Then tags and values of the XML document are summarized based on the document itself and schema.

Our framework is a semi-automatic approach which can help users to summarize an XML document in the sense that some parameters must be specified by the users.

Experiments show that the framework can make the summarized XML document has a good balance of document size, information content, and information importance comparing with the original one.

American Psychological Association (APA)

Lv, Teng& Yan, Ping. 2013. A framework of summarizing XML documents with schemas. The International Arab Journal of Information Technology،Vol. 10, no. 1.
https://search.emarefa.net/detail/BIM-311971

Modern Language Association (MLA)

Lv, Teng& Yan, Ping. A framework of summarizing XML documents with schemas. The International Arab Journal of Information Technology Vol. 10, no. 1 (Jan. 2013).
https://search.emarefa.net/detail/BIM-311971

American Medical Association (AMA)

Lv, Teng& Yan, Ping. A framework of summarizing XML documents with schemas. The International Arab Journal of Information Technology. 2013. Vol. 10, no. 1.
https://search.emarefa.net/detail/BIM-311971

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references.

Record ID

BIM-311971