Information-Balance-Aware Approximated Summarization of Data Provenance

المؤلفون المشاركون

Pei, Jisheng
Ye, Xiaojun

المصدر

Scientific Programming

العدد

المجلد 2017، العدد 2017 (31 ديسمبر/كانون الأول 2017)، ص ص. 1-11، 11ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2017-09-12

دولة النشر

مصر

عدد الصفحات

11

التخصصات الرئيسية

الرياضيات

الملخص EN

Extracting useful knowledge from data provenance information has been challenging because provenance information is often overwhelmingly enormous for users to understand.

Recently, it has been proposed that we may summarize data provenance items by grouping semantically related provenance annotations so as to achieve concise provenance representation.

Users may provide their intended use of the provenance data in terms of provisioning, and the quality of provenance summarization could be optimized for smaller size and closer distance between the provisioning results derived from the summarization and those from the original provenance.

However, apart from the intended provisioning use, we notice that more dedicated and diverse user requirements can be expressed and considered in the summarization process by assigning importance weights to provenance elements.

Moreover, we introduce information balance index (IBI), an entropy based measurement, to dynamically evaluate the amount of information retained by the summary to check how it suits user requirements.

An alternative provenance summarization algorithm that supports manipulation of information balance is presented.

Case studies and experiments show that, in summarization process, information balance can be effectively steered towards user-defined goals and requirement-driven variants of the provenance summarizations can be achieved to support a series of interesting scenarios.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Pei, Jisheng& Ye, Xiaojun. 2017. Information-Balance-Aware Approximated Summarization of Data Provenance. Scientific Programming،Vol. 2017, no. 2017, pp.1-11.
https://search.emarefa.net/detail/BIM-1203411

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Pei, Jisheng& Ye, Xiaojun. Information-Balance-Aware Approximated Summarization of Data Provenance. Scientific Programming No. 2017 (2017), pp.1-11.
https://search.emarefa.net/detail/BIM-1203411

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Pei, Jisheng& Ye, Xiaojun. Information-Balance-Aware Approximated Summarization of Data Provenance. Scientific Programming. 2017. Vol. 2017, no. 2017, pp.1-11.
https://search.emarefa.net/detail/BIM-1203411

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1203411