An efficient algorithm for extracting infrequent itemsets from weblog

Joint Authors

Bakariya, Brijesh
Thakur, Ghanshyam
Chaturvedi, Kapil

Source

The International Arab Journal of Information Technology

Issue

Vol. 16, Issue 2 (31 Mar. 2019), 6 p.

Publisher

Zarqa University

Publication Date

2019-03-31

Country of Publication

Jordan

No. of Pages

6

Main Subjects

Information Technology and Computer Science

Abstract EN

Weblog data contains unstructured information.

Because of this, extracting frequent patterns from weblog databases is a very challenging task.

A power set lattice strategy is adopted for handling that kind of problem.

In this lattice, the top level contains the full set and the bottom level contains the empty set.

Most algorithms follow a bottom-up strategy, i.e., they combine smaller sets into larger ones.

Efficient lattice traversal techniques are presented which quickly identify all the long frequent itemsets and their subsets if required.

This strategy is suitable for discovering frequent itemsets, but it may not be worthwhile for infrequent itemsets.

In this paper, we propose the Infrequent Itemset Mining for Weblog (IIMW) algorithm, a top-down, breadth-first, level-wise algorithm for discovering infrequent itemsets.

We compared IIMW with Apriori-Rare and Apriori-Inverse and generated results under different parameters such as candidate itemsets, frequent itemsets, time, transaction database and support threshold.
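
The top-down, breadth-first, level-wise idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' IIMW implementation, whose details are not given in this record. The function names, the absolute-count support threshold and the sample page-visit transactions are all hypothetical. The sketch starts from the full item set at the top of the power set lattice and descends one level at a time. It relies only on the standard anti-monotone property of support: every subset of a frequent itemset is frequent, so a frequent itemset is never expanded further.

```python
def support(itemset, transactions):
    """Count the transactions that contain every item in `itemset`."""
    return sum(1 for t in transactions if itemset <= t)


def infrequent_itemsets_top_down(transactions, min_support):
    """Return {itemset: support} for every non-empty itemset whose
    support is below `min_support` (an absolute count; assumed here).

    Illustrative sketch only, not the published IIMW algorithm.
    """
    transactions = [frozenset(t) for t in transactions]
    # Top of the lattice: the set of all items seen in any transaction.
    top = frozenset().union(*transactions)
    infrequent = {}
    level = {top}
    while level:
        next_level = set()
        for itemset in level:
            if not itemset:
                continue  # the empty set at the bottom is not reported
            count = support(itemset, transactions)
            if count < min_support:
                infrequent[itemset] = count
                # Subsets of an infrequent itemset may be frequent or
                # infrequent, so descend to the (k-1)-item subsets.
                for item in itemset:
                    next_level.add(itemset - {item})
            # A frequent itemset is pruned: all its subsets are frequent.
        level = next_level
    return infrequent


# Hypothetical weblog sessions: each set lists the pages visited.
sessions = [
    {"home", "news", "sports"},
    {"home", "news"},
    {"home", "sports"},
    {"home"},
]
result = infrequent_itemsets_top_down(sessions, min_support=2)
for itemset, count in sorted(result.items(), key=lambda kv: -len(kv[0])):
    print(sorted(itemset), count)
```

With these sample sessions and a threshold of 2, only {home, news, sports} and {news, sports} fall below the threshold. The traversal stops descending from {home, news} and {home, sports} as soon as they are found to be frequent. The worst case is still exponential in the number of items, so this sketch is only practical for small item sets.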

American Psychological Association (APA)

Bakariya, Brijesh & Thakur, Ghanshyam & Chaturvedi, Kapil. 2019. An efficient algorithm for extracting infrequent itemsets from weblog. The International Arab Journal of Information Technology, Vol. 16, no. 2.
https://search.emarefa.net/detail/BIM-894947

Modern Language Association (MLA)

Bakariya, Brijesh…[et al.]. An efficient algorithm for extracting infrequent itemsets from weblog. The International Arab Journal of Information Technology Vol. 16, no. 2 (Mar. 2019).
https://search.emarefa.net/detail/BIM-894947

American Medical Association (AMA)

Bakariya, Brijesh & Thakur, Ghanshyam & Chaturvedi, Kapil. An efficient algorithm for extracting infrequent itemsets from weblog. The International Arab Journal of Information Technology. 2019. Vol. 16, no. 2.
https://search.emarefa.net/detail/BIM-894947

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-894947