CLDA: An Effective Topic Model for Mining User Interest Preference under Big Data Background

المؤلفون المشاركون

Qiu, Lirong
Yu, Jia

المصدر

Complexity

العدد

المجلد 2018، العدد 2018 (31 ديسمبر/كانون الأول 2018)، ص ص. 1-10، 10ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2018-05-16

دولة النشر

مصر

عدد الصفحات

10

التخصصات الرئيسية

الفلسفة

الملخص EN

In the present big data background, how to effectively excavate useful information is the problem that big data is facing now.

The purpose of this study is to construct a more effective method of mining interest preferences of users in a particular field in the context of today’s big data.

We mainly use a large number of user text data from microblog to study.

LDA is an effective method of text mining, but it will not play a very good role in applying LDA directly to a large number of short texts in microblog.

In today’s more effective topic modeling project, short texts need to be aggregated into long texts to avoid data sparsity.

However, aggregated short texts are mixed with a lot of noise, reducing the accuracy of mining the user’s interest preferences.

In this paper, we propose Combining Latent Dirichlet Allocation (CLDA), a new topic model that can learn the potential topics of microblog short texts and long texts simultaneously.

The data sparsity of short texts is avoided by aggregating long texts to assist in learning short texts.

Short text filtering long text is reused to improve mining accuracy, making long texts and short texts effectively combined.

Experimental results in a real microblog data set show that CLDA outperforms many advanced models in mining user interest, and we also confirm that CLDA also has good performance in recommending systems.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Qiu, Lirong& Yu, Jia. 2018. CLDA: An Effective Topic Model for Mining User Interest Preference under Big Data Background. Complexity،Vol. 2018, no. 2018, pp.1-10.
https://search.emarefa.net/detail/BIM-1133267

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Qiu, Lirong& Yu, Jia. CLDA: An Effective Topic Model for Mining User Interest Preference under Big Data Background. Complexity No. 2018 (2018), pp.1-10.
https://search.emarefa.net/detail/BIM-1133267

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Qiu, Lirong& Yu, Jia. CLDA: An Effective Topic Model for Mining User Interest Preference under Big Data Background. Complexity. 2018. Vol. 2018, no. 2018, pp.1-10.
https://search.emarefa.net/detail/BIM-1133267

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1133267