CLDA: An Effective Topic Model for Mining User Interest Preference under Big Data Background

Joint Authors

Qiu, Lirong
Yu, Jia

Source

Complexity

Issue

Vol. 2018, Issue 2018 (31 Dec. 2018), pp.1-10, 10 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2018-05-16

Country of Publication

Egypt

No. of Pages

10

Main Subjects

Philosophy

Abstract EN

In the present big data background, how to effectively excavate useful information is the problem that big data is facing now.

The purpose of this study is to construct a more effective method of mining interest preferences of users in a particular field in the context of today’s big data.

We mainly use a large number of user text data from microblog to study.

LDA is an effective method of text mining, but it will not play a very good role in applying LDA directly to a large number of short texts in microblog.

In today’s more effective topic modeling project, short texts need to be aggregated into long texts to avoid data sparsity.

However, aggregated short texts are mixed with a lot of noise, reducing the accuracy of mining the user’s interest preferences.

In this paper, we propose Combining Latent Dirichlet Allocation (CLDA), a new topic model that can learn the potential topics of microblog short texts and long texts simultaneously.

The data sparsity of short texts is avoided by aggregating long texts to assist in learning short texts.

Short text filtering long text is reused to improve mining accuracy, making long texts and short texts effectively combined.

Experimental results in a real microblog data set show that CLDA outperforms many advanced models in mining user interest, and we also confirm that CLDA also has good performance in recommending systems.

American Psychological Association (APA)

Qiu, Lirong& Yu, Jia. 2018. CLDA: An Effective Topic Model for Mining User Interest Preference under Big Data Background. Complexity،Vol. 2018, no. 2018, pp.1-10.
https://search.emarefa.net/detail/BIM-1133267

Modern Language Association (MLA)

Qiu, Lirong& Yu, Jia. CLDA: An Effective Topic Model for Mining User Interest Preference under Big Data Background. Complexity No. 2018 (2018), pp.1-10.
https://search.emarefa.net/detail/BIM-1133267

American Medical Association (AMA)

Qiu, Lirong& Yu, Jia. CLDA: An Effective Topic Model for Mining User Interest Preference under Big Data Background. Complexity. 2018. Vol. 2018, no. 2018, pp.1-10.
https://search.emarefa.net/detail/BIM-1133267

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1133267