Classification of Textual E-Mail Spam Using Data Mining Techniques

Co-authors

Nazirova, Saadat A.
Aliguliyev, Ramiz M.
Alguliev, Rasim M.

Source

Applied Computational Intelligence and Soft Computing

Issue

Vol. 2011, Issue 2011 (31 December 2011), pp. 1-8, 8 p.

Publisher

Hindawi Publishing Corporation

Publication date

2011-11-10

Country of publication

Egypt

Number of pages

8

Main subjects

Information Technology and Computer Science

Abstract EN

A new method is proposed for clustering spam messages collected in the databases of an antispam system.

A genetic algorithm is developed to solve the clustering problem.

The objective function maximizes the similarity between messages within clusters, where similarity is defined by the k-nearest neighbor algorithm.
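The abstract does not give the exact form of this objective, so the following is only a minimal sketch under stated assumptions: messages are represented as bag-of-words vectors, similarity is cosine similarity, and the objective sums the similarity of every pair (i, j) in which j is one of the k nearest neighbors of i and both fall in the same cluster. The function names (`cosine`, `knn_lists`, `objective`) and the toy messages are hypothetical and do not come from the paper.

```python
import math
from collections import Counter


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_lists(vectors, k):
    """For each message, the set of indices of its k most similar other messages."""
    n = len(vectors)
    result = []
    for i in range(n):
        others = sorted(
            (j for j in range(n) if j != i),
            key=lambda j: cosine(vectors[i], vectors[j]),
            reverse=True,
        )
        result.append(set(others[:k]))
    return result


def objective(labels, vectors, k):
    """Assumed objective: total similarity over k-NN pairs that share a cluster."""
    total = 0.0
    for i, neighbors in enumerate(knn_lists(vectors, k)):
        for j in neighbors:
            if labels[i] == labels[j]:
                total += cosine(vectors[i], vectors[j])
    return total


# Toy data (hypothetical): two obvious themes.
messages = [
    "cheap pills online",
    "cheap pills discount",
    "win lottery prize",
    "lottery prize winner",
]
vectors = [Counter(m.split()) for m in messages]

good = objective([0, 0, 1, 1], vectors, k=1)  # themes kept together
bad = objective([0, 1, 0, 1], vectors, k=1)   # themes split apart
print(good > bad)
```

Under these assumptions, a clustering that keeps nearest neighbors together scores higher than one that separates them, which is the quantity a genetic algorithm would try to maximize.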

Applying a genetic algorithm to constrained problems requires constantly maintaining the feasibility of chromosomes, which slows convergence.

Therefore, to accelerate convergence of the genetic algorithm, a penalty function is used that prevents infeasible chromosomes from occurring when fitness values are ranked.
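The abstract does not state which constraints apply or what form the penalty takes. As a hedged illustration of the general penalty-function idea, the sketch below assumes a chromosome is a list of cluster labels and treats a chromosome as infeasible when any cluster is left empty. The names `penalized_fitness`, `rank_population`, `toy_objective`, and the `penalty_weight` parameter are hypothetical.

```python
def penalized_fitness(labels, n_clusters, raw_objective, penalty_weight=1.0):
    """Raw objective minus a penalty for each empty cluster (assumed constraint)."""
    used = set(labels)
    empty = sum(1 for c in range(n_clusters) if c not in used)
    return raw_objective(labels) - penalty_weight * empty


def rank_population(population, n_clusters, raw_objective, penalty_weight=1.0):
    """Order chromosomes from best to worst by penalized fitness."""
    return sorted(
        population,
        key=lambda ch: penalized_fitness(ch, n_clusters, raw_objective, penalty_weight),
        reverse=True,
    )


def toy_objective(labels):
    """Stand-in objective: counts adjacent positions with the same label."""
    return sum(1.0 for a, b in zip(labels, labels[1:]) if a == b)


population = [
    [0, 0, 0, 0],  # highest raw score, but cluster 1 is empty (infeasible)
    [0, 0, 1, 1],  # feasible
    [0, 1, 0, 1],  # feasible
]
ranked = rank_population(population, n_clusters=2,
                         raw_objective=toy_objective, penalty_weight=5.0)
print(ranked)
```

With a sufficiently large penalty weight, the infeasible chromosome drops to the bottom of the ranking even though its raw score is the highest, so selection favors feasible chromosomes without a separate repair step.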

After classification, knowledge extraction is applied to obtain information about the classes.

A multidocument summarization method is used to obtain an information portrait of each cluster of spam messages.

By classifying and parametrizing spam templates, it will also be possible to determine how themes depend on geography (e.g., which subjects prevail in spam messages sent from certain countries).

Thus, the proposed system will be capable of revealing purposeful information attacks if they occur.

By analyzing the origins of the spam messages in the collection, it is possible to identify and expose organized social networks of spammers.

American Psychological Association (APA) citation style

Alguliev, Rasim M. & Aliguliyev, Ramiz M. & Nazirova, Saadat A. 2011. Classification of Textual E-Mail Spam Using Data Mining Techniques. Applied Computational Intelligence and Soft Computing, Vol. 2011, no. 2011, pp. 1-8.
https://search.emarefa.net/detail/BIM-470437

Modern Language Association (MLA) citation style

Alguliev, Rasim M. … [et al.]. Classification of Textual E-Mail Spam Using Data Mining Techniques. Applied Computational Intelligence and Soft Computing No. 2011 (2011), pp. 1-8.
https://search.emarefa.net/detail/BIM-470437

American Medical Association (AMA) citation style

Alguliev, Rasim M. & Aliguliyev, Ramiz M. & Nazirova, Saadat A. Classification of Textual E-Mail Spam Using Data Mining Techniques. Applied Computational Intelligence and Soft Computing. 2011. Vol. 2011, no. 2011, pp. 1-8.
https://search.emarefa.net/detail/BIM-470437

Data type

Articles

Text language

English

Notes

Includes bibliographical references

Record number

BIM-470437