Exploring cancer risk factors using data mining techniques : a case study from Yemen

Other Title(s)

استكشاف عوامل خطر السرطان باستخدام تقنيات التنقيب في البيانات : دراسة حالة من اليمن

Joint Authors

al-Hashidi, Abd Allah
Muhsin, Abd al-Qadir Muhammad
Sallam, Abd Allah A.

Source

Journal of Science and Technology

Issue

Vol. 23, Issue 2 (31 Dec. 2018), pp.1-30, 30 p.

Publisher

University of Science and Technology Faculty of Computing and Engineering

Publication Date

2018-12-31

Country of Publication

Yemen

No. of Pages

30

Main Subjects

Engineering & Technology Sciences (Multidisciplinary)

Abstract EN

Data Mining is a hot scientific research topic that has many applications in various life aspects.

Healthcare and medicine are among those aspects that attracted data mining researchers who sought to solve decision-making problems.

Cancer diagnosis, treatment, and prediction are procedures that have been using data mining for decades.

In Yemen, some cancer risk factors seem to be different from those in other parts of the world.

By mining the data available at National Cancer Control Foundation (NCCF), useful knowledge have been extracted.

In this paper, decision tree classification was selected for building a model to predict cancer risk factors.

As the NCCF database contained data describe some social life aspects, environmental circumstances, lifestyle, etc., mining those data can contribute in the endeavors of clearing ambiguity about cancer risk factors in Yemen.

The informative attributes that were selected for model building included gender, marital status, number of family members, province, chewing Qat, chewing tobacco (Shamaa), smoking, age, relatives with cancer, and cancer class.

These data was prepared for Knowledge Data Discovery process.

Then, it was prepared for feeding into C4.5 learning algorithms.

The results shown that smoking, chewing tobacco (Shamaa), province of residence, marital status, and age are the most important cancer risk factors.

The model produced found of high performance.

In addition, the rules extracted from the model tree can also be of high value for both people and healthcare sector.

American Psychological Association (APA)

al-Hashidi, Abd Allah& Sallam, Abd Allah A.& Muhsin, Abd al-Qadir Muhammad. 2018. Exploring cancer risk factors using data mining techniques : a case study from Yemen. Journal of Science and Technology،Vol. 23, no. 2, pp.1-30.
https://search.emarefa.net/detail/BIM-1314535

Modern Language Association (MLA)

al-Hashidi, Abd Allah…[et al.]. Exploring cancer risk factors using data mining techniques : a case study from Yemen. Journal of Science and Technology Vol. 23, no. 2 (2018), pp.1-30.
https://search.emarefa.net/detail/BIM-1314535

American Medical Association (AMA)

al-Hashidi, Abd Allah& Sallam, Abd Allah A.& Muhsin, Abd al-Qadir Muhammad. Exploring cancer risk factors using data mining techniques : a case study from Yemen. Journal of Science and Technology. 2018. Vol. 23, no. 2, pp.1-30.
https://search.emarefa.net/detail/BIM-1314535

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references : p. 28-30

Record ID

BIM-1314535