A Two-Step Resume Information Extraction Algorithm

المؤلفون المشاركون

Niu, Zhendong
Chen, Jie
Zhang, Chunxia

المصدر

Mathematical Problems in Engineering

العدد

المجلد 2018، العدد 2018 (31 ديسمبر/كانون الأول 2018)، ص ص. 1-8، 8ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2018-05-08

دولة النشر

مصر

عدد الصفحات

8

التخصصات الرئيسية

هندسة مدنية

الملخص EN

With the rapid growth of Internet-based recruiting, there are a great number of personal resumes among recruiting systems.

To gain more attention from the recruiters, most resumes are written in diverse formats, including varying font size, font colour, and table cells.

However, the diversity of format is harmful to data mining, such as resume information extraction, automatic job matching, and candidates ranking.

Supervised methods and rule-based methods have been proposed to extract facts from resumes, but they strongly rely on hierarchical structure information and large amounts of labelled data, which are hard to collect in reality.

In this paper, we propose a two-step resume information extraction approach.

In the first step, raw text of resume is identified as different resume blocks.

To achieve the goal, we design a novel feature, Writing Style, to model sentence syntax information.

Besides word index and punctuation index, word lexical attribute and prediction results of classifiers are included in Writing Style.

In the second step, multiple classifiers are employed to identify different attributes of fact information in resumes.

Experimental results on a real-world dataset show that the algorithm is feasible and effective.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Chen, Jie& Zhang, Chunxia& Niu, Zhendong. 2018. A Two-Step Resume Information Extraction Algorithm. Mathematical Problems in Engineering،Vol. 2018, no. 2018, pp.1-8.
https://search.emarefa.net/detail/BIM-1208069

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Chen, Jie…[et al.]. A Two-Step Resume Information Extraction Algorithm. Mathematical Problems in Engineering No. 2018 (2018), pp.1-8.
https://search.emarefa.net/detail/BIM-1208069

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Chen, Jie& Zhang, Chunxia& Niu, Zhendong. A Two-Step Resume Information Extraction Algorithm. Mathematical Problems in Engineering. 2018. Vol. 2018, no. 2018, pp.1-8.
https://search.emarefa.net/detail/BIM-1208069

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1208069