An Efficient MapReduce-Based Parallel Clustering Algorithm for Distributed Traffic Subarea Division

Joint Authors

Xia, Dawen
Wang, Binfeng
Li, Yantao
Rong, Zhuobo
Zhang, Zili

Source

Discrete Dynamics in Nature and Society

Issue

Vol. 2015, Issue 2015 (31 Dec. 2015), pp.1-18, 18 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2015-09-15

Country of Publication

Egypt

No. of Pages

18

Main Subjects

Mathematics

Abstract EN

Traffic subarea division is vital for traffic system management and traffic network analysis in intelligent transportation systems (ITSs).

Since existing methods may not be suitable for big traffic data processing, this paper presents a MapReduce-based Parallel Three-Phase K-Means (Par3PKM) algorithm for solving the traffic subarea division problem on the widely adopted Hadoop distributed computing platform.

Specifically, we first modify the distance metric and initialization strategy of K-Means and then employ the MapReduce paradigm to redesign the optimized K-Means algorithm for parallel clustering of large-scale taxi trajectories.

Moreover, we propose a boundary identifying method to connect the borders of clustering results for each cluster.

Finally, using the proposed approach, we divide Beijing into traffic subareas based on real-world trajectory data sets generated by 12,000 taxis over a period of one month.

Experimental evaluation results indicate that, compared with K-Means, Par2PK-Means, and ParCLARA, Par3PKM achieves higher efficiency, higher accuracy, and better scalability and can effectively divide traffic subareas with big taxi trajectory data.
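
For readers unfamiliar with how K-Means maps onto MapReduce, the following is a minimal single-machine sketch of one generic iteration. It is not the Par3PKM algorithm itself: the abstract does not specify the modified distance metric or the initialization strategy, so plain Euclidean distance and caller-supplied centroids are assumed here, and the function names `map_phase`, `reduce_phase`, and `kmeans_iteration` are hypothetical.

```python
from collections import defaultdict
from math import dist  # Euclidean distance (assumption; the paper modifies the metric)


def map_phase(points, centroids):
    """Map step: emit (index of nearest centroid, point) for every point."""
    for p in points:
        nearest = min(range(len(centroids)), key=lambda i: dist(p, centroids[i]))
        yield nearest, p


def reduce_phase(pairs, centroids):
    """Reduce step: replace each centroid with the mean of its assigned points."""
    groups = defaultdict(list)
    for idx, p in pairs:
        groups[idx].append(p)
    new_centroids = list(centroids)  # a centroid with no points is left unchanged
    for idx, pts in groups.items():
        new_centroids[idx] = tuple(sum(coord) / len(pts) for coord in zip(*pts))
    return new_centroids


def kmeans_iteration(points, centroids):
    """Run one map/reduce round and return the updated centroids."""
    return reduce_phase(map_phase(points, centroids), centroids)
```

On a real Hadoop cluster the map step would run in parallel over splits of the trajectory data and the reduce step would be keyed by centroid index; the sketch only shows the data flow between the two steps.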

American Psychological Association (APA)

Xia, Dawen & Wang, Binfeng & Li, Yantao & Rong, Zhuobo & Zhang, Zili. 2015. An Efficient MapReduce-Based Parallel Clustering Algorithm for Distributed Traffic Subarea Division. Discrete Dynamics in Nature and Society, Vol. 2015, no. 2015, pp.1-18.
https://search.emarefa.net/detail/BIM-1060769

Modern Language Association (MLA)

Xia, Dawen…[et al.]. An Efficient MapReduce-Based Parallel Clustering Algorithm for Distributed Traffic Subarea Division. Discrete Dynamics in Nature and Society No. 2015 (2015), pp.1-18.
https://search.emarefa.net/detail/BIM-1060769

American Medical Association (AMA)

Xia, Dawen & Wang, Binfeng & Li, Yantao & Rong, Zhuobo & Zhang, Zili. An Efficient MapReduce-Based Parallel Clustering Algorithm for Distributed Traffic Subarea Division. Discrete Dynamics in Nature and Society. 2015. Vol. 2015, no. 2015, pp.1-18.
https://search.emarefa.net/detail/BIM-1060769

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1060769