Solving the Fragment Complexity of Official, Social, and Sensorial Urban Data

Joint Authors

Song, Jie
Liu, Hui
Jiang, Jingqing
Hou, Yaowei

Source

Complexity

Issue

Vol. 2020, Issue 2020 (31 Dec. 2020), pp.1-14, 14 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2020-10-15

Country of Publication

Egypt

No. of Pages

14

Main Subjects

Philosophy

Abstract EN

Cities in the big data era hold the massive urban data to create valuable information and digitally enhanced services.

Sources of urban data are generally categorized as one of the three types: official, social, and sensorial, which are from the government and enterprises, social networks of citizens, and the sensor network.

These types typically differ significantly from each other but are consolidated together for the smart urban services.

Based on the sophisticated consolidation approaches, we argue that a new challenge, fragment complexity that represents a well-integrated data has appropriate but fragmentary schema and difficult to be queried, is ignored in the state-of-art urban data management.

Comparing with predefined and rigid schema, fragmentary schema means a dataset contains millions of attributes but nonorthogonally distributed among tables, and of course, values of these attributes are even massive.

As far as a query is concerned, locating where these attributes are being stored is the first encountered problem, while traditional value-based query optimization has no contributions.

To address this problem, we propose an index on massive attributes as an attributes-oriented optimization, namely, attribute index.

Attribute index is a secondary index for locating files in which the target attributes are stored.

It contains three parts: ATree for searching keys, DTree for locating keys among files, and ADLinks as a mapping table between ATree and DTree.

In this paper, the index architecture, logical structure and algorithms, the implementation details, the creation process, the integration to the existing key-value store, and the urban application scenario are described.

Experiments show that, in comparison with B + -Tree, LSM-Tree, and AVL-Tree, the query time of ATree is 1.1x, 1.5x, and 1.2x faster, respectively.

Finally, we integrate our proposition with HBase, namely, UrbanBase, whose query performance is 1.3x faster than the original HBase.

American Psychological Association (APA)

Liu, Hui& Jiang, Jingqing& Hou, Yaowei& Song, Jie. 2020. Solving the Fragment Complexity of Official, Social, and Sensorial Urban Data. Complexity،Vol. 2020, no. 2020, pp.1-14.
https://search.emarefa.net/detail/BIM-1145252

Modern Language Association (MLA)

Liu, Hui…[et al.]. Solving the Fragment Complexity of Official, Social, and Sensorial Urban Data. Complexity No. 2020 (2020), pp.1-14.
https://search.emarefa.net/detail/BIM-1145252

American Medical Association (AMA)

Liu, Hui& Jiang, Jingqing& Hou, Yaowei& Song, Jie. Solving the Fragment Complexity of Official, Social, and Sensorial Urban Data. Complexity. 2020. Vol. 2020, no. 2020, pp.1-14.
https://search.emarefa.net/detail/BIM-1145252

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1145252