Using Distributed Data over HBase in Big Data Analytics Platform for Clinical Services

Joint Authors

Chrimes, Dillon
Zamani, Hamid

Source

Computational and Mathematical Methods in Medicine

Issue

Vol. 2017, Issue 2017 (31 Dec. 2017), pp.1-16, 16 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2017-12-11

Country of Publication

Egypt

No. of Pages

16

Main Subjects

Medicine

Abstract EN

Big data analytics (BDA) is important to reduce healthcare costs.

However, there are many challenges in data aggregation, maintenance, integration, translation, analysis, and security/privacy.

The study objective, to establish an interactive BDA platform with simulated patient data using open-source software technologies, was achieved by constructing a platform framework on the Hadoop Distributed File System (HDFS) using HBase (a key-value NoSQL database).

Distributed data structures were generated from benchmarked hospital-specific metadata of nine billion patient records.

At optimized iteration, HDFS ingestion of HFiles to HBase store files revealed sustained availability over hundreds of iterations; however, completing MapReduce to HBase required a week for 10 TB and a month for three billion (30 TB) indexed patient records.

Inconsistencies found in MapReduce limited the capacity to generate and replicate data efficiently.

Apache Spark and Drill showed high performance with high usability for technical support but poor usability for clinical services.

A hospital system based on patient-centric data was challenging to implement in HBase, in that not all data profiles were fully integrated with the complex patient-to-hospital relationships.

However, we recommend using HBase to achieve secured patient data while querying entire hospital volumes in a simplified clinical event model across clinical services.

American Psychological Association (APA)

Chrimes, Dillon & Zamani, Hamid. 2017. Using Distributed Data over HBase in Big Data Analytics Platform for Clinical Services. Computational and Mathematical Methods in Medicine, Vol. 2017, no. 2017, pp.1-16.
https://search.emarefa.net/detail/BIM-1142235

Modern Language Association (MLA)

Chrimes, Dillon & Zamani, Hamid. Using Distributed Data over HBase in Big Data Analytics Platform for Clinical Services. Computational and Mathematical Methods in Medicine No. 2017 (2017), pp.1-16.
https://search.emarefa.net/detail/BIM-1142235

American Medical Association (AMA)

Chrimes, Dillon & Zamani, Hamid. Using Distributed Data over HBase in Big Data Analytics Platform for Clinical Services. Computational and Mathematical Methods in Medicine. 2017. Vol. 2017, no. 2017, pp.1-16.
https://search.emarefa.net/detail/BIM-1142235

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1142235