Conference proceeding
Dart: A Geographic Information System on Hadoop
2015 IEEE 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, pp 90-97
01 Jun 2015
Abstract
In the field of big data research, analytics on spatio-temporal data from social media is one of the fastest growing areas and poses a major challenge on research and application. An efficient and flexible computing and storage platform is needed for users to analyze spatio-temporal patterns in huge amount of social media data. This paper introduces a scalable and distributed geographic information system, called Dart, based on Hadoop and HBase. Dart provides a hybrid table schema to store spatial data in HBase so that the Reduce process can be omitted for operations like calculating the mean center and the median center. It employs reasonable pre-splitting and hash techniques to avoid data imbalance and hot region problems. It also supports massive spatial data analysis like K-Nearest Neighbors (KNN) and Geometric Median Distribution. In our experiments, we evaluate the performance of Dart by processing 160 GB Twitter data on an Amazon EC2 cluster. The experimental results show that Dart is very scalable and efficient.
Metrics
Details
- Title
- Dart: A Geographic Information System on Hadoop
- Creators
- Hong Zhang - University of WyomingZhibo Sun - University of WyomingZixia Liu - University of WyomingChen Xu - University of WyomingLiqiang Wang - University of Wyoming
- Contributors
- C Pu (Editor)A Mohindra (Editor)
- Publication Details
- 2015 IEEE 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, pp 90-97
- Publisher
- IEEE
- Number of pages
- 8
- Grant note
- 1622292 / Office of Advanced Cyberinfrastructure (OAC); National Science Foundation (NSF); NSF - Directorate for Computer & Information Science & Engineering (CISE)
- Resource Type
- Conference proceeding
- Language
- English
- Academic Unit
- Computer Science (Computing)
- Web of Science ID
- WOS:000380473600012
- Scopus ID
- 2-s2.0-84960083921
- Other Identifier
- 991021871063004721
InCites Highlights
Data related to this publication, from InCites Benchmarking & Analytics tool:
- Web of Science research areas
- Computer Science, Information Systems
- Computer Science, Theory & Methods
- Engineering, Electrical & Electronic