Parallel Processing over Spatial-Temporal Datasets from Geo, Bio, Climate and Social Science Communities

A Research Roadmap

Sushil K. Prasad, Danial Aghajarian, Michael McDermott, Dhara Shah, Mohamed Mokbel, Satish Puri, Sergio J. Rey, Shashi Shekhar, Yiqun Xe, Ranga Raju Vatsavai, Fusheng Wang, Yanhui Liang, Hoang Vo, Shaowen Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

This vision paper reviews the current state-ofart and lays out emerging research challenges in parallel processing of spatial-temporal large datasets relevant to a variety of scientific communities. The spatio-temporal data, whether captured through remote sensors (global earth observations), ground and ocean sensors (e.g., soil moisture sensors, buoys), social media and hand-held, traffic-related sensors and cameras, medical imaging (e.g., MRI), or large scale simulations (e.g., climate) have always been 'big.' A common thread among all these big collections of datasets is that they are spatial and temporal. Processing and analyzing these datasets requires high-performance computing (HPC) infrastructures. Various agencies, scientific communities and increasingly the society at large rely on spatial data management, analysis, and spatial data mining to gain insights and produce actionable plans. Therefore, an ecosystem of integrated and reliable software infrastructure is required for spatialtemporal big data management and analysis that will serve as crucial tools for solving a wide set of research problems from different scientific and engineering areas and to empower users with next-generation tools. This vision requires a multidisciplinary effort to significantly advance domain research and have a broad impact on the society. The areas of research discussed in this paper include (i) spatial data mining, (ii) data analytics over remote sensing data, (iii) processing medical images, (iv) spatial econometrics analyses, (v) Map-Reducebased systems for spatial computation and visualization, (vi) CyberGIS systems, and (vii) foundational parallel algorithms and data structures for polygonal datasets, and why HPC infrastructures, including harnessing graphics accelerators, are needed for time-critical applications.

Original languageEnglish
Title of host publicationProceedings - 2017 IEEE 6th International Congress on Big Data, BigData Congress 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages232-250
Number of pages19
ISBN (Electronic)9781538619964
DOIs
Publication statusPublished - 7 Sep 2017
Externally publishedYes
Event6th IEEE International Congress on Big Data, BigData Congress 2017 - Honolulu, United States
Duration: 25 Jun 201730 Jun 2017

Other

Other6th IEEE International Congress on Big Data, BigData Congress 2017
CountryUnited States
CityHonolulu
Period25/6/1730/6/17

Fingerprint

Social sciences
Sensors
Processing
Information management
Data mining
Buoys
Soil moisture
Medical imaging
Parallel algorithms
Magnetic resonance imaging
Ecosystems
Particle accelerators
Data structures
Remote sensing
Visualization
Earth (planet)
Cameras
Roadmap
Parallel processing
Climate

Keywords

  • CyberGIS
  • High performance computing
  • Map-reduce systems
  • Medical images
  • Parallel algorithms and data structures
  • Remote sensing data
  • Spatial data mining
  • Spatial econometrics

ASJC Scopus subject areas

  • Information Systems
  • Artificial Intelligence
  • Computer Networks and Communications
  • Information Systems and Management

Cite this

Prasad, S. K., Aghajarian, D., McDermott, M., Shah, D., Mokbel, M., Puri, S., ... Wang, S. (2017). Parallel Processing over Spatial-Temporal Datasets from Geo, Bio, Climate and Social Science Communities: A Research Roadmap. In Proceedings - 2017 IEEE 6th International Congress on Big Data, BigData Congress 2017 (pp. 232-250). [8029331] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/BigDataCongress.2017.39

Parallel Processing over Spatial-Temporal Datasets from Geo, Bio, Climate and Social Science Communities : A Research Roadmap. / Prasad, Sushil K.; Aghajarian, Danial; McDermott, Michael; Shah, Dhara; Mokbel, Mohamed; Puri, Satish; Rey, Sergio J.; Shekhar, Shashi; Xe, Yiqun; Vatsavai, Ranga Raju; Wang, Fusheng; Liang, Yanhui; Vo, Hoang; Wang, Shaowen.

Proceedings - 2017 IEEE 6th International Congress on Big Data, BigData Congress 2017. Institute of Electrical and Electronics Engineers Inc., 2017. p. 232-250 8029331.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Prasad, SK, Aghajarian, D, McDermott, M, Shah, D, Mokbel, M, Puri, S, Rey, SJ, Shekhar, S, Xe, Y, Vatsavai, RR, Wang, F, Liang, Y, Vo, H & Wang, S 2017, Parallel Processing over Spatial-Temporal Datasets from Geo, Bio, Climate and Social Science Communities: A Research Roadmap. in Proceedings - 2017 IEEE 6th International Congress on Big Data, BigData Congress 2017., 8029331, Institute of Electrical and Electronics Engineers Inc., pp. 232-250, 6th IEEE International Congress on Big Data, BigData Congress 2017, Honolulu, United States, 25/6/17. https://doi.org/10.1109/BigDataCongress.2017.39
Prasad SK, Aghajarian D, McDermott M, Shah D, Mokbel M, Puri S et al. Parallel Processing over Spatial-Temporal Datasets from Geo, Bio, Climate and Social Science Communities: A Research Roadmap. In Proceedings - 2017 IEEE 6th International Congress on Big Data, BigData Congress 2017. Institute of Electrical and Electronics Engineers Inc. 2017. p. 232-250. 8029331 https://doi.org/10.1109/BigDataCongress.2017.39
Prasad, Sushil K. ; Aghajarian, Danial ; McDermott, Michael ; Shah, Dhara ; Mokbel, Mohamed ; Puri, Satish ; Rey, Sergio J. ; Shekhar, Shashi ; Xe, Yiqun ; Vatsavai, Ranga Raju ; Wang, Fusheng ; Liang, Yanhui ; Vo, Hoang ; Wang, Shaowen. / Parallel Processing over Spatial-Temporal Datasets from Geo, Bio, Climate and Social Science Communities : A Research Roadmap. Proceedings - 2017 IEEE 6th International Congress on Big Data, BigData Congress 2017. Institute of Electrical and Electronics Engineers Inc., 2017. pp. 232-250
@inproceedings{d6fb1c3a9021482989d90cf7b8dcbfcc,
title = "Parallel Processing over Spatial-Temporal Datasets from Geo, Bio, Climate and Social Science Communities: A Research Roadmap",
abstract = "This vision paper reviews the current state-ofart and lays out emerging research challenges in parallel processing of spatial-temporal large datasets relevant to a variety of scientific communities. The spatio-temporal data, whether captured through remote sensors (global earth observations), ground and ocean sensors (e.g., soil moisture sensors, buoys), social media and hand-held, traffic-related sensors and cameras, medical imaging (e.g., MRI), or large scale simulations (e.g., climate) have always been 'big.' A common thread among all these big collections of datasets is that they are spatial and temporal. Processing and analyzing these datasets requires high-performance computing (HPC) infrastructures. Various agencies, scientific communities and increasingly the society at large rely on spatial data management, analysis, and spatial data mining to gain insights and produce actionable plans. Therefore, an ecosystem of integrated and reliable software infrastructure is required for spatialtemporal big data management and analysis that will serve as crucial tools for solving a wide set of research problems from different scientific and engineering areas and to empower users with next-generation tools. This vision requires a multidisciplinary effort to significantly advance domain research and have a broad impact on the society. The areas of research discussed in this paper include (i) spatial data mining, (ii) data analytics over remote sensing data, (iii) processing medical images, (iv) spatial econometrics analyses, (v) Map-Reducebased systems for spatial computation and visualization, (vi) CyberGIS systems, and (vii) foundational parallel algorithms and data structures for polygonal datasets, and why HPC infrastructures, including harnessing graphics accelerators, are needed for time-critical applications.",
keywords = "CyberGIS, High performance computing, Map-reduce systems, Medical images, Parallel algorithms and data structures, Remote sensing data, Spatial data mining, Spatial econometrics",
author = "Prasad, {Sushil K.} and Danial Aghajarian and Michael McDermott and Dhara Shah and Mohamed Mokbel and Satish Puri and Rey, {Sergio J.} and Shashi Shekhar and Yiqun Xe and Vatsavai, {Ranga Raju} and Fusheng Wang and Yanhui Liang and Hoang Vo and Shaowen Wang",
year = "2017",
month = "9",
day = "7",
doi = "10.1109/BigDataCongress.2017.39",
language = "English",
pages = "232--250",
booktitle = "Proceedings - 2017 IEEE 6th International Congress on Big Data, BigData Congress 2017",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - Parallel Processing over Spatial-Temporal Datasets from Geo, Bio, Climate and Social Science Communities

T2 - A Research Roadmap

AU - Prasad, Sushil K.

AU - Aghajarian, Danial

AU - McDermott, Michael

AU - Shah, Dhara

AU - Mokbel, Mohamed

AU - Puri, Satish

AU - Rey, Sergio J.

AU - Shekhar, Shashi

AU - Xe, Yiqun

AU - Vatsavai, Ranga Raju

AU - Wang, Fusheng

AU - Liang, Yanhui

AU - Vo, Hoang

AU - Wang, Shaowen

PY - 2017/9/7

Y1 - 2017/9/7

N2 - This vision paper reviews the current state-ofart and lays out emerging research challenges in parallel processing of spatial-temporal large datasets relevant to a variety of scientific communities. The spatio-temporal data, whether captured through remote sensors (global earth observations), ground and ocean sensors (e.g., soil moisture sensors, buoys), social media and hand-held, traffic-related sensors and cameras, medical imaging (e.g., MRI), or large scale simulations (e.g., climate) have always been 'big.' A common thread among all these big collections of datasets is that they are spatial and temporal. Processing and analyzing these datasets requires high-performance computing (HPC) infrastructures. Various agencies, scientific communities and increasingly the society at large rely on spatial data management, analysis, and spatial data mining to gain insights and produce actionable plans. Therefore, an ecosystem of integrated and reliable software infrastructure is required for spatialtemporal big data management and analysis that will serve as crucial tools for solving a wide set of research problems from different scientific and engineering areas and to empower users with next-generation tools. This vision requires a multidisciplinary effort to significantly advance domain research and have a broad impact on the society. The areas of research discussed in this paper include (i) spatial data mining, (ii) data analytics over remote sensing data, (iii) processing medical images, (iv) spatial econometrics analyses, (v) Map-Reducebased systems for spatial computation and visualization, (vi) CyberGIS systems, and (vii) foundational parallel algorithms and data structures for polygonal datasets, and why HPC infrastructures, including harnessing graphics accelerators, are needed for time-critical applications.

AB - This vision paper reviews the current state-ofart and lays out emerging research challenges in parallel processing of spatial-temporal large datasets relevant to a variety of scientific communities. The spatio-temporal data, whether captured through remote sensors (global earth observations), ground and ocean sensors (e.g., soil moisture sensors, buoys), social media and hand-held, traffic-related sensors and cameras, medical imaging (e.g., MRI), or large scale simulations (e.g., climate) have always been 'big.' A common thread among all these big collections of datasets is that they are spatial and temporal. Processing and analyzing these datasets requires high-performance computing (HPC) infrastructures. Various agencies, scientific communities and increasingly the society at large rely on spatial data management, analysis, and spatial data mining to gain insights and produce actionable plans. Therefore, an ecosystem of integrated and reliable software infrastructure is required for spatialtemporal big data management and analysis that will serve as crucial tools for solving a wide set of research problems from different scientific and engineering areas and to empower users with next-generation tools. This vision requires a multidisciplinary effort to significantly advance domain research and have a broad impact on the society. The areas of research discussed in this paper include (i) spatial data mining, (ii) data analytics over remote sensing data, (iii) processing medical images, (iv) spatial econometrics analyses, (v) Map-Reducebased systems for spatial computation and visualization, (vi) CyberGIS systems, and (vii) foundational parallel algorithms and data structures for polygonal datasets, and why HPC infrastructures, including harnessing graphics accelerators, are needed for time-critical applications.

KW - CyberGIS

KW - High performance computing

KW - Map-reduce systems

KW - Medical images

KW - Parallel algorithms and data structures

KW - Remote sensing data

KW - Spatial data mining

KW - Spatial econometrics

UR - http://www.scopus.com/inward/record.url?scp=85032384688&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85032384688&partnerID=8YFLogxK

U2 - 10.1109/BigDataCongress.2017.39

DO - 10.1109/BigDataCongress.2017.39

M3 - Conference contribution

SP - 232

EP - 250

BT - Proceedings - 2017 IEEE 6th International Congress on Big Data, BigData Congress 2017

PB - Institute of Electrical and Electronics Engineers Inc.

ER -