High performance statistical computing with parallel R

Applications to biology and climate modelling

Nagiza F. Samatova, Marcia Branstetter, Auroop R. Ganguly, Robert Hettich, Shiraj Khan, Guruprasad Kora, Jiangtian Li, Xiaosong Ma, Chongle Pan, Arie Shoshani, Srikanth Yoginath

Research output: Contribution to journalArticle

7 Citations (Scopus)

Abstract

Ultrascale computing and high-throughput experimental technologies have enabled the production of scientific data about complex natural phenomena. With this opportunity, comes a new problem - the massive quantities of data so produced. Answers to fundamental questions about the nature of those phenomena remain largely hidden in the produced data. The goal of this work is to provide a scalable high performance statistical data analysis framework to help scientists perform interactive analyses of these raw data to extract knowledge. Towards this goal we have been developing an open source parallel statistical analysis package, called Parallel R, that lets scientists employ a wide range of statistical analysis routines on high performance shared and distributed memory architectures without having to deal with the intricacies of parallelizing these routines.

Original languageEnglish
Article number069
Pages (from-to)505-509
Number of pages5
JournalJournal of Physics: Conference Series
Volume46
Issue number1
DOIs
Publication statusPublished - 1 Oct 2006
Externally publishedYes

Fingerprint

biology
statistical analysis
climate
distributed memory

ASJC Scopus subject areas

  • Physics and Astronomy(all)

Cite this

Samatova, N. F., Branstetter, M., Ganguly, A. R., Hettich, R., Khan, S., Kora, G., ... Yoginath, S. (2006). High performance statistical computing with parallel R: Applications to biology and climate modelling. Journal of Physics: Conference Series, 46(1), 505-509. [069]. https://doi.org/10.1088/1742-6596/46/1/069

High performance statistical computing with parallel R : Applications to biology and climate modelling. / Samatova, Nagiza F.; Branstetter, Marcia; Ganguly, Auroop R.; Hettich, Robert; Khan, Shiraj; Kora, Guruprasad; Li, Jiangtian; Ma, Xiaosong; Pan, Chongle; Shoshani, Arie; Yoginath, Srikanth.

In: Journal of Physics: Conference Series, Vol. 46, No. 1, 069, 01.10.2006, p. 505-509.

Research output: Contribution to journalArticle

Samatova, NF, Branstetter, M, Ganguly, AR, Hettich, R, Khan, S, Kora, G, Li, J, Ma, X, Pan, C, Shoshani, A & Yoginath, S 2006, 'High performance statistical computing with parallel R: Applications to biology and climate modelling', Journal of Physics: Conference Series, vol. 46, no. 1, 069, pp. 505-509. https://doi.org/10.1088/1742-6596/46/1/069
Samatova, Nagiza F. ; Branstetter, Marcia ; Ganguly, Auroop R. ; Hettich, Robert ; Khan, Shiraj ; Kora, Guruprasad ; Li, Jiangtian ; Ma, Xiaosong ; Pan, Chongle ; Shoshani, Arie ; Yoginath, Srikanth. / High performance statistical computing with parallel R : Applications to biology and climate modelling. In: Journal of Physics: Conference Series. 2006 ; Vol. 46, No. 1. pp. 505-509.
@article{aedfc8c992054fa4bba180681cc63013,
title = "High performance statistical computing with parallel R: Applications to biology and climate modelling",
abstract = "Ultrascale computing and high-throughput experimental technologies have enabled the production of scientific data about complex natural phenomena. With this opportunity, comes a new problem - the massive quantities of data so produced. Answers to fundamental questions about the nature of those phenomena remain largely hidden in the produced data. The goal of this work is to provide a scalable high performance statistical data analysis framework to help scientists perform interactive analyses of these raw data to extract knowledge. Towards this goal we have been developing an open source parallel statistical analysis package, called Parallel R, that lets scientists employ a wide range of statistical analysis routines on high performance shared and distributed memory architectures without having to deal with the intricacies of parallelizing these routines.",
author = "Samatova, {Nagiza F.} and Marcia Branstetter and Ganguly, {Auroop R.} and Robert Hettich and Shiraj Khan and Guruprasad Kora and Jiangtian Li and Xiaosong Ma and Chongle Pan and Arie Shoshani and Srikanth Yoginath",
year = "2006",
month = "10",
day = "1",
doi = "10.1088/1742-6596/46/1/069",
language = "English",
volume = "46",
pages = "505--509",
journal = "Journal of Physics: Conference Series",
issn = "1742-6588",
publisher = "IOP Publishing Ltd.",
number = "1",

}

TY - JOUR

T1 - High performance statistical computing with parallel R

T2 - Applications to biology and climate modelling

AU - Samatova, Nagiza F.

AU - Branstetter, Marcia

AU - Ganguly, Auroop R.

AU - Hettich, Robert

AU - Khan, Shiraj

AU - Kora, Guruprasad

AU - Li, Jiangtian

AU - Ma, Xiaosong

AU - Pan, Chongle

AU - Shoshani, Arie

AU - Yoginath, Srikanth

PY - 2006/10/1

Y1 - 2006/10/1

N2 - Ultrascale computing and high-throughput experimental technologies have enabled the production of scientific data about complex natural phenomena. With this opportunity, comes a new problem - the massive quantities of data so produced. Answers to fundamental questions about the nature of those phenomena remain largely hidden in the produced data. The goal of this work is to provide a scalable high performance statistical data analysis framework to help scientists perform interactive analyses of these raw data to extract knowledge. Towards this goal we have been developing an open source parallel statistical analysis package, called Parallel R, that lets scientists employ a wide range of statistical analysis routines on high performance shared and distributed memory architectures without having to deal with the intricacies of parallelizing these routines.

AB - Ultrascale computing and high-throughput experimental technologies have enabled the production of scientific data about complex natural phenomena. With this opportunity, comes a new problem - the massive quantities of data so produced. Answers to fundamental questions about the nature of those phenomena remain largely hidden in the produced data. The goal of this work is to provide a scalable high performance statistical data analysis framework to help scientists perform interactive analyses of these raw data to extract knowledge. Towards this goal we have been developing an open source parallel statistical analysis package, called Parallel R, that lets scientists employ a wide range of statistical analysis routines on high performance shared and distributed memory architectures without having to deal with the intricacies of parallelizing these routines.

UR - http://www.scopus.com/inward/record.url?scp=33749049541&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33749049541&partnerID=8YFLogxK

U2 - 10.1088/1742-6596/46/1/069

DO - 10.1088/1742-6596/46/1/069

M3 - Article

VL - 46

SP - 505

EP - 509

JO - Journal of Physics: Conference Series

JF - Journal of Physics: Conference Series

SN - 1742-6588

IS - 1

M1 - 069

ER -