Parallel classification for data mining on shared-memory multiprocessors

Mohammed J. Zaki, Ching Tien Ho, Rakesh Agrawal

Research output: Chapter in Book/Report/Conference proceedingChapter

71 Citations (Scopus)

Abstract

We present parallel algorithms for building decision-tree classifiers on shared-memory multiprocessor (SMP) systems. The proposed algorithms span the gamut of data and task parallelism. The data parallelism is based on attribute scheduling among processors. This basic scheme is extended with task pipelining and dynamic load balancing to yield faster implementations. The task parallel approach uses dynamic subtree partitioning among processors. Our performance evaluation shows that the construction of a decision-tree classifier can be effectively parallelized on an SMP machine with good speedup.

Original languageEnglish
Title of host publicationProceedings - International Conference on Data Engineering
Place of PublicationLos Alamitos, CA, United States
PublisherInstitute of Electrical and Electronics Engineers Computer Society
Pages198-205
Number of pages8
Publication statusPublished - 1 Jan 1999
Externally publishedYes
EventProceedings of the 1999 15th International Conference on Data Engineering, ICDE-99 - Sydney, NSW, AUS
Duration: 23 Mar 199926 Mar 1999

Other

OtherProceedings of the 1999 15th International Conference on Data Engineering, ICDE-99
CitySydney, NSW, AUS
Period23/3/9926/3/99

Fingerprint

Decision trees
Data mining
Classifiers
Data storage equipment
Dynamic loads
Parallel algorithms
Resource allocation
Scheduling

ASJC Scopus subject areas

  • Software
  • Engineering(all)
  • Engineering (miscellaneous)

Cite this

Zaki, M. J., Ho, C. T., & Agrawal, R. (1999). Parallel classification for data mining on shared-memory multiprocessors. In Proceedings - International Conference on Data Engineering (pp. 198-205). Los Alamitos, CA, United States: Institute of Electrical and Electronics Engineers Computer Society.

Parallel classification for data mining on shared-memory multiprocessors. / Zaki, Mohammed J.; Ho, Ching Tien; Agrawal, Rakesh.

Proceedings - International Conference on Data Engineering. Los Alamitos, CA, United States : Institute of Electrical and Electronics Engineers Computer Society, 1999. p. 198-205.

Research output: Chapter in Book/Report/Conference proceedingChapter

Zaki, MJ, Ho, CT & Agrawal, R 1999, Parallel classification for data mining on shared-memory multiprocessors. in Proceedings - International Conference on Data Engineering. Institute of Electrical and Electronics Engineers Computer Society, Los Alamitos, CA, United States, pp. 198-205, Proceedings of the 1999 15th International Conference on Data Engineering, ICDE-99, Sydney, NSW, AUS, 23/3/99.
Zaki MJ, Ho CT, Agrawal R. Parallel classification for data mining on shared-memory multiprocessors. In Proceedings - International Conference on Data Engineering. Los Alamitos, CA, United States: Institute of Electrical and Electronics Engineers Computer Society. 1999. p. 198-205
Zaki, Mohammed J. ; Ho, Ching Tien ; Agrawal, Rakesh. / Parallel classification for data mining on shared-memory multiprocessors. Proceedings - International Conference on Data Engineering. Los Alamitos, CA, United States : Institute of Electrical and Electronics Engineers Computer Society, 1999. pp. 198-205
@inbook{86727a50bc2b49608d890433bcb10e28,
title = "Parallel classification for data mining on shared-memory multiprocessors",
abstract = "We present parallel algorithms for building decision-tree classifiers on shared-memory multiprocessor (SMP) systems. The proposed algorithms span the gamut of data and task parallelism. The data parallelism is based on attribute scheduling among processors. This basic scheme is extended with task pipelining and dynamic load balancing to yield faster implementations. The task parallel approach uses dynamic subtree partitioning among processors. Our performance evaluation shows that the construction of a decision-tree classifier can be effectively parallelized on an SMP machine with good speedup.",
author = "Zaki, {Mohammed J.} and Ho, {Ching Tien} and Rakesh Agrawal",
year = "1999",
month = "1",
day = "1",
language = "English",
pages = "198--205",
booktitle = "Proceedings - International Conference on Data Engineering",
publisher = "Institute of Electrical and Electronics Engineers Computer Society",

}

TY - CHAP

T1 - Parallel classification for data mining on shared-memory multiprocessors

AU - Zaki, Mohammed J.

AU - Ho, Ching Tien

AU - Agrawal, Rakesh

PY - 1999/1/1

Y1 - 1999/1/1

N2 - We present parallel algorithms for building decision-tree classifiers on shared-memory multiprocessor (SMP) systems. The proposed algorithms span the gamut of data and task parallelism. The data parallelism is based on attribute scheduling among processors. This basic scheme is extended with task pipelining and dynamic load balancing to yield faster implementations. The task parallel approach uses dynamic subtree partitioning among processors. Our performance evaluation shows that the construction of a decision-tree classifier can be effectively parallelized on an SMP machine with good speedup.

AB - We present parallel algorithms for building decision-tree classifiers on shared-memory multiprocessor (SMP) systems. The proposed algorithms span the gamut of data and task parallelism. The data parallelism is based on attribute scheduling among processors. This basic scheme is extended with task pipelining and dynamic load balancing to yield faster implementations. The task parallel approach uses dynamic subtree partitioning among processors. Our performance evaluation shows that the construction of a decision-tree classifier can be effectively parallelized on an SMP machine with good speedup.

UR - http://www.scopus.com/inward/record.url?scp=0032627946&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0032627946&partnerID=8YFLogxK

M3 - Chapter

SP - 198

EP - 205

BT - Proceedings - International Conference on Data Engineering

PB - Institute of Electrical and Electronics Engineers Computer Society

CY - Los Alamitos, CA, United States

ER -