Parallel sequence mining on shared-memory machines

Mohammed J. Zaki

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

We present pSPADE, a parallel algorithm for fast discovery of frequent sequences in large databases. pSPADE decomposes the original search space into smaller suffix-based classes. Each class can be solved in main memory using efficient search techniques and simple join operations. Further, each class can be solved independently on each processor, requiring no synchronization. However, dynamic inter-class and intra-class load balancing must be exploited to ensure that each processor gets an equal amount of work. Experiments on a 12-processor SGI Origin 2000 shared-memory system show good speedup and excellent scaleup results.
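
The scheduling idea in the abstract (independent classes handed out dynamically so that no processor sits idle) can be illustrated with a minimal sketch. This is not the pSPADE implementation: the function names are hypothetical, grouping by the last item is only an assumed reading of "suffix-based class", the per-class search is replaced by a placeholder that counts members, and only inter-class balancing is shown. Python threads are used purely to show the shared work queue, not to demonstrate real speedup.

```python
import threading
from collections import defaultdict
from queue import Queue, Empty


def group_by_suffix(sequences):
    """Group sequences (tuples of items) by their last item.

    Assumption: a length-1 suffix stands in for a 'suffix-based class'.
    """
    classes = defaultdict(list)
    for seq in sequences:
        classes[seq[-1]].append(seq)
    return dict(classes)


def process_class(suffix, members):
    """Placeholder for the per-class search; it only counts the members."""
    return suffix, len(members)


def mine_classes(classes, num_workers=4):
    """Process each class independently, pulling work from a shared queue."""
    tasks = Queue()
    # Enqueue the largest classes first so the heaviest work starts early.
    for suffix, members in sorted(classes.items(), key=lambda kv: -len(kv[1])):
        tasks.put((suffix, members))

    results = {}
    lock = threading.Lock()

    def worker():
        # A worker that finishes early simply takes the next pending class;
        # this is the dynamic (inter-class) part of the load balancing.
        while True:
            try:
                suffix, members = tasks.get_nowait()
            except Empty:
                return
            key, value = process_class(suffix, members)
            with lock:  # only the shared result table needs protection
                results[key] = value

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results


if __name__ == "__main__":
    toy = [("a", "b"), ("c", "b"), ("a", "c"), ("b",)]
    print(mine_classes(group_by_suffix(toy), num_workers=2))
```

The classes themselves need no synchronization in this sketch; the only shared state is the task queue and the result table, which mirrors the abstract's point that the classes can be solved independently.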

Original language: English
Title of host publication: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Publisher: Springer Verlag
Pages: 161-189
Number of pages: 29
Volume: 1759
ISBN (Print): 3540671943, 9783540671947
Publication status: Published - 2002
Externally published: Yes
Event: 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1999 - San Diego, United States
Duration: 15 Aug 1999 to 15 Aug 1999

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 1759
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1999
Country: United States
City: San Diego
Period: 15/8/99 to 15/8/99

ASJC Scopus subject areas

  • Computer Science (all)
  • Theoretical Computer Science

Cite this

Zaki, M. J. (2002). Parallel sequence mining on shared-memory machines. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1759, pp. 161-189). Springer Verlag.