PRISM

A prime-encoding approach for frequent sequence mining

Karam Gouda, Mosab Hassaan, Mohammed J. Zaki

Research output: Chapter in Book/Report/Conference proceedingConference contribution

16 Citations (Scopus)

Abstract

Sequence mining is one of the fundamental data mining tasks. In this paper we present a novel approach called PRISM, for mining frequent sequences. PRISM utilizes a vertical approach for enumeration and support counting, based on the novel notion of prime block encoding, which in turn is based on prime factorization theory. Via an extensive evaluation on both synthetic and real datasets, we show that PRISM outperforms popular sequence mining methods like SPADE [10], PrefixSpan [6] and SPAM [2], by an order of magnitude or more.

Original languageEnglish
Title of host publicationProceedings - IEEE International Conference on Data Mining, ICDM
Pages487-492
Number of pages6
DOIs
Publication statusPublished - 1 Dec 2007
Externally publishedYes
Event7th IEEE International Conference on Data Mining, ICDM 2007 - Omaha, NE, United States
Duration: 28 Oct 200731 Oct 2007

Other

Other7th IEEE International Conference on Data Mining, ICDM 2007
CountryUnited States
CityOmaha, NE
Period28/10/0731/10/07

Fingerprint

Factorization
Data mining

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Gouda, K., Hassaan, M., & Zaki, M. J. (2007). PRISM: A prime-encoding approach for frequent sequence mining. In Proceedings - IEEE International Conference on Data Mining, ICDM (pp. 487-492). [4470278] https://doi.org/10.1109/ICDM.2007.33

PRISM : A prime-encoding approach for frequent sequence mining. / Gouda, Karam; Hassaan, Mosab; Zaki, Mohammed J.

Proceedings - IEEE International Conference on Data Mining, ICDM. 2007. p. 487-492 4470278.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Gouda, K, Hassaan, M & Zaki, MJ 2007, PRISM: A prime-encoding approach for frequent sequence mining. in Proceedings - IEEE International Conference on Data Mining, ICDM., 4470278, pp. 487-492, 7th IEEE International Conference on Data Mining, ICDM 2007, Omaha, NE, United States, 28/10/07. https://doi.org/10.1109/ICDM.2007.33
Gouda K, Hassaan M, Zaki MJ. PRISM: A prime-encoding approach for frequent sequence mining. In Proceedings - IEEE International Conference on Data Mining, ICDM. 2007. p. 487-492. 4470278 https://doi.org/10.1109/ICDM.2007.33
Gouda, Karam ; Hassaan, Mosab ; Zaki, Mohammed J. / PRISM : A prime-encoding approach for frequent sequence mining. Proceedings - IEEE International Conference on Data Mining, ICDM. 2007. pp. 487-492
@inproceedings{aaf11c83bbad43a8898940921179a110,
title = "PRISM: A prime-encoding approach for frequent sequence mining",
abstract = "Sequence mining is one of the fundamental data mining tasks. In this paper we present a novel approach called PRISM, for mining frequent sequences. PRISM utilizes a vertical approach for enumeration and support counting, based on the novel notion of prime block encoding, which in turn is based on prime factorization theory. Via an extensive evaluation on both synthetic and real datasets, we show that PRISM outperforms popular sequence mining methods like SPADE [10], PrefixSpan [6] and SPAM [2], by an order of magnitude or more.",
author = "Karam Gouda and Mosab Hassaan and Zaki, {Mohammed J.}",
year = "2007",
month = "12",
day = "1",
doi = "10.1109/ICDM.2007.33",
language = "English",
isbn = "0769530184",
pages = "487--492",
booktitle = "Proceedings - IEEE International Conference on Data Mining, ICDM",

}

TY - GEN

T1 - PRISM

T2 - A prime-encoding approach for frequent sequence mining

AU - Gouda, Karam

AU - Hassaan, Mosab

AU - Zaki, Mohammed J.

PY - 2007/12/1

Y1 - 2007/12/1

N2 - Sequence mining is one of the fundamental data mining tasks. In this paper we present a novel approach called PRISM, for mining frequent sequences. PRISM utilizes a vertical approach for enumeration and support counting, based on the novel notion of prime block encoding, which in turn is based on prime factorization theory. Via an extensive evaluation on both synthetic and real datasets, we show that PRISM outperforms popular sequence mining methods like SPADE [10], PrefixSpan [6] and SPAM [2], by an order of magnitude or more.

AB - Sequence mining is one of the fundamental data mining tasks. In this paper we present a novel approach called PRISM, for mining frequent sequences. PRISM utilizes a vertical approach for enumeration and support counting, based on the novel notion of prime block encoding, which in turn is based on prime factorization theory. Via an extensive evaluation on both synthetic and real datasets, we show that PRISM outperforms popular sequence mining methods like SPADE [10], PrefixSpan [6] and SPAM [2], by an order of magnitude or more.

UR - http://www.scopus.com/inward/record.url?scp=48949110139&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=48949110139&partnerID=8YFLogxK

U2 - 10.1109/ICDM.2007.33

DO - 10.1109/ICDM.2007.33

M3 - Conference contribution

SN - 0769530184

SN - 9780769530185

SP - 487

EP - 492

BT - Proceedings - IEEE International Conference on Data Mining, ICDM

ER -