SCHISM

A new approach for interesting subspace mining

Karlton Sequeira, Mohammed Zaki

Research output: Chapter in Book/Report/Conference proceedingConference contribution

62 Citations (Scopus)

Abstract

High-dimensional data pose challenges to traditional clustering algorithms due to their inherent sparsity and data tend to cluster in different and possibly overlapping subspaces of the entire feature space. Finding such subspaces is called subspace mining. We present SCHISM, a new algorithm for mining interesting subspaces, using the notions of support and Chernoff-Hoeffding bounds. We use a vertical representation of the dataset, and use a depth-first search with backtracking to find maximal interesting subspaces. We test our algorithm on a number of high-dimensional synthetic and real datasets to test its effectiveness.

Original languageEnglish
Title of host publicationProceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004
EditorsR. Rastogi, K. Morik, M. Bramer, X. Wu
Pages186-193
Number of pages8
DOIs
Publication statusPublished - 1 Dec 2004
Externally publishedYes
EventProceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004 - Brighton, United Kingdom
Duration: 1 Nov 20044 Nov 2004

Other

OtherProceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004
CountryUnited Kingdom
CityBrighton
Period1/11/044/11/04

Fingerprint

Clustering algorithms

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Sequeira, K., & Zaki, M. (2004). SCHISM: A new approach for interesting subspace mining. In R. Rastogi, K. Morik, M. Bramer, & X. Wu (Eds.), Proceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004 (pp. 186-193) https://doi.org/10.1109/ICDM.2004.10099

SCHISM : A new approach for interesting subspace mining. / Sequeira, Karlton; Zaki, Mohammed.

Proceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004. ed. / R. Rastogi; K. Morik; M. Bramer; X. Wu. 2004. p. 186-193.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Sequeira, K & Zaki, M 2004, SCHISM: A new approach for interesting subspace mining. in R Rastogi, K Morik, M Bramer & X Wu (eds), Proceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004. pp. 186-193, Proceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004, Brighton, United Kingdom, 1/11/04. https://doi.org/10.1109/ICDM.2004.10099
Sequeira K, Zaki M. SCHISM: A new approach for interesting subspace mining. In Rastogi R, Morik K, Bramer M, Wu X, editors, Proceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004. 2004. p. 186-193 https://doi.org/10.1109/ICDM.2004.10099
Sequeira, Karlton ; Zaki, Mohammed. / SCHISM : A new approach for interesting subspace mining. Proceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004. editor / R. Rastogi ; K. Morik ; M. Bramer ; X. Wu. 2004. pp. 186-193
@inproceedings{629f3c0dc74947039097466429aebcf7,
title = "SCHISM: A new approach for interesting subspace mining",
abstract = "High-dimensional data pose challenges to traditional clustering algorithms due to their inherent sparsity and data tend to cluster in different and possibly overlapping subspaces of the entire feature space. Finding such subspaces is called subspace mining. We present SCHISM, a new algorithm for mining interesting subspaces, using the notions of support and Chernoff-Hoeffding bounds. We use a vertical representation of the dataset, and use a depth-first search with backtracking to find maximal interesting subspaces. We test our algorithm on a number of high-dimensional synthetic and real datasets to test its effectiveness.",
author = "Karlton Sequeira and Mohammed Zaki",
year = "2004",
month = "12",
day = "1",
doi = "10.1109/ICDM.2004.10099",
language = "English",
isbn = "0769521428",
pages = "186--193",
editor = "R. Rastogi and K. Morik and M. Bramer and X. Wu",
booktitle = "Proceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004",

}

TY - GEN

T1 - SCHISM

T2 - A new approach for interesting subspace mining

AU - Sequeira, Karlton

AU - Zaki, Mohammed

PY - 2004/12/1

Y1 - 2004/12/1

N2 - High-dimensional data pose challenges to traditional clustering algorithms due to their inherent sparsity and data tend to cluster in different and possibly overlapping subspaces of the entire feature space. Finding such subspaces is called subspace mining. We present SCHISM, a new algorithm for mining interesting subspaces, using the notions of support and Chernoff-Hoeffding bounds. We use a vertical representation of the dataset, and use a depth-first search with backtracking to find maximal interesting subspaces. We test our algorithm on a number of high-dimensional synthetic and real datasets to test its effectiveness.

AB - High-dimensional data pose challenges to traditional clustering algorithms due to their inherent sparsity and data tend to cluster in different and possibly overlapping subspaces of the entire feature space. Finding such subspaces is called subspace mining. We present SCHISM, a new algorithm for mining interesting subspaces, using the notions of support and Chernoff-Hoeffding bounds. We use a vertical representation of the dataset, and use a depth-first search with backtracking to find maximal interesting subspaces. We test our algorithm on a number of high-dimensional synthetic and real datasets to test its effectiveness.

UR - http://www.scopus.com/inward/record.url?scp=19544389465&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=19544389465&partnerID=8YFLogxK

U2 - 10.1109/ICDM.2004.10099

DO - 10.1109/ICDM.2004.10099

M3 - Conference contribution

SN - 0769521428

SN - 9780769521428

SP - 186

EP - 193

BT - Proceedings - Fourth IEEE International Conference on Data Mining, ICDM 2004

A2 - Rastogi, R.

A2 - Morik, K.

A2 - Bramer, M.

A2 - Wu, X.

ER -