On integrated clustering and outlier detection

Lionel Ott, Linsey Pang, Fabio Ramos, Sanjay Chawla

Research output: Chapter in Book/Report/Conference proceedingConference contribution

17 Citations (Scopus)

Abstract

We model the joint clustering and outlier detection problem using an extension of the facility location formulation. The advantages of combining clustering and outlier selection include: (i) the resulting clusters tend to be compact and semantically coherent (ii) the clusters are more robust against data perturbations and (iii) the outliers are contextualised by the clusters and more interpretable. We provide a practical subgradient-based algorithm for the problem and also study the theoretical properties of algorithm in terms of approximation and convergence. Extensive evaluation on synthetic and real data sets attest to both the quality and scalability of our proposed method.

Original languageEnglish
Title of host publicationAdvances in Neural Information Processing Systems
PublisherNeural information processing systems foundation
Pages1359-1367
Number of pages9
Volume2
EditionJanuary
Publication statusPublished - 2014
Externally publishedYes
Event28th Annual Conference on Neural Information Processing Systems 2014, NIPS 2014 - Montreal, Canada
Duration: 8 Dec 201413 Dec 2014

Other

Other28th Annual Conference on Neural Information Processing Systems 2014, NIPS 2014
CountryCanada
CityMontreal
Period8/12/1413/12/14

Fingerprint

Scalability

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Information Systems
  • Signal Processing

Cite this

Ott, L., Pang, L., Ramos, F., & Chawla, S. (2014). On integrated clustering and outlier detection. In Advances in Neural Information Processing Systems (January ed., Vol. 2, pp. 1359-1367). Neural information processing systems foundation.

On integrated clustering and outlier detection. / Ott, Lionel; Pang, Linsey; Ramos, Fabio; Chawla, Sanjay.

Advances in Neural Information Processing Systems. Vol. 2 January. ed. Neural information processing systems foundation, 2014. p. 1359-1367.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Ott, L, Pang, L, Ramos, F & Chawla, S 2014, On integrated clustering and outlier detection. in Advances in Neural Information Processing Systems. January edn, vol. 2, Neural information processing systems foundation, pp. 1359-1367, 28th Annual Conference on Neural Information Processing Systems 2014, NIPS 2014, Montreal, Canada, 8/12/14.
Ott L, Pang L, Ramos F, Chawla S. On integrated clustering and outlier detection. In Advances in Neural Information Processing Systems. January ed. Vol. 2. Neural information processing systems foundation. 2014. p. 1359-1367
Ott, Lionel ; Pang, Linsey ; Ramos, Fabio ; Chawla, Sanjay. / On integrated clustering and outlier detection. Advances in Neural Information Processing Systems. Vol. 2 January. ed. Neural information processing systems foundation, 2014. pp. 1359-1367
@inproceedings{c9854759d58846ce8dbf4ce073e47c6c,
title = "On integrated clustering and outlier detection",
abstract = "We model the joint clustering and outlier detection problem using an extension of the facility location formulation. The advantages of combining clustering and outlier selection include: (i) the resulting clusters tend to be compact and semantically coherent (ii) the clusters are more robust against data perturbations and (iii) the outliers are contextualised by the clusters and more interpretable. We provide a practical subgradient-based algorithm for the problem and also study the theoretical properties of algorithm in terms of approximation and convergence. Extensive evaluation on synthetic and real data sets attest to both the quality and scalability of our proposed method.",
author = "Lionel Ott and Linsey Pang and Fabio Ramos and Sanjay Chawla",
year = "2014",
language = "English",
volume = "2",
pages = "1359--1367",
booktitle = "Advances in Neural Information Processing Systems",
publisher = "Neural information processing systems foundation",
edition = "January",

}

TY - GEN

T1 - On integrated clustering and outlier detection

AU - Ott, Lionel

AU - Pang, Linsey

AU - Ramos, Fabio

AU - Chawla, Sanjay

PY - 2014

Y1 - 2014

N2 - We model the joint clustering and outlier detection problem using an extension of the facility location formulation. The advantages of combining clustering and outlier selection include: (i) the resulting clusters tend to be compact and semantically coherent (ii) the clusters are more robust against data perturbations and (iii) the outliers are contextualised by the clusters and more interpretable. We provide a practical subgradient-based algorithm for the problem and also study the theoretical properties of algorithm in terms of approximation and convergence. Extensive evaluation on synthetic and real data sets attest to both the quality and scalability of our proposed method.

AB - We model the joint clustering and outlier detection problem using an extension of the facility location formulation. The advantages of combining clustering and outlier selection include: (i) the resulting clusters tend to be compact and semantically coherent (ii) the clusters are more robust against data perturbations and (iii) the outliers are contextualised by the clusters and more interpretable. We provide a practical subgradient-based algorithm for the problem and also study the theoretical properties of algorithm in terms of approximation and convergence. Extensive evaluation on synthetic and real data sets attest to both the quality and scalability of our proposed method.

UR - http://www.scopus.com/inward/record.url?scp=84937938726&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84937938726&partnerID=8YFLogxK

M3 - Conference contribution

VL - 2

SP - 1359

EP - 1367

BT - Advances in Neural Information Processing Systems

PB - Neural information processing systems foundation

ER -