Comparative behaviour of recent incremental and non-incremental clustering methods on text

An extended study

Jean Charles Lamirel, RaghvenPhDa Mall, Mumtaz Ahmad

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

This paper represents an attempt to throw some light on the quality and on the defects of some recent clustering methods, either they are incremental or not, on "real world data". An extended evaluation of the methods is achieved through the use of textual datasets of increasing complexity. The third test dataset is a highly polythematic dataset that figures out a static simulation of evolving data. It thus represents an interesting benchmark for comparing the behaviour of incremental and non incremental methods. The focus is put on neural clustering methods but the standard K-means method is included as reference in the comparison. Generic quality measures are used for quality evaluation.

Original languageEnglish
Title of host publicationModern Approaches in Applied Intelligence - 24th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2011, Proceedings
Pages19-28
Number of pages10
EditionPART 1
DOIs
Publication statusPublished - 25 Jul 2011
Externally publishedYes
Event24th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2011 - Syracuse, NY, United States
Duration: 28 Jun 20111 Jul 2011

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume6703 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other24th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2011
CountryUnited States
CitySyracuse, NY
Period28/6/111/7/11

Fingerprint

Clustering Methods
Defects
Quality Evaluation
Quality Measures
K-means
Figure
Benchmark
Evaluation
Text
Simulation

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Lamirel, J. C., Mall, R., & Ahmad, M. (2011). Comparative behaviour of recent incremental and non-incremental clustering methods on text: An extended study. In Modern Approaches in Applied Intelligence - 24th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2011, Proceedings (PART 1 ed., pp. 19-28). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6703 LNAI, No. PART 1). https://doi.org/10.1007/978-3-642-21822-4_3

Comparative behaviour of recent incremental and non-incremental clustering methods on text : An extended study. / Lamirel, Jean Charles; Mall, RaghvenPhDa; Ahmad, Mumtaz.

Modern Approaches in Applied Intelligence - 24th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2011, Proceedings. PART 1. ed. 2011. p. 19-28 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6703 LNAI, No. PART 1).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Lamirel, JC, Mall, R & Ahmad, M 2011, Comparative behaviour of recent incremental and non-incremental clustering methods on text: An extended study. in Modern Approaches in Applied Intelligence - 24th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2011, Proceedings. PART 1 edn, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), no. PART 1, vol. 6703 LNAI, pp. 19-28, 24th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2011, Syracuse, NY, United States, 28/6/11. https://doi.org/10.1007/978-3-642-21822-4_3
Lamirel JC, Mall R, Ahmad M. Comparative behaviour of recent incremental and non-incremental clustering methods on text: An extended study. In Modern Approaches in Applied Intelligence - 24th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2011, Proceedings. PART 1 ed. 2011. p. 19-28. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 1). https://doi.org/10.1007/978-3-642-21822-4_3
Lamirel, Jean Charles ; Mall, RaghvenPhDa ; Ahmad, Mumtaz. / Comparative behaviour of recent incremental and non-incremental clustering methods on text : An extended study. Modern Approaches in Applied Intelligence - 24th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2011, Proceedings. PART 1. ed. 2011. pp. 19-28 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 1).
@inproceedings{70dd544b8c3e4c23a81612c5f415cdf4,
title = "Comparative behaviour of recent incremental and non-incremental clustering methods on text: An extended study",
abstract = "This paper represents an attempt to throw some light on the quality and on the defects of some recent clustering methods, either they are incremental or not, on {"}real world data{"}. An extended evaluation of the methods is achieved through the use of textual datasets of increasing complexity. The third test dataset is a highly polythematic dataset that figures out a static simulation of evolving data. It thus represents an interesting benchmark for comparing the behaviour of incremental and non incremental methods. The focus is put on neural clustering methods but the standard K-means method is included as reference in the comparison. Generic quality measures are used for quality evaluation.",
author = "Lamirel, {Jean Charles} and RaghvenPhDa Mall and Mumtaz Ahmad",
year = "2011",
month = "7",
day = "25",
doi = "10.1007/978-3-642-21822-4_3",
language = "English",
isbn = "9783642218217",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
number = "PART 1",
pages = "19--28",
booktitle = "Modern Approaches in Applied Intelligence - 24th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2011, Proceedings",
edition = "PART 1",

}

TY - GEN

T1 - Comparative behaviour of recent incremental and non-incremental clustering methods on text

T2 - An extended study

AU - Lamirel, Jean Charles

AU - Mall, RaghvenPhDa

AU - Ahmad, Mumtaz

PY - 2011/7/25

Y1 - 2011/7/25

N2 - This paper represents an attempt to throw some light on the quality and on the defects of some recent clustering methods, either they are incremental or not, on "real world data". An extended evaluation of the methods is achieved through the use of textual datasets of increasing complexity. The third test dataset is a highly polythematic dataset that figures out a static simulation of evolving data. It thus represents an interesting benchmark for comparing the behaviour of incremental and non incremental methods. The focus is put on neural clustering methods but the standard K-means method is included as reference in the comparison. Generic quality measures are used for quality evaluation.

AB - This paper represents an attempt to throw some light on the quality and on the defects of some recent clustering methods, either they are incremental or not, on "real world data". An extended evaluation of the methods is achieved through the use of textual datasets of increasing complexity. The third test dataset is a highly polythematic dataset that figures out a static simulation of evolving data. It thus represents an interesting benchmark for comparing the behaviour of incremental and non incremental methods. The focus is put on neural clustering methods but the standard K-means method is included as reference in the comparison. Generic quality measures are used for quality evaluation.

UR - http://www.scopus.com/inward/record.url?scp=79960485125&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79960485125&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-21822-4_3

DO - 10.1007/978-3-642-21822-4_3

M3 - Conference contribution

SN - 9783642218217

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 19

EP - 28

BT - Modern Approaches in Applied Intelligence - 24th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2011, Proceedings

ER -