A fuzzy extension of some classical concordance measures and an efficient algorithm for their computation

Michele Ceccarelli, Antonio Maratea

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

Many indexes have been proposed in literature for the comparison of two crisp data partitions, as resulting from two different classifications attempts, two different clustering solutions or the comparison of a predicted vs. a true labeling. Most of these indexes implementations have a computational cost of O(N 2) (where N is the number of data points) and this fact may limit their usage in very big datasets or their integration in computational-intensive validation strategies. Furthermore, their extension to fuzzy partitions is not obvious. In this paper we analyze efficient algorithms to compute many classical indexes (most notably the Jaccard coefficient and the Rand index) in O(d 2∈+∈N) (where d is the number of different classes/clusters) and propose a straightforward procedure to extend their computation to fuzzy partitions. The fuzzy extension is based on a pseudo-count concept and provides a natural framework for including memberships in computation of binary similarity indexes, not limited to the ones here revised. Results on simulated data using the Jaccard coefficient highlight a higher consistence of its proposed fuzzy extension with respect to its crisp counterpart.

Original languageEnglish
Title of host publicationKnowledge-Based Intelligent Information and Engineering Systems - 12th International Conference, KES 2008, Proceedings
Pages755-763
Number of pages9
EditionPART 3
DOIs
Publication statusPublished - 24 Dec 2008
Externally publishedYes
Event12th International Conference on Knowledge-Based Intelligent Information and Engineering Systems, KES 2008 - Zagreb, Croatia
Duration: 3 Sep 20085 Sep 2008

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 3
Volume5179 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other12th International Conference on Knowledge-Based Intelligent Information and Engineering Systems, KES 2008
CountryCroatia
CityZagreb
Period3/9/085/9/08

Keywords

  • Cluster stability
  • Concordance measure
  • Efficient algorithm
  • Validity index

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'A fuzzy extension of some classical concordance measures and an efficient algorithm for their computation'. Together they form a unique fingerprint.

  • Cite this

    Ceccarelli, M., & Maratea, A. (2008). A fuzzy extension of some classical concordance measures and an efficient algorithm for their computation. In Knowledge-Based Intelligent Information and Engineering Systems - 12th International Conference, KES 2008, Proceedings (PART 3 ed., pp. 755-763). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5179 LNAI, No. PART 3). https://doi.org/10.1007/978-3-540-85567-5-94