Functional clustering algorithm for high-dimensional proteomics data

Halima Bensmail, Buddana Aruna, O. John Semmes, Abdelali Haoudi

Research output: Contribution to journalArticle

7 Citations (Scopus)

Abstract

Clustering proteomics data is a challenging problem for any traditional clustering algorithm. Usually, the number of samples is largely smaller than the number of protein peaks. The use of a clustering algorithm which does not take into consideration the number of features of variables (here the number of peaks) is needed. An innovative hierarchical clustering algorithm may be a good approach. We propose here a new dissimilarity measure for the hierarchical clustering combined with a functional data analysis. We present a specific application of functional data analysis (FDA) to a high-throughput proteomics study. The high performance of the proposed algorithm is compared to two popular dissimilarity measures in the clustering of normal and human T-cell leukemia virus type 1 (HTLV-1)-infected patients samples.

Original languageEnglish
Pages (from-to)80-86
Number of pages7
JournalJournal of Biomedicine and Biotechnology
Volume2005
Issue number2
DOIs
Publication statusPublished - 30 Jun 2005
Externally publishedYes

Fingerprint

Clustering algorithms
Proteomics
Cluster Analysis
T-cells
Viruses
Throughput
Deltaretrovirus
Proteins

ASJC Scopus subject areas

  • Biochemistry, Genetics and Molecular Biology(all)
  • Genetics
  • Applied Microbiology and Biotechnology

Cite this

Functional clustering algorithm for high-dimensional proteomics data. / Bensmail, Halima; Aruna, Buddana; Semmes, O. John; Haoudi, Abdelali.

In: Journal of Biomedicine and Biotechnology, Vol. 2005, No. 2, 30.06.2005, p. 80-86.

Research output: Contribution to journalArticle

Bensmail, Halima ; Aruna, Buddana ; Semmes, O. John ; Haoudi, Abdelali. / Functional clustering algorithm for high-dimensional proteomics data. In: Journal of Biomedicine and Biotechnology. 2005 ; Vol. 2005, No. 2. pp. 80-86.
@article{b6435e0842554161a5c4c2c2c85d0aee,
title = "Functional clustering algorithm for high-dimensional proteomics data",
abstract = "Clustering proteomics data is a challenging problem for any traditional clustering algorithm. Usually, the number of samples is largely smaller than the number of protein peaks. The use of a clustering algorithm which does not take into consideration the number of features of variables (here the number of peaks) is needed. An innovative hierarchical clustering algorithm may be a good approach. We propose here a new dissimilarity measure for the hierarchical clustering combined with a functional data analysis. We present a specific application of functional data analysis (FDA) to a high-throughput proteomics study. The high performance of the proposed algorithm is compared to two popular dissimilarity measures in the clustering of normal and human T-cell leukemia virus type 1 (HTLV-1)-infected patients samples.",
author = "Halima Bensmail and Buddana Aruna and Semmes, {O. John} and Abdelali Haoudi",
year = "2005",
month = "6",
day = "30",
doi = "10.1155/JBB.2005.80",
language = "English",
volume = "2005",
pages = "80--86",
journal = "BioMed Research International",
issn = "2314-6133",
publisher = "Hindawi Publishing Corporation",
number = "2",

}

TY - JOUR

T1 - Functional clustering algorithm for high-dimensional proteomics data

AU - Bensmail, Halima

AU - Aruna, Buddana

AU - Semmes, O. John

AU - Haoudi, Abdelali

PY - 2005/6/30

Y1 - 2005/6/30

N2 - Clustering proteomics data is a challenging problem for any traditional clustering algorithm. Usually, the number of samples is largely smaller than the number of protein peaks. The use of a clustering algorithm which does not take into consideration the number of features of variables (here the number of peaks) is needed. An innovative hierarchical clustering algorithm may be a good approach. We propose here a new dissimilarity measure for the hierarchical clustering combined with a functional data analysis. We present a specific application of functional data analysis (FDA) to a high-throughput proteomics study. The high performance of the proposed algorithm is compared to two popular dissimilarity measures in the clustering of normal and human T-cell leukemia virus type 1 (HTLV-1)-infected patients samples.

AB - Clustering proteomics data is a challenging problem for any traditional clustering algorithm. Usually, the number of samples is largely smaller than the number of protein peaks. The use of a clustering algorithm which does not take into consideration the number of features of variables (here the number of peaks) is needed. An innovative hierarchical clustering algorithm may be a good approach. We propose here a new dissimilarity measure for the hierarchical clustering combined with a functional data analysis. We present a specific application of functional data analysis (FDA) to a high-throughput proteomics study. The high performance of the proposed algorithm is compared to two popular dissimilarity measures in the clustering of normal and human T-cell leukemia virus type 1 (HTLV-1)-infected patients samples.

UR - http://www.scopus.com/inward/record.url?scp=27744537866&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=27744537866&partnerID=8YFLogxK

U2 - 10.1155/JBB.2005.80

DO - 10.1155/JBB.2005.80

M3 - Article

VL - 2005

SP - 80

EP - 86

JO - BioMed Research International

JF - BioMed Research International

SN - 2314-6133

IS - 2

ER -