EXMOTIF: Efficient structured motif extraction

Yongqiang Zhang, Mohammed J. Zaki

Research output: Contribution to journalArticle

34 Citations (Scopus)

Abstract

Background: Extracting motifs from sequences is a mainstay of bioinformatics. We look at the problem of mining structured motifs, which allow variable length gaps between simple motif components. We propose an efficient algorithm, called EXMOTIF, that given some sequence(s), and a structured motif template, extracts all frequent structured motifs that have quorum q. Potential applications of our method include the extraction of single/composite regulatory binding sites in DNA sequences. Results: EXMOTIF is efficient in terms of both time and space and is shown empirically to outperform RISO, a state-of-the-art algorithm. It is also successful in finding potential single/composite transcription factor binding sites. Conclusion: EXMOTIF is a useful and efficient tool in discovering structured motifs, especially in DNA sequences. The algorithm is available as open-source at: http://www.cs.rpi.edu/~zaki/software/ exMotif/.

Original languageEnglish
Article number21
JournalAlgorithms for Molecular Biology
Volume1
Issue number1
DOIs
Publication statusPublished - 16 Nov 2006
Externally publishedYes

Fingerprint

DNA Sequence
DNA sequences
Composite
Binding sites
Quorum
Transcription Factor
Binding Sites
Open Source
Template
Mining
Bioinformatics
Transcription factors
Efficient Algorithms
Composite materials
Computational Biology
Software
Transcription Factors
Background

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Applied Mathematics
  • Molecular Biology
  • Structural Biology

Cite this

EXMOTIF : Efficient structured motif extraction. / Zhang, Yongqiang; Zaki, Mohammed J.

In: Algorithms for Molecular Biology, Vol. 1, No. 1, 21, 16.11.2006.

Research output: Contribution to journalArticle

Zhang, Yongqiang ; Zaki, Mohammed J. / EXMOTIF : Efficient structured motif extraction. In: Algorithms for Molecular Biology. 2006 ; Vol. 1, No. 1.
@article{f0893c13db6d447889af2a7351cd38f4,
title = "EXMOTIF: Efficient structured motif extraction",
abstract = "Background: Extracting motifs from sequences is a mainstay of bioinformatics. We look at the problem of mining structured motifs, which allow variable length gaps between simple motif components. We propose an efficient algorithm, called EXMOTIF, that given some sequence(s), and a structured motif template, extracts all frequent structured motifs that have quorum q. Potential applications of our method include the extraction of single/composite regulatory binding sites in DNA sequences. Results: EXMOTIF is efficient in terms of both time and space and is shown empirically to outperform RISO, a state-of-the-art algorithm. It is also successful in finding potential single/composite transcription factor binding sites. Conclusion: EXMOTIF is a useful and efficient tool in discovering structured motifs, especially in DNA sequences. The algorithm is available as open-source at: http://www.cs.rpi.edu/~zaki/software/ exMotif/.",
author = "Yongqiang Zhang and Zaki, {Mohammed J.}",
year = "2006",
month = "11",
day = "16",
doi = "10.1186/1748-7188-1-21",
language = "English",
volume = "1",
journal = "Algorithms for Molecular Biology",
issn = "1748-7188",
publisher = "BioMed Central",
number = "1",

}

TY - JOUR

T1 - EXMOTIF

T2 - Efficient structured motif extraction

AU - Zhang, Yongqiang

AU - Zaki, Mohammed J.

PY - 2006/11/16

Y1 - 2006/11/16

N2 - Background: Extracting motifs from sequences is a mainstay of bioinformatics. We look at the problem of mining structured motifs, which allow variable length gaps between simple motif components. We propose an efficient algorithm, called EXMOTIF, that given some sequence(s), and a structured motif template, extracts all frequent structured motifs that have quorum q. Potential applications of our method include the extraction of single/composite regulatory binding sites in DNA sequences. Results: EXMOTIF is efficient in terms of both time and space and is shown empirically to outperform RISO, a state-of-the-art algorithm. It is also successful in finding potential single/composite transcription factor binding sites. Conclusion: EXMOTIF is a useful and efficient tool in discovering structured motifs, especially in DNA sequences. The algorithm is available as open-source at: http://www.cs.rpi.edu/~zaki/software/ exMotif/.

AB - Background: Extracting motifs from sequences is a mainstay of bioinformatics. We look at the problem of mining structured motifs, which allow variable length gaps between simple motif components. We propose an efficient algorithm, called EXMOTIF, that given some sequence(s), and a structured motif template, extracts all frequent structured motifs that have quorum q. Potential applications of our method include the extraction of single/composite regulatory binding sites in DNA sequences. Results: EXMOTIF is efficient in terms of both time and space and is shown empirically to outperform RISO, a state-of-the-art algorithm. It is also successful in finding potential single/composite transcription factor binding sites. Conclusion: EXMOTIF is a useful and efficient tool in discovering structured motifs, especially in DNA sequences. The algorithm is available as open-source at: http://www.cs.rpi.edu/~zaki/software/ exMotif/.

UR - http://www.scopus.com/inward/record.url?scp=34248356673&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34248356673&partnerID=8YFLogxK

U2 - 10.1186/1748-7188-1-21

DO - 10.1186/1748-7188-1-21

M3 - Article

AN - SCOPUS:34248356673

VL - 1

JO - Algorithms for Molecular Biology

JF - Algorithms for Molecular Biology

SN - 1748-7188

IS - 1

M1 - 21

ER -