Approximate graph mining with label costs

Pranay Anchuri, Mohammed J. Zaki, Omer Barkol, Shahar Golan, Moshe Shamy

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Many real-world graphs have complex labels on the nodes and edges. Mining only exact patterns yields limited insights, since it may be hard to find exact matches. However, in many domains it is relatively easy to compute some cost (or distance) between different labels. Using this information, it becomes possible to mine a much richer set of approximate subgraph patterns, which preserve the topology but allow bounded label mismatches. We present novel and scalable methods to efficiently solve the approximate isomorphism problem. We show that the mined approximate patterns yield interesting patterns in several real-world graphs ranging from IT and protein interaction networks to protein structures.

Original languageEnglish
Title of host publicationHP Laboratories Technical Report
Edition36
Publication statusPublished - 24 Jun 2013
Externally publishedYes

Fingerprint

Labels
Proteins
Costs
Topology

Keywords

  • Approximation techniques
  • CMDB
  • Data mining
  • Graph analysis

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Hardware and Architecture
  • Software

Cite this

Anchuri, P., Zaki, M. J., Barkol, O., Golan, S., & Shamy, M. (2013). Approximate graph mining with label costs. In HP Laboratories Technical Report (36 ed.)

Approximate graph mining with label costs. / Anchuri, Pranay; Zaki, Mohammed J.; Barkol, Omer; Golan, Shahar; Shamy, Moshe.

HP Laboratories Technical Report. 36. ed. 2013.

Research output: Chapter in Book/Report/Conference proceedingChapter

Anchuri, P, Zaki, MJ, Barkol, O, Golan, S & Shamy, M 2013, Approximate graph mining with label costs. in HP Laboratories Technical Report. 36 edn.
Anchuri P, Zaki MJ, Barkol O, Golan S, Shamy M. Approximate graph mining with label costs. In HP Laboratories Technical Report. 36 ed. 2013
Anchuri, Pranay ; Zaki, Mohammed J. ; Barkol, Omer ; Golan, Shahar ; Shamy, Moshe. / Approximate graph mining with label costs. HP Laboratories Technical Report. 36. ed. 2013.
@inbook{fc3a047cc2f74725852b1997a5641d2f,
title = "Approximate graph mining with label costs",
abstract = "Many real-world graphs have complex labels on the nodes and edges. Mining only exact patterns yields limited insights, since it may be hard to find exact matches. However, in many domains it is relatively easy to compute some cost (or distance) between different labels. Using this information, it becomes possible to mine a much richer set of approximate subgraph patterns, which preserve the topology but allow bounded label mismatches. We present novel and scalable methods to efficiently solve the approximate isomorphism problem. We show that the mined approximate patterns yield interesting patterns in several real-world graphs ranging from IT and protein interaction networks to protein structures.",
keywords = "Approximation techniques, CMDB, Data mining, Graph analysis",
author = "Pranay Anchuri and Zaki, {Mohammed J.} and Omer Barkol and Shahar Golan and Moshe Shamy",
year = "2013",
month = "6",
day = "24",
language = "English",
booktitle = "HP Laboratories Technical Report",
edition = "36",

}

TY - CHAP

T1 - Approximate graph mining with label costs

AU - Anchuri, Pranay

AU - Zaki, Mohammed J.

AU - Barkol, Omer

AU - Golan, Shahar

AU - Shamy, Moshe

PY - 2013/6/24

Y1 - 2013/6/24

N2 - Many real-world graphs have complex labels on the nodes and edges. Mining only exact patterns yields limited insights, since it may be hard to find exact matches. However, in many domains it is relatively easy to compute some cost (or distance) between different labels. Using this information, it becomes possible to mine a much richer set of approximate subgraph patterns, which preserve the topology but allow bounded label mismatches. We present novel and scalable methods to efficiently solve the approximate isomorphism problem. We show that the mined approximate patterns yield interesting patterns in several real-world graphs ranging from IT and protein interaction networks to protein structures.

AB - Many real-world graphs have complex labels on the nodes and edges. Mining only exact patterns yields limited insights, since it may be hard to find exact matches. However, in many domains it is relatively easy to compute some cost (or distance) between different labels. Using this information, it becomes possible to mine a much richer set of approximate subgraph patterns, which preserve the topology but allow bounded label mismatches. We present novel and scalable methods to efficiently solve the approximate isomorphism problem. We show that the mined approximate patterns yield interesting patterns in several real-world graphs ranging from IT and protein interaction networks to protein structures.

KW - Approximation techniques

KW - CMDB

KW - Data mining

KW - Graph analysis

UR - http://www.scopus.com/inward/record.url?scp=84879076194&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84879076194&partnerID=8YFLogxK

M3 - Chapter

BT - HP Laboratories Technical Report

ER -