Scalable Online Betweenness Centrality in Evolving Graphs

Nicolas Kourtellis, Gianmarco Morales, Francesco Bonchi

Research output: Contribution to journalArticle

26 Citations (Scopus)

Abstract

Betweenness centrality is a classic measure that quantifies the importance of a graph element (vertex or edge) according to the fraction of shortest paths passing through it. This measure is notoriously expensive to compute, and the best known algorithm runs in O(nm) time. The problems of efficiency and scalability are exacerbated in a dynamic setting, where the input is an evolving graph seen edge by edge, and the goal is to keep the betweenness centrality up to date. In this paper, we propose the first truly scalable algorithm for online computation of betweenness centrality of both vertices and edges in an evolving graph where new edges are added and existing edges are removed. Our algorithm is carefully engineered with out-of-core techniques and tailored for modern parallel stream processing engines that run on clusters of shared-nothing commodity hardware. Hence, it is amenable to real-world deployment. We experiment on graphs that are two orders of magnitude larger than previous studies. Our method is able to keep the betweenness centrality measures up-to-date online, i.e., the time to update the measures is smaller than the inter-arrival time between two consecutive updates.

Original languageEnglish
Article number7079456
Pages (from-to)2494-2506
Number of pages13
JournalIEEE Transactions on Knowledge and Data Engineering
Volume27
Issue number9
DOIs
Publication statusPublished - 1 Sep 2015
Externally publishedYes

Fingerprint

Scalability
Engines
Hardware
Processing
Experiments

Keywords

  • Betweenness centrality
  • big data
  • evolving graphs
  • streaming scalable distributed algorithms

ASJC Scopus subject areas

  • Information Systems
  • Computer Science Applications
  • Computational Theory and Mathematics

Cite this

Scalable Online Betweenness Centrality in Evolving Graphs. / Kourtellis, Nicolas; Morales, Gianmarco; Bonchi, Francesco.

In: IEEE Transactions on Knowledge and Data Engineering, Vol. 27, No. 9, 7079456, 01.09.2015, p. 2494-2506.

Research output: Contribution to journalArticle

Kourtellis, Nicolas ; Morales, Gianmarco ; Bonchi, Francesco. / Scalable Online Betweenness Centrality in Evolving Graphs. In: IEEE Transactions on Knowledge and Data Engineering. 2015 ; Vol. 27, No. 9. pp. 2494-2506.
@article{3d20bd59b5e04aa999921a1a343ec264,
title = "Scalable Online Betweenness Centrality in Evolving Graphs",
abstract = "Betweenness centrality is a classic measure that quantifies the importance of a graph element (vertex or edge) according to the fraction of shortest paths passing through it. This measure is notoriously expensive to compute, and the best known algorithm runs in O(nm) time. The problems of efficiency and scalability are exacerbated in a dynamic setting, where the input is an evolving graph seen edge by edge, and the goal is to keep the betweenness centrality up to date. In this paper, we propose the first truly scalable algorithm for online computation of betweenness centrality of both vertices and edges in an evolving graph where new edges are added and existing edges are removed. Our algorithm is carefully engineered with out-of-core techniques and tailored for modern parallel stream processing engines that run on clusters of shared-nothing commodity hardware. Hence, it is amenable to real-world deployment. We experiment on graphs that are two orders of magnitude larger than previous studies. Our method is able to keep the betweenness centrality measures up-to-date online, i.e., the time to update the measures is smaller than the inter-arrival time between two consecutive updates.",
keywords = "Betweenness centrality, big data, evolving graphs, streaming scalable distributed algorithms",
author = "Nicolas Kourtellis and Gianmarco Morales and Francesco Bonchi",
year = "2015",
month = "9",
day = "1",
doi = "10.1109/TKDE.2015.2419666",
language = "English",
volume = "27",
pages = "2494--2506",
journal = "IEEE Transactions on Knowledge and Data Engineering",
issn = "1041-4347",
publisher = "IEEE Computer Society",
number = "9",

}

TY - JOUR

T1 - Scalable Online Betweenness Centrality in Evolving Graphs

AU - Kourtellis, Nicolas

AU - Morales, Gianmarco

AU - Bonchi, Francesco

PY - 2015/9/1

Y1 - 2015/9/1

N2 - Betweenness centrality is a classic measure that quantifies the importance of a graph element (vertex or edge) according to the fraction of shortest paths passing through it. This measure is notoriously expensive to compute, and the best known algorithm runs in O(nm) time. The problems of efficiency and scalability are exacerbated in a dynamic setting, where the input is an evolving graph seen edge by edge, and the goal is to keep the betweenness centrality up to date. In this paper, we propose the first truly scalable algorithm for online computation of betweenness centrality of both vertices and edges in an evolving graph where new edges are added and existing edges are removed. Our algorithm is carefully engineered with out-of-core techniques and tailored for modern parallel stream processing engines that run on clusters of shared-nothing commodity hardware. Hence, it is amenable to real-world deployment. We experiment on graphs that are two orders of magnitude larger than previous studies. Our method is able to keep the betweenness centrality measures up-to-date online, i.e., the time to update the measures is smaller than the inter-arrival time between two consecutive updates.

AB - Betweenness centrality is a classic measure that quantifies the importance of a graph element (vertex or edge) according to the fraction of shortest paths passing through it. This measure is notoriously expensive to compute, and the best known algorithm runs in O(nm) time. The problems of efficiency and scalability are exacerbated in a dynamic setting, where the input is an evolving graph seen edge by edge, and the goal is to keep the betweenness centrality up to date. In this paper, we propose the first truly scalable algorithm for online computation of betweenness centrality of both vertices and edges in an evolving graph where new edges are added and existing edges are removed. Our algorithm is carefully engineered with out-of-core techniques and tailored for modern parallel stream processing engines that run on clusters of shared-nothing commodity hardware. Hence, it is amenable to real-world deployment. We experiment on graphs that are two orders of magnitude larger than previous studies. Our method is able to keep the betweenness centrality measures up-to-date online, i.e., the time to update the measures is smaller than the inter-arrival time between two consecutive updates.

KW - Betweenness centrality

KW - big data

KW - evolving graphs

KW - streaming scalable distributed algorithms

UR - http://www.scopus.com/inward/record.url?scp=84939226346&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84939226346&partnerID=8YFLogxK

U2 - 10.1109/TKDE.2015.2419666

DO - 10.1109/TKDE.2015.2419666

M3 - Article

VL - 27

SP - 2494

EP - 2506

JO - IEEE Transactions on Knowledge and Data Engineering

JF - IEEE Transactions on Knowledge and Data Engineering

SN - 1041-4347

IS - 9

M1 - 7079456

ER -