The little engine(s) that could: Scaling online social networks

Josep M. Pujol, Vijay Erramilli, Georgos Siganos, Xiaoyuan Yang, Nikos Laoutaris, Parminder Chhabra, Pablo Rodriguez

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

132 Citations (Scopus)


The difficulty of scaling Online Social Networks (OSNs) has introduced new system design challenges that have often caused costly re-architecting for services like Twitter and Facebook. The complexity of the interconnections among users in social networks has introduced new scalability challenges. Conventional vertical scaling by resorting to full replication can be a costly proposition. Horizontal scaling by partitioning and distributing data among multiple servers (e.g., using DHTs) can lead to costly inter-server communication. We design, implement, and evaluate SPAR, a social partitioning and replication middleware that transparently leverages the social graph structure to achieve data locality while minimizing replication. SPAR guarantees that, for every user in an OSN, the data of that user's direct neighbors is co-located on the same server as that user. The gains from this approach are multi-fold: application developers can assume local semantics, i.e., develop as they would for a single server; scalability is achieved by adding commodity servers with low memory and network I/O requirements; and redundancy is achieved at a fraction of the cost. We detail our system design and an evaluation based on datasets from Twitter, Orkut, and Facebook, with a working implementation. We show that SPAR incurs minimal overhead, can help a well-known open-source Twitter clone reach Twitter's scale without changing a line of its application logic, and achieves higher throughput than Cassandra, Facebook's DHT-based key-value store.
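The co-location guarantee described in the abstract can be illustrated with a small sketch. This is not SPAR's algorithm: the abstract does not specify how masters are placed or how replication is minimized, so the sketch assumes a naive modulo placement of each user's master copy and then adds a replica wherever a neighbor's master lives on another server. The function names `place_with_local_semantics` and `has_local_semantics` and the toy edge list are hypothetical, chosen only for illustration.

```python
from collections import defaultdict


def place_with_local_semantics(edges, num_servers):
    """Toy placement: masters by modulo, replicas for every remote neighbor.

    Assumption: this naive scheme makes no attempt to minimize replicas,
    which is the part the abstract says SPAR optimizes.
    """
    neighbors = defaultdict(set)
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)

    # Assumption: the master copy of user u lives on server (u mod num_servers).
    master = {u: u % num_servers for u in neighbors}

    # replicas[s] holds the users whose data is copied (not mastered) on server s.
    replicas = defaultdict(set)
    for u, friends in neighbors.items():
        for f in friends:
            if master[f] != master[u]:
                replicas[master[u]].add(f)
    return master, replicas


def has_local_semantics(edges, master, replicas):
    """Check that every user's server holds a copy of each direct neighbor."""
    for u, v in edges:
        for a, b in ((u, v), (v, u)):
            server = master[a]
            if master[b] != server and b not in replicas.get(server, set()):
                return False
    return True


# Hypothetical four-user graph spread over two servers.
edges = [(0, 1), (1, 2), (2, 3), (0, 2)]
master, replicas = place_with_local_semantics(edges, 2)
print(has_local_semantics(edges, master, replicas))
```

Under this naive placement every server ends up holding replicas of two remote users, so any read of a user's neighborhood stays on one machine. A smarter choice of masters that follows the graph's community structure would need fewer replicas to satisfy the same check, which is the trade-off the abstract refers to as achieving locality while minimizing replication.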

Original language: English
Title of host publication: SIGCOMM'10 - Proceedings of the SIGCOMM 2010 Conference
Number of pages: 12
Publication status: Published - 15 Nov 2010
Externally published: Yes
Event: 7th International Conference on Autonomic Computing, SIGCOMM 2010 - New Delhi, India
Duration: 30 Aug 2010 to 3 Sep 2010


Other: 7th International Conference on Autonomic Computing, SIGCOMM 2010
City: New Delhi



Keywords

  • partition
  • replication
  • scalability
  • social networks

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Theoretical Computer Science

Cite this

Pujol, J. M., Erramilli, V., Siganos, G., Yang, X., Laoutaris, N., Chhabra, P., & Rodriguez, P. (2010). The little engine(s) that could: Scaling online social networks. In SIGCOMM'10 - Proceedings of the SIGCOMM 2010 Conference (pp. 375-386)