Information extraction from multimedia web documents: an open-source platform and testbed

David Paul Dupplaw, Michael Matthews, Richard Johansson, Giulia Boato, Andrea Costanzo, Marco Fontani, Enrico Minack, Elena Demidova, Roi Blanco, Thomas Griffiths, Paul Lewis, Jonathon Hare, Alessandro Moschitti

Research output: Contribution to journalArticle

1 Citation (Scopus)


The LivingKnowledge project aimed to enhance the current state of the art in search, retrieval and knowledge management on the web by advancing the use of sentiment and opinion analysis within multimedia applications. To achieve this aim, a diverse set of novel and complementary analysis techniques have been integrated into a single, but extensible software platform on which such applications can be built. The platform combines state-of-the-art techniques for extracting facts, opinions and sentiment from multimedia documents, and unlike earlier platforms, it exploits both visual and textual techniques to support multimedia information retrieval. Foreseeing the usefulness of this software in the wider community, the platform has been made generally available as an open-source project. This paper describes the platform design, gives an overview of the analysis algorithms integrated into the system and describes two applications that utilise the system for multimedia information retrieval.

Original languageEnglish
Pages (from-to)97-111
Number of pages15
JournalInternational Journal of Multimedia Information Retrieval
Issue number2
Publication statusPublished - 1 Jun 2014



  • Image analysis
  • Multimedia retrieval
  • Open-source software
  • Opinion analysis
  • Text analysis
  • Web analysis

ASJC Scopus subject areas

  • Information Systems
  • Media Technology
  • Library and Information Sciences

Cite this

Dupplaw, D. P., Matthews, M., Johansson, R., Boato, G., Costanzo, A., Fontani, M., Minack, E., Demidova, E., Blanco, R., Griffiths, T., Lewis, P., Hare, J., & Moschitti, A. (2014). Information extraction from multimedia web documents: an open-source platform and testbed. International Journal of Multimedia Information Retrieval, 3(2), 97-111.