An expressive framework and efficient algorithms for the analysis of collaborative tagging

Mahashweta Das, Saravanan Thirumuruganathan, Sihem Amer-Yahia, Gautam Das, Cong Yu

Research output: Contribution to journalArticle

7 Citations (Scopus)

Abstract

The rise of Web 2.0 is signaled by sites such as Flickr, del.icio.us, and YouTube, and social tagging is essential to their success. A typical tagging action involves three components, user, item (e.g., photos in Flickr), and tags (i.e., words or phrases). Analyzing how tags are assigned by certain users to certain items has important implications in helping users search for desired information. In this paper, we develop a dual mining framework to explore tagging behavior. This framework is centered around two opposing measures, similarity and diversity, applied to one or more tagging components, and therefore enables a wide range of analysis scenarios such as characterizing similar users tagging diverse items with similar tags or diverse users tagging similar items with diverse tags. By adopting different concrete measures for similarity and diversity in the framework, we show that a wide range of concrete analysis problems can be defined and they are NP-Complete in general. We design four sets of efficient algorithms for solving many of those problems and demonstrate, through comprehensive experiments over real data, that our algorithms significantly out-perform the exact brute-force approach without compromising analysis result quality.

Original languageEnglish
Pages (from-to)201-226
Number of pages26
JournalVLDB Journal
Volume23
Issue number2
DOIs
Publication statusPublished - 1 Jan 2014
Externally publishedYes

Fingerprint

Concretes
Experiments

Keywords

  • Algorithm
  • Collaborative tagging
  • Dual mining framework
  • Optimization

ASJC Scopus subject areas

  • Information Systems
  • Hardware and Architecture

Cite this

An expressive framework and efficient algorithms for the analysis of collaborative tagging. / Das, Mahashweta; Thirumuruganathan, Saravanan; Amer-Yahia, Sihem; Das, Gautam; Yu, Cong.

In: VLDB Journal, Vol. 23, No. 2, 01.01.2014, p. 201-226.

Research output: Contribution to journalArticle

Das, Mahashweta ; Thirumuruganathan, Saravanan ; Amer-Yahia, Sihem ; Das, Gautam ; Yu, Cong. / An expressive framework and efficient algorithms for the analysis of collaborative tagging. In: VLDB Journal. 2014 ; Vol. 23, No. 2. pp. 201-226.
@article{97e53de20da1484abef9da9315b7cee3,
title = "An expressive framework and efficient algorithms for the analysis of collaborative tagging",
abstract = "The rise of Web 2.0 is signaled by sites such as Flickr, del.icio.us, and YouTube, and social tagging is essential to their success. A typical tagging action involves three components, user, item (e.g., photos in Flickr), and tags (i.e., words or phrases). Analyzing how tags are assigned by certain users to certain items has important implications in helping users search for desired information. In this paper, we develop a dual mining framework to explore tagging behavior. This framework is centered around two opposing measures, similarity and diversity, applied to one or more tagging components, and therefore enables a wide range of analysis scenarios such as characterizing similar users tagging diverse items with similar tags or diverse users tagging similar items with diverse tags. By adopting different concrete measures for similarity and diversity in the framework, we show that a wide range of concrete analysis problems can be defined and they are NP-Complete in general. We design four sets of efficient algorithms for solving many of those problems and demonstrate, through comprehensive experiments over real data, that our algorithms significantly out-perform the exact brute-force approach without compromising analysis result quality.",
keywords = "Algorithm, Collaborative tagging, Dual mining framework, Optimization",
author = "Mahashweta Das and Saravanan Thirumuruganathan and Sihem Amer-Yahia and Gautam Das and Cong Yu",
year = "2014",
month = "1",
day = "1",
doi = "10.1007/s00778-013-0341-y",
language = "English",
volume = "23",
pages = "201--226",
journal = "VLDB Journal",
issn = "1066-8888",
publisher = "Springer New York",
number = "2",

}

TY - JOUR

T1 - An expressive framework and efficient algorithms for the analysis of collaborative tagging

AU - Das, Mahashweta

AU - Thirumuruganathan, Saravanan

AU - Amer-Yahia, Sihem

AU - Das, Gautam

AU - Yu, Cong

PY - 2014/1/1

Y1 - 2014/1/1

N2 - The rise of Web 2.0 is signaled by sites such as Flickr, del.icio.us, and YouTube, and social tagging is essential to their success. A typical tagging action involves three components, user, item (e.g., photos in Flickr), and tags (i.e., words or phrases). Analyzing how tags are assigned by certain users to certain items has important implications in helping users search for desired information. In this paper, we develop a dual mining framework to explore tagging behavior. This framework is centered around two opposing measures, similarity and diversity, applied to one or more tagging components, and therefore enables a wide range of analysis scenarios such as characterizing similar users tagging diverse items with similar tags or diverse users tagging similar items with diverse tags. By adopting different concrete measures for similarity and diversity in the framework, we show that a wide range of concrete analysis problems can be defined and they are NP-Complete in general. We design four sets of efficient algorithms for solving many of those problems and demonstrate, through comprehensive experiments over real data, that our algorithms significantly out-perform the exact brute-force approach without compromising analysis result quality.

AB - The rise of Web 2.0 is signaled by sites such as Flickr, del.icio.us, and YouTube, and social tagging is essential to their success. A typical tagging action involves three components, user, item (e.g., photos in Flickr), and tags (i.e., words or phrases). Analyzing how tags are assigned by certain users to certain items has important implications in helping users search for desired information. In this paper, we develop a dual mining framework to explore tagging behavior. This framework is centered around two opposing measures, similarity and diversity, applied to one or more tagging components, and therefore enables a wide range of analysis scenarios such as characterizing similar users tagging diverse items with similar tags or diverse users tagging similar items with diverse tags. By adopting different concrete measures for similarity and diversity in the framework, we show that a wide range of concrete analysis problems can be defined and they are NP-Complete in general. We design four sets of efficient algorithms for solving many of those problems and demonstrate, through comprehensive experiments over real data, that our algorithms significantly out-perform the exact brute-force approach without compromising analysis result quality.

KW - Algorithm

KW - Collaborative tagging

KW - Dual mining framework

KW - Optimization

UR - http://www.scopus.com/inward/record.url?scp=84897571767&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84897571767&partnerID=8YFLogxK

U2 - 10.1007/s00778-013-0341-y

DO - 10.1007/s00778-013-0341-y

M3 - Article

VL - 23

SP - 201

EP - 226

JO - VLDB Journal

JF - VLDB Journal

SN - 1066-8888

IS - 2

ER -