Leveraging Web intelligence for finding interesting research datasets

Ayush Singhal, Ravindra Kasturi, Vidyashankar Sivakumar, Jaideep Srivastava

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Citations (Scopus)

Abstract

The problem of user's interest to item matching is at the core of recommendation systems and search engines. This problem is well studied in different contexts such as item, document, music and movie recommendations. For the purpose of recommendation these systems store the context or the meta-data information about the item of interest (e.g. user rating for books, tags, price etc). However, the general approaches for finding relevant items for recommendation cannot be directly applied in the case when the context or meta-data information about the item of interest is missing. In this paper we describe an algorithmic approach to handle this problem of missing context for items. In the proposed approach we have extended the context of user's interest and developed an unsupervised algorithm to find the items of interest for the user. Finally the items are ranked based on their relevance to the user's interest. We study this problem in the domain of dataset recommendation where the meta-data information about the datasets is missing due to lack of coherent and complete repository for the research datasets. We evaluate the performance of the proposed framework with real world dataset consisting of 20 user queries. We find that the proposed framework can recommend datasets for user queries with a recall of 90% in the top-4 recommendations. We also compared the performance of the dataset finding algorithm with the state of art supervised classification approach. We get a significant improvement of 36% using the proposed algorithm.

Original languageEnglish
Title of host publicationProceedings - 2013 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013
Pages321-328
Number of pages8
Volume1
DOIs
Publication statusPublished - 2013
Externally publishedYes
Event2013 12th IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013 - Atlanta, GA
Duration: 17 Nov 201320 Nov 2013

Other

Other2013 12th IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013
CityAtlanta, GA
Period17/11/1320/11/13

Fingerprint

Metadata
Recommender systems
Search engines

Keywords

  • Interest-item matching
  • Recommender systems
  • Search engines
  • Web knowledge

ASJC Scopus subject areas

  • Artificial Intelligence

Cite this

Singhal, A., Kasturi, R., Sivakumar, V., & Srivastava, J. (2013). Leveraging Web intelligence for finding interesting research datasets. In Proceedings - 2013 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013 (Vol. 1, pp. 321-328). [6690032] https://doi.org/10.1109/WI-IAT.2013.46

Leveraging Web intelligence for finding interesting research datasets. / Singhal, Ayush; Kasturi, Ravindra; Sivakumar, Vidyashankar; Srivastava, Jaideep.

Proceedings - 2013 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013. Vol. 1 2013. p. 321-328 6690032.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Singhal, A, Kasturi, R, Sivakumar, V & Srivastava, J 2013, Leveraging Web intelligence for finding interesting research datasets. in Proceedings - 2013 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013. vol. 1, 6690032, pp. 321-328, 2013 12th IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013, Atlanta, GA, 17/11/13. https://doi.org/10.1109/WI-IAT.2013.46
Singhal A, Kasturi R, Sivakumar V, Srivastava J. Leveraging Web intelligence for finding interesting research datasets. In Proceedings - 2013 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013. Vol. 1. 2013. p. 321-328. 6690032 https://doi.org/10.1109/WI-IAT.2013.46
Singhal, Ayush ; Kasturi, Ravindra ; Sivakumar, Vidyashankar ; Srivastava, Jaideep. / Leveraging Web intelligence for finding interesting research datasets. Proceedings - 2013 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013. Vol. 1 2013. pp. 321-328
@inproceedings{09dd02567a564cb4a250ad00a5c10ff6,
title = "Leveraging Web intelligence for finding interesting research datasets",
abstract = "The problem of user's interest to item matching is at the core of recommendation systems and search engines. This problem is well studied in different contexts such as item, document, music and movie recommendations. For the purpose of recommendation these systems store the context or the meta-data information about the item of interest (e.g. user rating for books, tags, price etc). However, the general approaches for finding relevant items for recommendation cannot be directly applied in the case when the context or meta-data information about the item of interest is missing. In this paper we describe an algorithmic approach to handle this problem of missing context for items. In the proposed approach we have extended the context of user's interest and developed an unsupervised algorithm to find the items of interest for the user. Finally the items are ranked based on their relevance to the user's interest. We study this problem in the domain of dataset recommendation where the meta-data information about the datasets is missing due to lack of coherent and complete repository for the research datasets. We evaluate the performance of the proposed framework with real world dataset consisting of 20 user queries. We find that the proposed framework can recommend datasets for user queries with a recall of 90{\%} in the top-4 recommendations. We also compared the performance of the dataset finding algorithm with the state of art supervised classification approach. We get a significant improvement of 36{\%} using the proposed algorithm.",
keywords = "Interest-item matching, Recommender systems, Search engines, Web knowledge",
author = "Ayush Singhal and Ravindra Kasturi and Vidyashankar Sivakumar and Jaideep Srivastava",
year = "2013",
doi = "10.1109/WI-IAT.2013.46",
language = "English",
isbn = "9781479929023",
volume = "1",
pages = "321--328",
booktitle = "Proceedings - 2013 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013",

}

TY - GEN

T1 - Leveraging Web intelligence for finding interesting research datasets

AU - Singhal, Ayush

AU - Kasturi, Ravindra

AU - Sivakumar, Vidyashankar

AU - Srivastava, Jaideep

PY - 2013

Y1 - 2013

N2 - The problem of user's interest to item matching is at the core of recommendation systems and search engines. This problem is well studied in different contexts such as item, document, music and movie recommendations. For the purpose of recommendation these systems store the context or the meta-data information about the item of interest (e.g. user rating for books, tags, price etc). However, the general approaches for finding relevant items for recommendation cannot be directly applied in the case when the context or meta-data information about the item of interest is missing. In this paper we describe an algorithmic approach to handle this problem of missing context for items. In the proposed approach we have extended the context of user's interest and developed an unsupervised algorithm to find the items of interest for the user. Finally the items are ranked based on their relevance to the user's interest. We study this problem in the domain of dataset recommendation where the meta-data information about the datasets is missing due to lack of coherent and complete repository for the research datasets. We evaluate the performance of the proposed framework with real world dataset consisting of 20 user queries. We find that the proposed framework can recommend datasets for user queries with a recall of 90% in the top-4 recommendations. We also compared the performance of the dataset finding algorithm with the state of art supervised classification approach. We get a significant improvement of 36% using the proposed algorithm.

AB - The problem of user's interest to item matching is at the core of recommendation systems and search engines. This problem is well studied in different contexts such as item, document, music and movie recommendations. For the purpose of recommendation these systems store the context or the meta-data information about the item of interest (e.g. user rating for books, tags, price etc). However, the general approaches for finding relevant items for recommendation cannot be directly applied in the case when the context or meta-data information about the item of interest is missing. In this paper we describe an algorithmic approach to handle this problem of missing context for items. In the proposed approach we have extended the context of user's interest and developed an unsupervised algorithm to find the items of interest for the user. Finally the items are ranked based on their relevance to the user's interest. We study this problem in the domain of dataset recommendation where the meta-data information about the datasets is missing due to lack of coherent and complete repository for the research datasets. We evaluate the performance of the proposed framework with real world dataset consisting of 20 user queries. We find that the proposed framework can recommend datasets for user queries with a recall of 90% in the top-4 recommendations. We also compared the performance of the dataset finding algorithm with the state of art supervised classification approach. We get a significant improvement of 36% using the proposed algorithm.

KW - Interest-item matching

KW - Recommender systems

KW - Search engines

KW - Web knowledge

UR - http://www.scopus.com/inward/record.url?scp=84893248944&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84893248944&partnerID=8YFLogxK

U2 - 10.1109/WI-IAT.2013.46

DO - 10.1109/WI-IAT.2013.46

M3 - Conference contribution

SN - 9781479929023

VL - 1

SP - 321

EP - 328

BT - Proceedings - 2013 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013

ER -