Cleaning your wrong google scholar entries

Shuang Hao, Yi Xu, Nan Tang, Guoliang Li, Jianhua Feng

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Entity categorization-the process of grouping entities into categories for some specific purpose-is an important problem with a great many applications, such as Google Scholar and Amazon products. Unfortunately, many real-world categories contain mis-categorized entities, such as publications in one's Google Scholar page that are published by the others. We have proposed a general framework for a new research problem-discovering mis-categorized entities. In this demonstration, we have developed a Google Chrome extension, namely GSCleaner, as one important application of our studied problem. The attendees will have the opportunity to experience the following features: (1) mis-categorized entity discovery-The attendee can check mis-categorized entities on anyone's Google Scholar page; and (2) Cleaning onsite-Any attendee can login and clean his Google Scholar page using GSCleaner.We describe our novel rule-based framework to discover mis-categorized entities. We also propose effective optimization techniques to apply the rules. Some empirical results show the effectiveness of GSCleaner on discovering mis-categorized entities.

Original languageEnglish
Title of host publicationProceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1597-1600
Number of pages4
ISBN (Electronic)9781538655207
DOIs
Publication statusPublished - 24 Oct 2018
Event34th IEEE International Conference on Data Engineering, ICDE 2018 - Paris, France
Duration: 16 Apr 201819 Apr 2018

Other

Other34th IEEE International Conference on Data Engineering, ICDE 2018
CountryFrance
CityParis
Period16/4/1819/4/18

    Fingerprint

Keywords

  • Google Scholar cleaner
  • Mis categorized entity
  • Rule based framework
  • Signature

ASJC Scopus subject areas

  • Information Systems
  • Information Systems and Management
  • Hardware and Architecture

Cite this

Hao, S., Xu, Y., Tang, N., Li, G., & Feng, J. (2018). Cleaning your wrong google scholar entries. In Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018 (pp. 1597-1600). [8509406] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICDE.2018.00185