Cleaning your wrong google scholar entries

Shuang Hao, Yi Xu, Nan Tang, Guoliang Li, Jianhua Feng

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Entity categorization-the process of grouping entities into categories for some specific purpose-is an important problem with a great many applications, such as Google Scholar and Amazon products. Unfortunately, many real-world categories contain mis-categorized entities, such as publications in one's Google Scholar page that are published by the others. We have proposed a general framework for a new research problem-discovering mis-categorized entities. In this demonstration, we have developed a Google Chrome extension, namely GSCleaner, as one important application of our studied problem. The attendees will have the opportunity to experience the following features: (1) mis-categorized entity discovery-The attendee can check mis-categorized entities on anyone's Google Scholar page; and (2) Cleaning onsite-Any attendee can login and clean his Google Scholar page using GSCleaner.We describe our novel rule-based framework to discover mis-categorized entities. We also propose effective optimization techniques to apply the rules. Some empirical results show the effectiveness of GSCleaner on discovering mis-categorized entities.

Original languageEnglish
Title of host publicationProceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1597-1600
Number of pages4
ISBN (Electronic)9781538655207
DOIs
Publication statusPublished - 24 Oct 2018
Event34th IEEE International Conference on Data Engineering, ICDE 2018 - Paris, France
Duration: 16 Apr 201819 Apr 2018

Other

Other34th IEEE International Conference on Data Engineering, ICDE 2018
CountryFrance
CityParis
Period16/4/1819/4/18

Fingerprint

Cleaning
Demonstrations
Google Scholar

Keywords

  • Google Scholar cleaner
  • Mis categorized entity
  • Rule based framework
  • Signature

ASJC Scopus subject areas

  • Information Systems
  • Information Systems and Management
  • Hardware and Architecture

Cite this

Hao, S., Xu, Y., Tang, N., Li, G., & Feng, J. (2018). Cleaning your wrong google scholar entries. In Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018 (pp. 1597-1600). [8509406] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICDE.2018.00185

Cleaning your wrong google scholar entries. / Hao, Shuang; Xu, Yi; Tang, Nan; Li, Guoliang; Feng, Jianhua.

Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018. Institute of Electrical and Electronics Engineers Inc., 2018. p. 1597-1600 8509406.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Hao, S, Xu, Y, Tang, N, Li, G & Feng, J 2018, Cleaning your wrong google scholar entries. in Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018., 8509406, Institute of Electrical and Electronics Engineers Inc., pp. 1597-1600, 34th IEEE International Conference on Data Engineering, ICDE 2018, Paris, France, 16/4/18. https://doi.org/10.1109/ICDE.2018.00185
Hao S, Xu Y, Tang N, Li G, Feng J. Cleaning your wrong google scholar entries. In Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018. Institute of Electrical and Electronics Engineers Inc. 2018. p. 1597-1600. 8509406 https://doi.org/10.1109/ICDE.2018.00185
Hao, Shuang ; Xu, Yi ; Tang, Nan ; Li, Guoliang ; Feng, Jianhua. / Cleaning your wrong google scholar entries. Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018. Institute of Electrical and Electronics Engineers Inc., 2018. pp. 1597-1600
@inproceedings{b08cb00ac19f4f7ba786059ba10178d0,
title = "Cleaning your wrong google scholar entries",
abstract = "Entity categorization-the process of grouping entities into categories for some specific purpose-is an important problem with a great many applications, such as Google Scholar and Amazon products. Unfortunately, many real-world categories contain mis-categorized entities, such as publications in one's Google Scholar page that are published by the others. We have proposed a general framework for a new research problem-discovering mis-categorized entities. In this demonstration, we have developed a Google Chrome extension, namely GSCleaner, as one important application of our studied problem. The attendees will have the opportunity to experience the following features: (1) mis-categorized entity discovery-The attendee can check mis-categorized entities on anyone's Google Scholar page; and (2) Cleaning onsite-Any attendee can login and clean his Google Scholar page using GSCleaner.We describe our novel rule-based framework to discover mis-categorized entities. We also propose effective optimization techniques to apply the rules. Some empirical results show the effectiveness of GSCleaner on discovering mis-categorized entities.",
keywords = "Google Scholar cleaner, Mis categorized entity, Rule based framework, Signature",
author = "Shuang Hao and Yi Xu and Nan Tang and Guoliang Li and Jianhua Feng",
year = "2018",
month = "10",
day = "24",
doi = "10.1109/ICDE.2018.00185",
language = "English",
pages = "1597--1600",
booktitle = "Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - Cleaning your wrong google scholar entries

AU - Hao, Shuang

AU - Xu, Yi

AU - Tang, Nan

AU - Li, Guoliang

AU - Feng, Jianhua

PY - 2018/10/24

Y1 - 2018/10/24

N2 - Entity categorization-the process of grouping entities into categories for some specific purpose-is an important problem with a great many applications, such as Google Scholar and Amazon products. Unfortunately, many real-world categories contain mis-categorized entities, such as publications in one's Google Scholar page that are published by the others. We have proposed a general framework for a new research problem-discovering mis-categorized entities. In this demonstration, we have developed a Google Chrome extension, namely GSCleaner, as one important application of our studied problem. The attendees will have the opportunity to experience the following features: (1) mis-categorized entity discovery-The attendee can check mis-categorized entities on anyone's Google Scholar page; and (2) Cleaning onsite-Any attendee can login and clean his Google Scholar page using GSCleaner.We describe our novel rule-based framework to discover mis-categorized entities. We also propose effective optimization techniques to apply the rules. Some empirical results show the effectiveness of GSCleaner on discovering mis-categorized entities.

AB - Entity categorization-the process of grouping entities into categories for some specific purpose-is an important problem with a great many applications, such as Google Scholar and Amazon products. Unfortunately, many real-world categories contain mis-categorized entities, such as publications in one's Google Scholar page that are published by the others. We have proposed a general framework for a new research problem-discovering mis-categorized entities. In this demonstration, we have developed a Google Chrome extension, namely GSCleaner, as one important application of our studied problem. The attendees will have the opportunity to experience the following features: (1) mis-categorized entity discovery-The attendee can check mis-categorized entities on anyone's Google Scholar page; and (2) Cleaning onsite-Any attendee can login and clean his Google Scholar page using GSCleaner.We describe our novel rule-based framework to discover mis-categorized entities. We also propose effective optimization techniques to apply the rules. Some empirical results show the effectiveness of GSCleaner on discovering mis-categorized entities.

KW - Google Scholar cleaner

KW - Mis categorized entity

KW - Rule based framework

KW - Signature

UR - http://www.scopus.com/inward/record.url?scp=85057130260&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85057130260&partnerID=8YFLogxK

U2 - 10.1109/ICDE.2018.00185

DO - 10.1109/ICDE.2018.00185

M3 - Conference contribution

SP - 1597

EP - 1600

BT - Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018

PB - Institute of Electrical and Electronics Engineers Inc.

ER -