App Miscategorization Detection

A Case Study on Google Play

Didi Surian, Suranga Seneviratne, Aruna Seneviratne, Sanjay Chawla

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

An ongoing challenge in the rapidly evolving app market ecosystem is to maintain the integrity of app categories. At the time of registration, app developers have to select, what they believe, is the most appropriate category for their apps. Besides the inherent ambiguity of selecting the right category, the approach leaves open the possibility of misuse and potential gaming by the registrant. Periodically, the app store will refine the list of categories available and potentially reassign the apps. However, it has been observed that the mismatch between the description of the app and the category it belongs to, continues to persist. Although some common mechanisms (e.g., a complaint-driven or manual checking) exist, they limit the response time to detect miscategorized apps and still open the challenge on categorization. We introduce FRAC+: (FR)amework for (A)pp (C)ategorization. FRAC+ has the following salient features: (i) it is based on a data-driven topic model and automatically suggests the categories appropriate for the app store, and (ii) it can detect miscategorizated apps. Extensive experiments attest to the performance of FRAC+. Experiments on Google Play shows that FRAC+'s topics are more aligned with Google's new categories and 0.35-1.10 percent game apps are detected to be miscategorized.

Original languageEnglish
Article number7885558
Pages (from-to)1591-1604
Number of pages14
JournalIEEE Transactions on Knowledge and Data Engineering
Volume29
Issue number8
DOIs
Publication statusPublished - 1 Aug 2017

Fingerprint

Application programs
Response time (computer systems)
Ecosystems
Experiments

Keywords

  • App categorization
  • app market
  • miscategorization detection
  • mixture model
  • von-mises fisher distribution

ASJC Scopus subject areas

  • Information Systems
  • Computer Science Applications
  • Computational Theory and Mathematics

Cite this

App Miscategorization Detection : A Case Study on Google Play. / Surian, Didi; Seneviratne, Suranga; Seneviratne, Aruna; Chawla, Sanjay.

In: IEEE Transactions on Knowledge and Data Engineering, Vol. 29, No. 8, 7885558, 01.08.2017, p. 1591-1604.

Research output: Contribution to journalArticle

Surian, Didi ; Seneviratne, Suranga ; Seneviratne, Aruna ; Chawla, Sanjay. / App Miscategorization Detection : A Case Study on Google Play. In: IEEE Transactions on Knowledge and Data Engineering. 2017 ; Vol. 29, No. 8. pp. 1591-1604.
@article{b666c1b3fb164b959e223a91ce64c3e1,
title = "App Miscategorization Detection: A Case Study on Google Play",
abstract = "An ongoing challenge in the rapidly evolving app market ecosystem is to maintain the integrity of app categories. At the time of registration, app developers have to select, what they believe, is the most appropriate category for their apps. Besides the inherent ambiguity of selecting the right category, the approach leaves open the possibility of misuse and potential gaming by the registrant. Periodically, the app store will refine the list of categories available and potentially reassign the apps. However, it has been observed that the mismatch between the description of the app and the category it belongs to, continues to persist. Although some common mechanisms (e.g., a complaint-driven or manual checking) exist, they limit the response time to detect miscategorized apps and still open the challenge on categorization. We introduce FRAC+: (FR)amework for (A)pp (C)ategorization. FRAC+ has the following salient features: (i) it is based on a data-driven topic model and automatically suggests the categories appropriate for the app store, and (ii) it can detect miscategorizated apps. Extensive experiments attest to the performance of FRAC+. Experiments on Google Play shows that FRAC+'s topics are more aligned with Google's new categories and 0.35-1.10 percent game apps are detected to be miscategorized.",
keywords = "App categorization, app market, miscategorization detection, mixture model, von-mises fisher distribution",
author = "Didi Surian and Suranga Seneviratne and Aruna Seneviratne and Sanjay Chawla",
year = "2017",
month = "8",
day = "1",
doi = "10.1109/TKDE.2017.2686851",
language = "English",
volume = "29",
pages = "1591--1604",
journal = "IEEE Transactions on Knowledge and Data Engineering",
issn = "1041-4347",
publisher = "IEEE Computer Society",
number = "8",

}

TY - JOUR

T1 - App Miscategorization Detection

T2 - A Case Study on Google Play

AU - Surian, Didi

AU - Seneviratne, Suranga

AU - Seneviratne, Aruna

AU - Chawla, Sanjay

PY - 2017/8/1

Y1 - 2017/8/1

N2 - An ongoing challenge in the rapidly evolving app market ecosystem is to maintain the integrity of app categories. At the time of registration, app developers have to select, what they believe, is the most appropriate category for their apps. Besides the inherent ambiguity of selecting the right category, the approach leaves open the possibility of misuse and potential gaming by the registrant. Periodically, the app store will refine the list of categories available and potentially reassign the apps. However, it has been observed that the mismatch between the description of the app and the category it belongs to, continues to persist. Although some common mechanisms (e.g., a complaint-driven or manual checking) exist, they limit the response time to detect miscategorized apps and still open the challenge on categorization. We introduce FRAC+: (FR)amework for (A)pp (C)ategorization. FRAC+ has the following salient features: (i) it is based on a data-driven topic model and automatically suggests the categories appropriate for the app store, and (ii) it can detect miscategorizated apps. Extensive experiments attest to the performance of FRAC+. Experiments on Google Play shows that FRAC+'s topics are more aligned with Google's new categories and 0.35-1.10 percent game apps are detected to be miscategorized.

AB - An ongoing challenge in the rapidly evolving app market ecosystem is to maintain the integrity of app categories. At the time of registration, app developers have to select, what they believe, is the most appropriate category for their apps. Besides the inherent ambiguity of selecting the right category, the approach leaves open the possibility of misuse and potential gaming by the registrant. Periodically, the app store will refine the list of categories available and potentially reassign the apps. However, it has been observed that the mismatch between the description of the app and the category it belongs to, continues to persist. Although some common mechanisms (e.g., a complaint-driven or manual checking) exist, they limit the response time to detect miscategorized apps and still open the challenge on categorization. We introduce FRAC+: (FR)amework for (A)pp (C)ategorization. FRAC+ has the following salient features: (i) it is based on a data-driven topic model and automatically suggests the categories appropriate for the app store, and (ii) it can detect miscategorizated apps. Extensive experiments attest to the performance of FRAC+. Experiments on Google Play shows that FRAC+'s topics are more aligned with Google's new categories and 0.35-1.10 percent game apps are detected to be miscategorized.

KW - App categorization

KW - app market

KW - miscategorization detection

KW - mixture model

KW - von-mises fisher distribution

UR - http://www.scopus.com/inward/record.url?scp=85029113458&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85029113458&partnerID=8YFLogxK

U2 - 10.1109/TKDE.2017.2686851

DO - 10.1109/TKDE.2017.2686851

M3 - Article

VL - 29

SP - 1591

EP - 1604

JO - IEEE Transactions on Knowledge and Data Engineering

JF - IEEE Transactions on Knowledge and Data Engineering

SN - 1041-4347

IS - 8

M1 - 7885558

ER -