Determining the user intent of web search engine queries

Bernard J. Jansen, Danielle L. Booth, Amanda Spink

Research output: Chapter in Book/Report/Conference proceedingConference contribution

128 Citations (Scopus)

Abstract

Determining the user intent of Web searches is a difficult problem due to the sparse data available concerning the searcher. In this paper, we examine a method to determine the user intent underlying Web search engine queries. We qualitatively analyze samples of queries from seven transaction logs from three different Web search engines containing more than five million queries. From this analysis, we identified characteristics of user queries based on three broad classifications of user intent. The classifications of informational, navigational, and transactional represent the type of content destination the searcher desired as expressed by their query. We implemented our classification algorithm and automatically classified a separate Web search engine transaction log of over a million queries submitted by several hundred thousand users. Our findings show that more than 80% of Web queries are informational in nature, with about 10% each being navigational and transactional. In order to validate the accuracy of our algorithm, we manually coded 400 queries and compared the classification to the results from our algorithm. This comparison showed that our automatic classification has an accuracy of 74%. Of the remaining 25% of the queries, the user intent is generally vague or multi-faceted, pointing to the need to for probabilistic classification. We illustrate how knowledge of searcher intent might be used to enhance future Web search engines.

Original languageEnglish
Title of host publication16th International World Wide Web Conference, WWW2007
Pages1149-1150
Number of pages2
DOIs
Publication statusPublished - 22 Oct 2007
Event16th International World Wide Web Conference, WWW2007 - Banff, AB, Canada
Duration: 8 May 200712 May 2007

Publication series

Name16th International World Wide Web Conference, WWW2007

Other

Other16th International World Wide Web Conference, WWW2007
CountryCanada
CityBanff, AB
Period8/5/0712/5/07

    Fingerprint

Keywords

  • Search engines
  • User intent
  • Web queries
  • Web searching

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Software

Cite this

Jansen, B. J., Booth, D. L., & Spink, A. (2007). Determining the user intent of web search engine queries. In 16th International World Wide Web Conference, WWW2007 (pp. 1149-1150). (16th International World Wide Web Conference, WWW2007). https://doi.org/10.1145/1242572.1242739