Determining the informational, navigational, and transactional intent of Web queries

Bernard Jansen, Danielle L. Booth, Amanda Spink

Research output: Contribution to journalArticle

264 Citations (Scopus)

Abstract

In this paper, we define and present a comprehensive classification of user intent for Web searching. The classification consists of three hierarchical levels of informational, navigational, and transactional intent. After deriving attributes of each, we then developed a software application that automatically classified queries using a Web search engine log of over a million and a half queries submitted by several hundred thousand users. Our findings show that more than 80% of Web queries are informational in nature, with about 10% each being navigational and transactional. In order to validate the accuracy of our algorithm, we manually coded 400 queries and compared the results from this manual classification to the results determined by the automated method. This comparison showed that the automatic classification has an accuracy of 74%. Of the remaining 25% of the queries, the user intent is vague or multi-faceted, pointing to the need for probabilistic classification. We discuss how search engines can use knowledge of user intent to provide more targeted and relevant results in Web searching.

Original languageEnglish
Pages (from-to)1251-1266
Number of pages16
JournalInformation Processing and Management
Volume44
Issue number3
DOIs
Publication statusPublished - May 2008
Externally publishedYes

Fingerprint

Search engines
search engine
Application programs
World Wide Web
Query
Search engine
Software
Web search
Knowledge use
software

Keywords

  • Search engines
  • User intent
  • Web queries
  • Web searching

ASJC Scopus subject areas

  • Computer Science Applications
  • Information Systems
  • Library and Information Sciences

Cite this

Determining the informational, navigational, and transactional intent of Web queries. / Jansen, Bernard; Booth, Danielle L.; Spink, Amanda.

In: Information Processing and Management, Vol. 44, No. 3, 05.2008, p. 1251-1266.

Research output: Contribution to journalArticle

@article{a43fd0a07e7e4d6c9df033355f7ff25d,
title = "Determining the informational, navigational, and transactional intent of Web queries",
abstract = "In this paper, we define and present a comprehensive classification of user intent for Web searching. The classification consists of three hierarchical levels of informational, navigational, and transactional intent. After deriving attributes of each, we then developed a software application that automatically classified queries using a Web search engine log of over a million and a half queries submitted by several hundred thousand users. Our findings show that more than 80{\%} of Web queries are informational in nature, with about 10{\%} each being navigational and transactional. In order to validate the accuracy of our algorithm, we manually coded 400 queries and compared the results from this manual classification to the results determined by the automated method. This comparison showed that the automatic classification has an accuracy of 74{\%}. Of the remaining 25{\%} of the queries, the user intent is vague or multi-faceted, pointing to the need for probabilistic classification. We discuss how search engines can use knowledge of user intent to provide more targeted and relevant results in Web searching.",
keywords = "Search engines, User intent, Web queries, Web searching",
author = "Bernard Jansen and Booth, {Danielle L.} and Amanda Spink",
year = "2008",
month = "5",
doi = "10.1016/j.ipm.2007.07.015",
language = "English",
volume = "44",
pages = "1251--1266",
journal = "Information Processing and Management",
issn = "0306-4573",
publisher = "Elsevier Limited",
number = "3",

}

TY - JOUR

T1 - Determining the informational, navigational, and transactional intent of Web queries

AU - Jansen, Bernard

AU - Booth, Danielle L.

AU - Spink, Amanda

PY - 2008/5

Y1 - 2008/5

N2 - In this paper, we define and present a comprehensive classification of user intent for Web searching. The classification consists of three hierarchical levels of informational, navigational, and transactional intent. After deriving attributes of each, we then developed a software application that automatically classified queries using a Web search engine log of over a million and a half queries submitted by several hundred thousand users. Our findings show that more than 80% of Web queries are informational in nature, with about 10% each being navigational and transactional. In order to validate the accuracy of our algorithm, we manually coded 400 queries and compared the results from this manual classification to the results determined by the automated method. This comparison showed that the automatic classification has an accuracy of 74%. Of the remaining 25% of the queries, the user intent is vague or multi-faceted, pointing to the need for probabilistic classification. We discuss how search engines can use knowledge of user intent to provide more targeted and relevant results in Web searching.

AB - In this paper, we define and present a comprehensive classification of user intent for Web searching. The classification consists of three hierarchical levels of informational, navigational, and transactional intent. After deriving attributes of each, we then developed a software application that automatically classified queries using a Web search engine log of over a million and a half queries submitted by several hundred thousand users. Our findings show that more than 80% of Web queries are informational in nature, with about 10% each being navigational and transactional. In order to validate the accuracy of our algorithm, we manually coded 400 queries and compared the results from this manual classification to the results determined by the automated method. This comparison showed that the automatic classification has an accuracy of 74%. Of the remaining 25% of the queries, the user intent is vague or multi-faceted, pointing to the need for probabilistic classification. We discuss how search engines can use knowledge of user intent to provide more targeted and relevant results in Web searching.

KW - Search engines

KW - User intent

KW - Web queries

KW - Web searching

UR - http://www.scopus.com/inward/record.url?scp=40649101172&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=40649101172&partnerID=8YFLogxK

U2 - 10.1016/j.ipm.2007.07.015

DO - 10.1016/j.ipm.2007.07.015

M3 - Article

VL - 44

SP - 1251

EP - 1266

JO - Information Processing and Management

JF - Information Processing and Management

SN - 0306-4573

IS - 3

ER -