How to define searching sessions on web search engines

Bernard Jansen, Amanda Spink, Vinish Kathuria

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

12 Citations (Scopus)

Abstract

In this research, we investigate three techniques for defining user sessions on Web search engines. We analyze 2,465,145 interactions from 534,507 Web searchers. We compare three methods for defining sessions using: 1) Internet Protocol address and cookie; 2) Internet Protocol address, cookie, and a temporal limit on intra-session interactions; and 3) Internet Protocol address, cookie, and query reformulation patterns. The results show that defining sessions by query reformulation provides the best measure of session identification, with nearly 95% accuracy. This method also results in an 82% increase in the number of sessions compared to Internet Protocol address and cookie alone. Regardless of the method, mean session length was fewer than three queries and the mean session duration was less than 30 minutes. Implications are that unique sessions may be a better indicator than the common industry metric of unique visitors for measuring search traffic. Results of this research may lead to tools to better support Web searching.
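The three session definitions compared in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Interaction` record, the function names, and the sample log are hypothetical, the 30-minute cutoff is an assumed value (the abstract does not state the temporal limit used), and the "no shared terms" test is a deliberately crude stand-in for the query reformulation patterns the paper analyzes.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from itertools import groupby


@dataclass(frozen=True)
class Interaction:
    """One logged search interaction (hypothetical record layout)."""
    ip: str
    cookie: str
    time: datetime
    query: str


def by_user(log):
    """Method 1: one session per (IP address, cookie) pair."""
    ordered = sorted(log, key=lambda r: (r.ip, r.cookie, r.time))
    return [list(g) for _, g in groupby(ordered, key=lambda r: (r.ip, r.cookie))]


def split_sessions(log, is_boundary):
    """Split each user's interactions wherever is_boundary(prev, cur) is true."""
    sessions = []
    for records in by_user(log):
        current = [records[0]]
        for prev, cur in zip(records, records[1:]):
            if is_boundary(prev, cur):
                sessions.append(current)
                current = []
            current.append(cur)
        sessions.append(current)
    return sessions


def by_time_limit(log, limit=timedelta(minutes=30)):
    """Method 2: also split when the gap between interactions exceeds `limit`.

    The 30-minute default is an assumption made for this sketch.
    """
    return split_sessions(log, lambda p, c: c.time - p.time > limit)


def by_reformulation(log):
    """Method 3 (simplified): also split when consecutive queries share no terms.

    Term overlap is an assumed heuristic, not the paper's classification.
    """
    def new_topic(p, c):
        return not set(p.query.lower().split()) & set(c.query.lower().split())
    return split_sessions(log, new_topic)


# Made-up sample log, used only to exercise the three functions.
t0 = datetime(2006, 8, 20, 9, 0)
log = [
    Interaction("10.0.0.1", "c1", t0, "cheap flights"),
    Interaction("10.0.0.1", "c1", t0 + timedelta(minutes=2), "cheap flights paris"),
    Interaction("10.0.0.1", "c1", t0 + timedelta(minutes=5), "python tutorial"),
    Interaction("10.0.0.1", "c1", t0 + timedelta(minutes=50), "python tutorial pdf"),
    Interaction("10.0.0.2", "c2", t0, "weather"),
]

for method in (by_user, by_time_limit, by_reformulation):
    print(method.__name__, len(method(log)))
```

In this sample the first searcher changes topic after five minutes and returns 45 minutes later, so methods 2 and 3 place the session boundary at different points even though both report more sessions than method 1.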

Original language: English
Title of host publication: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages: 92-109
Number of pages: 18
Volume: 4811 LNAI
Publication status: Published - 2007
Externally published: Yes
Event: 8th International Workshop on Knowledge Discovery on the Web, WebKDD 2006 - Philadelphia, PA
Duration: 20 Aug 2006 → 20 Aug 2006

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 4811 LNAI
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 8th International Workshop on Knowledge Discovery on the Web, WebKDD 2006
City: Philadelphia, PA
Period: 20/8/06 → 20/8/06

Fingerprint

Search Engine
Internet protocols
Web Search
Search engines
Internet
Research
Query
Reformulation
Industry
Interaction
Traffic
Metric

Keywords

  • Markov states
  • Query reformulation
  • Web queries
  • Web sessions

ASJC Scopus subject areas

  • Computer Science(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Theoretical Computer Science

Cite this

Jansen, B., Spink, A., & Kathuria, V. (2007). How to define searching sessions on web search engines. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4811 LNAI, pp. 92-109). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4811 LNAI).

How to define searching sessions on web search engines. / Jansen, Bernard; Spink, Amanda; Kathuria, Vinish.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4811 LNAI 2007. p. 92-109 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4811 LNAI).

Jansen, B, Spink, A & Kathuria, V 2007, How to define searching sessions on web search engines. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 4811 LNAI, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 4811 LNAI, pp. 92-109, 8th International Workshop on Knowledge Discovery on the Web, WebKDD 2006, Philadelphia, PA, 20/8/06.
Jansen B, Spink A, Kathuria V. How to define searching sessions on web search engines. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4811 LNAI. 2007. p. 92-109. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Jansen, Bernard ; Spink, Amanda ; Kathuria, Vinish. / How to define searching sessions on web search engines. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4811 LNAI 2007. pp. 92-109 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{0be545b9469c4579ad580bae78b16292,
title = "How to define searching sessions on web search engines",
abstract = "In this research, we investigate three techniques for defining user sessions on Web search engines. We analyze 2,465,145 interactions from 534,507 Web searchers. We compare three methods for defining sessions using: 1) Internet Protocol address and cookie; 2) Internet Protocol address, cookie, and a temporal limit on intra-session interactions; and 3) Internet Protocol address, cookie, and query reformulation patterns. The results show that defining sessions by query reformulation provides the best measure of session identification, with nearly 95{\%} accuracy. This method also results in an 82{\%} increase in the number of sessions compared to Internet Protocol address and cookie alone. Regardless of the method, mean session length was fewer than three queries and the mean session duration was less than 30 minutes. Implications are that unique sessions may be a better indicator than the common industry metric of unique visitors for measuring search traffic. Results of this research may lead to tools to better support Web searching.",
keywords = "Markov states, Query reformulation, Web queries, Web sessions",
author = "Bernard Jansen and Amanda Spink and Vinish Kathuria",
year = "2007",
language = "English",
isbn = "354077484X",
volume = "4811 LNAI",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "92--109",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - How to define searching sessions on web search engines

AU - Jansen, Bernard

AU - Spink, Amanda

AU - Kathuria, Vinish

PY - 2007

Y1 - 2007

N2 - In this research, we investigate three techniques for defining user sessions on Web search engines. We analyze 2,465,145 interactions from 534,507 Web searchers. We compare three methods for defining sessions using: 1) Internet Protocol address and cookie; 2) Internet Protocol address, cookie, and a temporal limit on intra-session interactions; and 3) Internet Protocol address, cookie, and query reformulation patterns. The results show that defining sessions by query reformulation provides the best measure of session identification, with nearly 95% accuracy. This method also results in an 82% increase in the number of sessions compared to Internet Protocol address and cookie alone. Regardless of the method, mean session length was fewer than three queries and the mean session duration was less than 30 minutes. Implications are that unique sessions may be a better indicator than the common industry metric of unique visitors for measuring search traffic. Results of this research may lead to tools to better support Web searching.

KW - Markov states

KW - Query reformulation

KW - Web queries

KW - Web sessions

UR - http://www.scopus.com/inward/record.url?scp=38549108348&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=38549108348&partnerID=8YFLogxK

M3 - Conference contribution

SN - 354077484X

SN - 9783540774846

VL - 4811 LNAI

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 92

EP - 109

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -