Real time search user behavior

Bernard Jansen, Gerry Campbell, Matthew Gregg

Research output: Chapter in Book/Report/Conference proceedingConference contribution

14 Citations (Scopus)

Abstract

Real time search is an increasingly important area of information seeking on the Web. In this research, we analyze 1,005,296 user interactions with a real time search engine over a 190 day period. We investigate aggregate usage of the search engine, such as number of users, queries, and terms. We also investigate the structure of queries and terms submitted by these users. The results are compared to Web searching on traditional search engines. Results show that 60% of the traffic comes from the engine's application program interface, indicating that real time search is heavily leveraged by other applications. Of the queries, 30% were unique (used only once in the entire dataset). The most frequent query accounted for 0.003% of the query set. Less than 8% of the terms were unique. The most frequently used terms accounted for only 0.03% of the total terms. Concerning search topics, the most used terms dealt with technology, entertainment, and politics, reflecting both the temporal nature of the queries and, perhaps, an early adopter user-based. Sexual queries were quite low, relative to traditional Web search. Searchers of real time content often repeat queries overtime, perhaps indicating long term interest in a topic. We discuss the implications for search engines and information providers as real time content increasingly enters the main stream.

Original languageEnglish
Title of host publicationConference on Human Factors in Computing Systems - Proceedings
Pages3961-3966
Number of pages6
DOIs
Publication statusPublished - 2010
Externally publishedYes
Event28th Annual CHI Conference on Human Factors in Computing Systems, CHI 2010 - Atlanta, GA
Duration: 10 Apr 201015 Apr 2010

Other

Other28th Annual CHI Conference on Human Factors in Computing Systems, CHI 2010
CityAtlanta, GA
Period10/4/1015/4/10

Fingerprint

Search engines
Application programs
World Wide Web
Interfaces (computer)
Engines

Keywords

  • Collecta
  • Real time content
  • Real time search
  • Twitter

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Computer Graphics and Computer-Aided Design
  • Software

Cite this

Jansen, B., Campbell, G., & Gregg, M. (2010). Real time search user behavior. In Conference on Human Factors in Computing Systems - Proceedings (pp. 3961-3966) https://doi.org/10.1145/1753846.1754086

Real time search user behavior. / Jansen, Bernard; Campbell, Gerry; Gregg, Matthew.

Conference on Human Factors in Computing Systems - Proceedings. 2010. p. 3961-3966.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Jansen, B, Campbell, G & Gregg, M 2010, Real time search user behavior. in Conference on Human Factors in Computing Systems - Proceedings. pp. 3961-3966, 28th Annual CHI Conference on Human Factors in Computing Systems, CHI 2010, Atlanta, GA, 10/4/10. https://doi.org/10.1145/1753846.1754086
Jansen B, Campbell G, Gregg M. Real time search user behavior. In Conference on Human Factors in Computing Systems - Proceedings. 2010. p. 3961-3966 https://doi.org/10.1145/1753846.1754086
Jansen, Bernard ; Campbell, Gerry ; Gregg, Matthew. / Real time search user behavior. Conference on Human Factors in Computing Systems - Proceedings. 2010. pp. 3961-3966
@inproceedings{ef700779eb524bc1af9da0f397ac89d7,
title = "Real time search user behavior",
abstract = "Real time search is an increasingly important area of information seeking on the Web. In this research, we analyze 1,005,296 user interactions with a real time search engine over a 190 day period. We investigate aggregate usage of the search engine, such as number of users, queries, and terms. We also investigate the structure of queries and terms submitted by these users. The results are compared to Web searching on traditional search engines. Results show that 60{\%} of the traffic comes from the engine's application program interface, indicating that real time search is heavily leveraged by other applications. Of the queries, 30{\%} were unique (used only once in the entire dataset). The most frequent query accounted for 0.003{\%} of the query set. Less than 8{\%} of the terms were unique. The most frequently used terms accounted for only 0.03{\%} of the total terms. Concerning search topics, the most used terms dealt with technology, entertainment, and politics, reflecting both the temporal nature of the queries and, perhaps, an early adopter user-based. Sexual queries were quite low, relative to traditional Web search. Searchers of real time content often repeat queries overtime, perhaps indicating long term interest in a topic. We discuss the implications for search engines and information providers as real time content increasingly enters the main stream.",
keywords = "Collecta, Real time content, Real time search, Twitter",
author = "Bernard Jansen and Gerry Campbell and Matthew Gregg",
year = "2010",
doi = "10.1145/1753846.1754086",
language = "English",
isbn = "9781605589312",
pages = "3961--3966",
booktitle = "Conference on Human Factors in Computing Systems - Proceedings",

}

TY - GEN

T1 - Real time search user behavior

AU - Jansen, Bernard

AU - Campbell, Gerry

AU - Gregg, Matthew

PY - 2010

Y1 - 2010

N2 - Real time search is an increasingly important area of information seeking on the Web. In this research, we analyze 1,005,296 user interactions with a real time search engine over a 190 day period. We investigate aggregate usage of the search engine, such as number of users, queries, and terms. We also investigate the structure of queries and terms submitted by these users. The results are compared to Web searching on traditional search engines. Results show that 60% of the traffic comes from the engine's application program interface, indicating that real time search is heavily leveraged by other applications. Of the queries, 30% were unique (used only once in the entire dataset). The most frequent query accounted for 0.003% of the query set. Less than 8% of the terms were unique. The most frequently used terms accounted for only 0.03% of the total terms. Concerning search topics, the most used terms dealt with technology, entertainment, and politics, reflecting both the temporal nature of the queries and, perhaps, an early adopter user-based. Sexual queries were quite low, relative to traditional Web search. Searchers of real time content often repeat queries overtime, perhaps indicating long term interest in a topic. We discuss the implications for search engines and information providers as real time content increasingly enters the main stream.

AB - Real time search is an increasingly important area of information seeking on the Web. In this research, we analyze 1,005,296 user interactions with a real time search engine over a 190 day period. We investigate aggregate usage of the search engine, such as number of users, queries, and terms. We also investigate the structure of queries and terms submitted by these users. The results are compared to Web searching on traditional search engines. Results show that 60% of the traffic comes from the engine's application program interface, indicating that real time search is heavily leveraged by other applications. Of the queries, 30% were unique (used only once in the entire dataset). The most frequent query accounted for 0.003% of the query set. Less than 8% of the terms were unique. The most frequently used terms accounted for only 0.03% of the total terms. Concerning search topics, the most used terms dealt with technology, entertainment, and politics, reflecting both the temporal nature of the queries and, perhaps, an early adopter user-based. Sexual queries were quite low, relative to traditional Web search. Searchers of real time content often repeat queries overtime, perhaps indicating long term interest in a topic. We discuss the implications for search engines and information providers as real time content increasingly enters the main stream.

KW - Collecta

KW - Real time content

KW - Real time search

KW - Twitter

UR - http://www.scopus.com/inward/record.url?scp=77953100508&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77953100508&partnerID=8YFLogxK

U2 - 10.1145/1753846.1754086

DO - 10.1145/1753846.1754086

M3 - Conference contribution

SN - 9781605589312

SP - 3961

EP - 3966

BT - Conference on Human Factors in Computing Systems - Proceedings

ER -