Grouping Web page references into transactions for mining World Wide Web browsing patterns

R. Cooley, B. Mobasher, Jaideep Srivastava

Research output: Chapter in Book/Report/Conference proceedingConference contribution

63 Citations (Scopus)

Abstract

Web-based organizations often generate and collect large volumes of data in their daily operations. Analyzing such data involves the discovery of meaningful relationships from a large collection of primarily unstructured data, often stored in Web server access logs. While traditional domains for data mining, such as point of sale databases, have naturally defined transactions, there is no convenient method of clustering web references into transactions. This paper identifies a model of user browsing behavior that separates web page references into those made for navigation purposes and those for information content purposes. A transaction identification method based on the browsing model is defined and successfully tested against other methods, such as the maximal forward reference algorithm proposed in [1]. Transactions identified by the proposed methods are used to discover association rules from real world data using the WEBMINER system.

Original languageEnglish
Title of host publicationProceedings of the IEEE Knowledge & Data Engineering Exchange Workshop, KDEX
Editors Anon
Place of PublicationPiscataway, NJ, United States
PublisherIEEE
Pages2-9
Number of pages8
Publication statusPublished - 1997
Externally publishedYes
EventProceedings of the 1997 IEEE Knowledge & Data Engineering Exchange Workshop, KDEX - Newport Beach, CA, USA
Duration: 4 Nov 19974 Nov 1997

Other

OtherProceedings of the 1997 IEEE Knowledge & Data Engineering Exchange Workshop, KDEX
CityNewport Beach, CA, USA
Period4/11/974/11/97

Fingerprint

World Wide Web
Websites
Association rules
Data mining
Identification (control systems)
Sales
Navigation
Servers

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Cooley, R., Mobasher, B., & Srivastava, J. (1997). Grouping Web page references into transactions for mining World Wide Web browsing patterns. In Anon (Ed.), Proceedings of the IEEE Knowledge & Data Engineering Exchange Workshop, KDEX (pp. 2-9). Piscataway, NJ, United States: IEEE.

Grouping Web page references into transactions for mining World Wide Web browsing patterns. / Cooley, R.; Mobasher, B.; Srivastava, Jaideep.

Proceedings of the IEEE Knowledge & Data Engineering Exchange Workshop, KDEX. ed. / Anon. Piscataway, NJ, United States : IEEE, 1997. p. 2-9.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Cooley, R, Mobasher, B & Srivastava, J 1997, Grouping Web page references into transactions for mining World Wide Web browsing patterns. in Anon (ed.), Proceedings of the IEEE Knowledge & Data Engineering Exchange Workshop, KDEX. IEEE, Piscataway, NJ, United States, pp. 2-9, Proceedings of the 1997 IEEE Knowledge & Data Engineering Exchange Workshop, KDEX, Newport Beach, CA, USA, 4/11/97.
Cooley R, Mobasher B, Srivastava J. Grouping Web page references into transactions for mining World Wide Web browsing patterns. In Anon, editor, Proceedings of the IEEE Knowledge & Data Engineering Exchange Workshop, KDEX. Piscataway, NJ, United States: IEEE. 1997. p. 2-9
Cooley, R. ; Mobasher, B. ; Srivastava, Jaideep. / Grouping Web page references into transactions for mining World Wide Web browsing patterns. Proceedings of the IEEE Knowledge & Data Engineering Exchange Workshop, KDEX. editor / Anon. Piscataway, NJ, United States : IEEE, 1997. pp. 2-9
@inproceedings{e954b1acedbf4644bd1c5a8e5740c1a7,
title = "Grouping Web page references into transactions for mining World Wide Web browsing patterns",
abstract = "Web-based organizations often generate and collect large volumes of data in their daily operations. Analyzing such data involves the discovery of meaningful relationships from a large collection of primarily unstructured data, often stored in Web server access logs. While traditional domains for data mining, such as point of sale databases, have naturally defined transactions, there is no convenient method of clustering web references into transactions. This paper identifies a model of user browsing behavior that separates web page references into those made for navigation purposes and those for information content purposes. A transaction identification method based on the browsing model is defined and successfully tested against other methods, such as the maximal forward reference algorithm proposed in [1]. Transactions identified by the proposed methods are used to discover association rules from real world data using the WEBMINER system.",
author = "R. Cooley and B. Mobasher and Jaideep Srivastava",
year = "1997",
language = "English",
pages = "2--9",
editor = "Anon",
booktitle = "Proceedings of the IEEE Knowledge & Data Engineering Exchange Workshop, KDEX",
publisher = "IEEE",

}

TY - GEN

T1 - Grouping Web page references into transactions for mining World Wide Web browsing patterns

AU - Cooley, R.

AU - Mobasher, B.

AU - Srivastava, Jaideep

PY - 1997

Y1 - 1997

N2 - Web-based organizations often generate and collect large volumes of data in their daily operations. Analyzing such data involves the discovery of meaningful relationships from a large collection of primarily unstructured data, often stored in Web server access logs. While traditional domains for data mining, such as point of sale databases, have naturally defined transactions, there is no convenient method of clustering web references into transactions. This paper identifies a model of user browsing behavior that separates web page references into those made for navigation purposes and those for information content purposes. A transaction identification method based on the browsing model is defined and successfully tested against other methods, such as the maximal forward reference algorithm proposed in [1]. Transactions identified by the proposed methods are used to discover association rules from real world data using the WEBMINER system.

AB - Web-based organizations often generate and collect large volumes of data in their daily operations. Analyzing such data involves the discovery of meaningful relationships from a large collection of primarily unstructured data, often stored in Web server access logs. While traditional domains for data mining, such as point of sale databases, have naturally defined transactions, there is no convenient method of clustering web references into transactions. This paper identifies a model of user browsing behavior that separates web page references into those made for navigation purposes and those for information content purposes. A transaction identification method based on the browsing model is defined and successfully tested against other methods, such as the maximal forward reference algorithm proposed in [1]. Transactions identified by the proposed methods are used to discover association rules from real world data using the WEBMINER system.

UR - http://www.scopus.com/inward/record.url?scp=0031360049&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0031360049&partnerID=8YFLogxK

M3 - Conference contribution

SP - 2

EP - 9

BT - Proceedings of the IEEE Knowledge & Data Engineering Exchange Workshop, KDEX

A2 - Anon, null

PB - IEEE

CY - Piscataway, NJ, United States

ER -