LOGML: Log markup language for web usage mining

John R. Punin, Mukkai S. Krishnamoorthy, Mohammed J. Zaki

Research output: Chapter in Book/Report/Conference proceedingConference contribution

18 Citations (Scopus)

Abstract

Web Usage Mining refers to the discovery of interesting information from user navigational behavior as stored in web access logs. While extracting simple information from web logs is easy, mining complex structural information is very challenging. Data cleaning and preparation constitute a very significant effort before mining can even be applied. We propose two new XML applications, XGMML and LOGML to help us in this task. XGMML is a graph description language and LOGML is a web-log report description language. We generate a web graph in XGMML format for a web site using the web robot of the WWWPal system. We generate web-log reports in LOGML format for a web site from web log files and the web graph. We further illustrate the usefulness of LOGML in web usage mining; we show the simplicity with which mining algorithms (for extracting increasingly complex frequent patterns) can be specified and implemented efficiently using LOGML.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer Verlag
Pages88-112
Number of pages25
Volume2356
ISBN (Print)3540439692, 9783540439691
Publication statusPublished - 2002
Externally publishedYes
Event3rd International Workshop on MiningWeb Log Data, WEBKDD, 2001 - San Francisco, United States
Duration: 26 Aug 200126 Aug 2001

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2356
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other3rd International Workshop on MiningWeb Log Data, WEBKDD, 2001
CountryUnited States
CitySan Francisco
Period26/8/0126/8/01

Fingerprint

Web Usage Mining
Markup languages
World Wide Web
Websites
Web Graph
Mining
Software agents
XML
Cleaning
Frequent Pattern
Language
User Behavior
Simplicity
Preparation
Robot

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Punin, J. R., Krishnamoorthy, M. S., & Zaki, M. J. (2002). LOGML: Log markup language for web usage mining. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2356, pp. 88-112). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 2356). Springer Verlag.

LOGML : Log markup language for web usage mining. / Punin, John R.; Krishnamoorthy, Mukkai S.; Zaki, Mohammed J.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 2356 Springer Verlag, 2002. p. 88-112 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 2356).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Punin, JR, Krishnamoorthy, MS & Zaki, MJ 2002, LOGML: Log markup language for web usage mining. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 2356, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 2356, Springer Verlag, pp. 88-112, 3rd International Workshop on MiningWeb Log Data, WEBKDD, 2001, San Francisco, United States, 26/8/01.
Punin JR, Krishnamoorthy MS, Zaki MJ. LOGML: Log markup language for web usage mining. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 2356. Springer Verlag. 2002. p. 88-112. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Punin, John R. ; Krishnamoorthy, Mukkai S. ; Zaki, Mohammed J. / LOGML : Log markup language for web usage mining. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 2356 Springer Verlag, 2002. pp. 88-112 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{483230af8d564594a8e9bb2d3b4d5333,
title = "LOGML: Log markup language for web usage mining",
abstract = "Web Usage Mining refers to the discovery of interesting information from user navigational behavior as stored in web access logs. While extracting simple information from web logs is easy, mining complex structural information is very challenging. Data cleaning and preparation constitute a very significant effort before mining can even be applied. We propose two new XML applications, XGMML and LOGML to help us in this task. XGMML is a graph description language and LOGML is a web-log report description language. We generate a web graph in XGMML format for a web site using the web robot of the WWWPal system. We generate web-log reports in LOGML format for a web site from web log files and the web graph. We further illustrate the usefulness of LOGML in web usage mining; we show the simplicity with which mining algorithms (for extracting increasingly complex frequent patterns) can be specified and implemented efficiently using LOGML.",
author = "Punin, {John R.} and Krishnamoorthy, {Mukkai S.} and Zaki, {Mohammed J.}",
year = "2002",
language = "English",
isbn = "3540439692",
volume = "2356",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "88--112",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - LOGML

T2 - Log markup language for web usage mining

AU - Punin, John R.

AU - Krishnamoorthy, Mukkai S.

AU - Zaki, Mohammed J.

PY - 2002

Y1 - 2002

N2 - Web Usage Mining refers to the discovery of interesting information from user navigational behavior as stored in web access logs. While extracting simple information from web logs is easy, mining complex structural information is very challenging. Data cleaning and preparation constitute a very significant effort before mining can even be applied. We propose two new XML applications, XGMML and LOGML to help us in this task. XGMML is a graph description language and LOGML is a web-log report description language. We generate a web graph in XGMML format for a web site using the web robot of the WWWPal system. We generate web-log reports in LOGML format for a web site from web log files and the web graph. We further illustrate the usefulness of LOGML in web usage mining; we show the simplicity with which mining algorithms (for extracting increasingly complex frequent patterns) can be specified and implemented efficiently using LOGML.

AB - Web Usage Mining refers to the discovery of interesting information from user navigational behavior as stored in web access logs. While extracting simple information from web logs is easy, mining complex structural information is very challenging. Data cleaning and preparation constitute a very significant effort before mining can even be applied. We propose two new XML applications, XGMML and LOGML to help us in this task. XGMML is a graph description language and LOGML is a web-log report description language. We generate a web graph in XGMML format for a web site using the web robot of the WWWPal system. We generate web-log reports in LOGML format for a web site from web log files and the web graph. We further illustrate the usefulness of LOGML in web usage mining; we show the simplicity with which mining algorithms (for extracting increasingly complex frequent patterns) can be specified and implemented efficiently using LOGML.

UR - http://www.scopus.com/inward/record.url?scp=84943161269&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84943161269&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84943161269

SN - 3540439692

SN - 9783540439691

VL - 2356

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 88

EP - 112

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

PB - Springer Verlag

ER -