LOGML: Log markup language for web usage mining

John R. Punin, Mukkai S. Krishnamoorthy, Mohammed J. Zaki

Research output: Chapter in Book/Report/Conference proceedingConference contribution

18 Citations (Scopus)

Abstract

Web Usage Mining refers to the discovery of interesting information from user navigational behavior as stored in web access logs. While extracting simple information from web logs is easy, mining complex structural information is very challenging. Data cleaning and preparation constitute a very significant effort before mining can even be applied. We propose two new XML applications, XGMML and LOGML to help us in this task. XGMML is a graph description language and LOGML is a web-log report description language. We generate a web graph in XGMML format for a web site using the web robot of the WWWPal system. We generate web-log reports in LOGML format for a web site from web log files and the web graph. We further illustrate the usefulness of LOGML in web usage mining; we show the simplicity with which mining algorithms (for extracting increasingly complex frequent patterns) can be specified and implemented efficiently using LOGML.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer Verlag
Pages88-112
Number of pages25
Volume2356
ISBN (Print)3540439692, 9783540439691
Publication statusPublished - 2002
Externally publishedYes
Event3rd International Workshop on MiningWeb Log Data, WEBKDD, 2001 - San Francisco, United States
Duration: 26 Aug 200126 Aug 2001

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2356
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other3rd International Workshop on MiningWeb Log Data, WEBKDD, 2001
CountryUnited States
CitySan Francisco
Period26/8/0126/8/01

    Fingerprint

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Punin, J. R., Krishnamoorthy, M. S., & Zaki, M. J. (2002). LOGML: Log markup language for web usage mining. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2356, pp. 88-112). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 2356). Springer Verlag.