Use this identifier to cite or link to this record: https://hdl.handle.net/1822/17555

Title: Clickstream data warehousing for web crawlers profiling
Author(s): Lourenço, Anália; Belo, Orlando
Keywords: Data warehousing; Clickstream data; Web housing; Web usage mining; Web crawler profiling
Date: 2011
Publisher: IAENG
Journal: Lecture Notes in Engineering and Computer Science
Abstract: Web sites routinely monitor visitor traffic as a useful measure of their overall success. However, simple summaries such as the total number of visits per month provide little insight into individual site patterns, especially in a changing environment like the Web. This paper describes an approach to usage profiling based on clickstream data collected from the sites of several Web servers and stored in a specialized clickstream data warehouse. We aim to provide valuable insights about common users, while also preventing unauthorised access to content and any form of overload that might degrade site performance. Common crawler detection heuristics help to classify sessions, enabling the construction of site-specific profile training sets. Classification algorithms are then used to build predictive models that can evaluate unseen sessions, namely their nature and potential hazard to the site, while they are still ongoing.
Type: Article in conference proceedings
URI: https://hdl.handle.net/1822/17555
ISBN: 9789881821065
ISSN: 2078-0958
Peer reviewed: yes
Access: Restricted access (UMinho)
Appears in collections:
CEB - Artigos em Livros de Atas / Papers in Proceedings
CAlg - Artigos em livros de atas/Papers in proceedings

Files in this record:
File | Description | Size | Format
2011-CI-WCE-Lourenço&Belo-CRP-1p.pdf (restricted access) | First page of the article | 87.73 kB | Adobe PDF
