Utilize este identificador para referenciar este registo:
https://hdl.handle.net/1822/17555
Título: | Clickstream data warehousing for web crawlers profiling |
Autor(es): | Lourenço, Anália Belo, Orlando |
Palavras-chave: | Data warehousing Clickstream data Web housing Web usage mining Web crawler profiling |
Data: | 2011 |
Editora: | IAENG |
Revista: | Lecture Notes in Engineering and Computer Science |
Resumo(s): | Web sites routinely monitor visitor traffic as a useful measure of their overall success. However, simple summaries such as the total number of visits per month provide little insight about individual site patterns, especially in a changing environment like the Web. In this paper it is described an approach to usage profiling based on clickstream data collected on several Web servers' sites and stored in a specialized clickstream data warehousing. We aim at providing valuable insights about common users, but also preventing unauthorised access to contents and any form of overload that might deteriorate site performance. Common crawler detection heuristics help to classify sessions, enabling the construction of site-specific profile training sets. Then, classification algorithms are used for building predictive models that can evaluate unseen sessions, namely their nature and potential site hazard, when they are still ongoing. |
Tipo: | Artigo em ata de conferência |
URI: | https://hdl.handle.net/1822/17555 |
ISBN: | 9789881821065 |
ISSN: | 2078-0958 |
Arbitragem científica: | yes |
Acesso: | Acesso restrito UMinho |
Aparece nas coleções: | CEB - Artigos em Livros de Atas / Papers in Proceedings |
Ficheiros deste registo:
Ficheiro | Descrição | Tamanho | Formato | |
---|---|---|---|---|
2011-CI-WCE-Lourenço&Belo-CRP-1p.pdf Acesso restrito! | 1ª Página do Artigo | 87,73 kB | Adobe PDF | Ver/Abrir |