Use this identifier to cite or link to this record:
https://hdl.handle.net/1822/54562
Title: | Web crawler profiling and containment through navigation pattern mining |
Author(s): | Lourenço, Anália Maria Garcia; Belo, Orlando |
Keywords: | Clickstream Processing Crawling Profiling Data Webhousing Navigation Patterns Web Usage Mining |
Date: | 1-Nov-2009 |
Publisher: | International Association for Development of the Information Society (IADIS) |
Abstract(s): | Web profiles may support the analysis of Web site popularity as well as the detection of unwanted and illegitimate activities such as fraud. Yet, profiling techniques often fail to account for different usage, processing regular sessions, crawler sessions and proxy sessions in a similar way. This paper proposes an integrated approach to Web crawler profiling and containment. A data Webhousing embracing standard crawler detection techniques supplies the profiles to be further analysed through navigation pattern mining. The ability to adapt crawler identification to particular Web scenarios, the incremental analysis of navigation patterns, and the capacity of monitoring server performance and preventing crawler-related hazards are considered main strengths of this approach. Experiments over six-month Web server logs of a non-commercial Web site evidence the benefits of focused Web profiling and, in particular, of this approach. |
Type: | Conference proceedings paper |
URI: | https://hdl.handle.net/1822/54562 |
ISBN: | 978-972-8924-93-5 |
Publisher version: | http://www.iadisportal.org/digital-library/mdownload/web-crawler-profiling-and-containment-through-navigation-pattern-mining |
Peer reviewed: | yes |
Access: | Restricted access (author) |
Appears in collections: |
Files in this record:
File | Description | Size | Format | |
---|---|---|---|---|
2009-CI-WWWIADIS-Lourenco&Belo-CRP.pdf (restricted access) | | 282.24 kB | Adobe PDF | View/Open |