Use this identifier to cite or link to this record: https://hdl.handle.net/1822/36280

Full record
DC Field | Value | Language
dc.contributor.author | Pereira, Pedro | por
dc.contributor.author | Macedo, Joaquim | por
dc.contributor.author | Craveiro, Olga | por
dc.contributor.author | Madeira, Henrique | por
dc.date.accessioned | 2015-07-20T14:31:46Z | -
dc.date.available | 2015-07-20T14:31:46Z | -
dc.date.issued | 2014 | -
dc.identifier.isbn | 978-3-319-06027-9 | -
dc.identifier.issn | 0302-9743 | por
dc.identifier.uri | https://hdl.handle.net/1822/36280 | -
dc.description.abstract | There is a plethora of information on the Web. Even the top commercial search engines cannot download and index all the available information. So, in recent years, there have been several research works on the design and implementation of focused topic crawlers and also of geographic scope crawlers. Unlike other areas of information retrieval, research on Web crawling does not use the temporal information extracted from Web pages as a crawling criterion. Therefore, our research challenge is the use of temporal data extracted from Web pages as the main crawling criterion to satisfy a given temporal focus. The importance of the time dimension is greatly amplified when combined with topic or geography, but here we want to study it in isolation. The approach is based on temporal segmentation of Web page text. It only follows links within segments tagged with dates in the scope of restriction. A precision of around 75% was achieved in preliminary experimental results. | por
dc.description.sponsorship | (undefined) | por
dc.language.iso | eng | por
dc.publisher | Springer International Publishing AG | por
dc.rights | restrictedAccess | por
dc.subject | Web crawling | por
dc.subject | Temporal text segmentation | por
dc.subject | Temporal information extraction | por
dc.subject | Temporal information retrieval | por
dc.title | Time-aware focused web crawling | por
dc.type | conferencePaper | por
dc.peerreviewed | yes | por
sdum.publicationstatus | published | por
oaire.citationStartPage | 534 | por
oaire.citationEndPage | 539 | por
oaire.citationConferencePlace | Amsterdam | por
oaire.citationTitle | Advances in Information Retrieval | por
oaire.citationVolume | Lecture Notes in Computer Science, vol. 8416 | por
dc.identifier.doi | 10.1007/978-3-319-06028-6_53 | por
dc.subject.fos | Engineering and Technology::Electrical Engineering, Electronic Engineering, Information Engineering | por
sdum.journal | Lecture Notes in Computer Science (including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | por
sdum.conferencePublication | Advances in Information Retrieval | por
Appears in collections: CAlg - Artigos em livros de atas/Papers in proceedings

Files in this record:
File | Description | Size | Format
ECIR2014_WebCrawl.pdf | Restricted access | 1.56 MB | Adobe PDF
