Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/34001

Registo completo
Campo DCValorIdioma
dc.contributor.authorPaulo, Joãopor
dc.contributor.authorReis, Pedropor
dc.contributor.authorPereira, Josépor
dc.contributor.authorSousa, António Luíspor
dc.date.accessioned2015-02-19T13:51:40Z-
dc.date.available2015-02-19T13:51:40Z-
dc.date.issued2013-11-
dc.identifier.issn0267-6192-
dc.identifier.urihttps://hdl.handle.net/1822/34001-
dc.description.abstractDeduplication has proven to be a valuable technique for eliminating duplicate data in backup and archival systems and is now being applied to new storage environments with distinct requirements and performance trade-offs. Namely, deduplication system are now targeting large-scale cloud computing storage infrastructures holding unprecedented data volumes with a significant share of duplicate content. It is however hard to assess the usefulness of deduplication in particular settings and what techniques provide the best results. In fact, existing disk I/O benchmarks follow simplistic approaches for generating data content leading to unrealistic amounts of duplicates that do not evaluate deduplication systems accurately. Moreover, deduplication systems are now targeting heterogeneous storage environments, with specific duplication ratios, that benchmarks must also simulate. We address these issues with DEDISbench, a novel micro-benchmark for evaluating disk I/O performance of block based deduplication systems. As the main contribution, DEDISbench generates content by following realistic duplicate content distributions extracted from real datasets. Then, as a second contribution, we analyze and extract the duplicates found on three real storage systems, proving that DEDISbench can easily simulate several workloads. The usefulness of DEDISbench is shown by comparing it with Bonnie++ and IOzone open-source disk I/O micro-benchmarks on assessing two open-source deduplication systems, Opendedup and Lessfs, using Ext4 as a baseline. Our results lead to novel insight on the performance of these file systems.por
dc.description.sponsorshipThis work is funded by ERDF - European Regional Development Fund through the COMPETE Programme (operational programme for competitiveness) and by National Funds through the FCT - Fundacao para a Ciencia e a Tecnologia (Portuguese Foundation for Science and Technology) within project RED FCOMP-01-0124-FEDER-010156 and FCT by Ph.D scholarship SFRH-BD-71372-2010.por
dc.language.isoengpor
dc.publisherCRL Publishing por
dc.rightsopenAccesspor
dc.subjectDeduplicationpor
dc.subjectStoragepor
dc.subjectBenchmarkpor
dc.subjectCloud computingpor
dc.titleTowards an accurate evaluation of deduplicated storage systemspor
dc.typearticlepor
dc.peerreviewedyespor
dc.comments795por
sdum.publicationstatuspublishedpor
oaire.citationStartPage73por
oaire.citationEndPage83por
oaire.citationIssue1por
oaire.citationTitleComputer systems science and engineeringpor
oaire.citationVolume29por
dc.publisher.uriCRL Publishingpor
dc.subject.wosScience & Technologypor
sdum.journalComputer systems science and engineeringpor
Aparece nas coleções:HASLab - Artigos em revistas internacionais

Ficheiros deste registo:
Ficheiro TamanhoFormato 
795.pdf276,3 kBAdobe PDFVer/Abrir

Partilhe no FacebookPartilhe no TwitterPartilhe no DeliciousPartilhe no LinkedInPartilhe no DiggAdicionar ao Google BookmarksPartilhe no MySpacePartilhe no Orkut
Exporte no formato BibTex mendeley Exporte no formato Endnote Adicione ao seu ORCID