Use this identifier to reference this record:
https://hdl.handle.net/1822/38817
Title: | DEDISbench: a benchmark for deduplicated storage systems |
Author(s): | Paulo, João; Reis, Pedro; Pereira, José; Sousa, António Luís |
Keywords: | Deduplication; Storage; Benchmark; Cloud computing |
Date: | 2012 |
Publisher: | Springer |
Journal: | Lecture Notes in Computer Science (including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
Abstract: | Deduplication is widely accepted as an effective technique for eliminating duplicated data in backup and archival systems. Deduplication is also becoming appealing in cloud computing, where large-scale virtualized storage infrastructures hold huge data volumes with a significant share of duplicated content. There have thus been several proposals for embedding deduplication in storage appliances and file systems, providing different performance trade-offs while targeting user and application data as well as virtual machine images. It is, however, hard to determine to what extent deduplication is useful in a particular setting and which technique will provide the best results. In fact, existing disk I/O micro-benchmarks are not designed for evaluating deduplication systems: they follow simplistic approaches for generating the written data, which lead to unrealistic amounts of duplicates. We address this with DEDISbench, a novel micro-benchmark for evaluating the disk I/O performance of block-based deduplication systems. As the main contribution, we introduce the generation of a realistic duplicate distribution based on real datasets. Moreover, DEDISbench also allows simulating access hotspots and different load intensities for I/O operations. The usefulness of DEDISbench is shown by comparing it with the open-source disk I/O micro-benchmarks Bonnie++ and IOzone in assessing two open-source deduplication systems, Opendedup and Lessfs, using Ext4 as a baseline. As a secondary contribution, our results lead to novel insight into the performance of these file systems. |
Type: | Conference paper |
Description: | Lecture Notes in Computer Science, 7566 |
URI: | https://hdl.handle.net/1822/38817 |
ISBN: | 978-3-642-33614-0 |
DOI: | 10.1007/978-3-642-33615-7_9 |
ISSN: | 0302-9743 |
Publisher version: | http://link.springer.com/chapter/10.1007%2F978-3-642-33615-7_9 |
Peer reviewed: | yes |
Access: | Open access |
Appears in collections: |