Utilize este identificador para referenciar este registo:
https://hdl.handle.net/1822/52866
Registo completo
Campo DC | Valor | Idioma |
---|---|---|
dc.contributor.author | Maia, Francisco | por |
dc.contributor.author | Paulo, João | por |
dc.contributor.author | Coelho, Fábio | por |
dc.contributor.author | Neves, Francisco | por |
dc.contributor.author | Pereira, José | por |
dc.contributor.author | Oliveira, Rui Carlos Mendes de | por |
dc.date.accessioned | 2018-03-19T20:59:31Z | - |
dc.date.issued | 2017 | - |
dc.identifier.isbn | 9783319596648 | por |
dc.identifier.issn | 0302-9743 | - |
dc.identifier.uri | https://hdl.handle.net/1822/52866 | - |
dc.description.abstract | With the increasing number of connected devices, it becomes essential to find novel data management solutions that can leverage their computational and storage capabilities. However, developing very large scale data management systems requires tackling a number of interesting distributed systems challenges, namely continuous failures and high levels of node churn. In this context, epidemic-based protocols proved suitable and effective and have been successfully used to build DataFlasks, an epidemic data store for massive scale systems. Ensuring resiliency in this data store comes with a significant cost in storage resources and network bandwidth consumption. Deduplication has proven to be an efficient technique to reduce both costs but, applying it to a large-scale distributed storage system is not a trivial task. In fact, achieving significant space-savings without compromising the resiliency and decentralized design of these storage systems is a relevant research challenge. In this paper, we extend DataFlasks with deduplication to design DDFlasks. This system is evaluated in a real world scenario using Wikipedia snapshots, and the results are twofold. We show that deduplication is able to decrease storage consumption up to 63% and decrease network bandwidth consumption by up to 20%, while maintaining a fullydecentralized and resilient design. | por |
dc.description.sponsorship | The research leading to these results was part-funded by (1) Project TEC4Growth - Pervasive Intelligence, Enhancers and Proofs of Concept with Industrial Impact/NORTE-01-0145-FEDER-000020 is financed by the North Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement, and through the European Regional Development Fund (ERDF); (2) the ERDF European Regional Development Fund through the Operational Programme for Competitiveness and Internationalisation - COMPETE 2020 Programme within project POCI-01-0145-FEDER-006961, and by National Funds through the FCT Portuguese Foundation for Science and Technology as part of project UID/EEA/50014/2013 and by (3) the European Union's Horizon 2020 - The EU Framework Programme for Research and Innovation 2014-2020, under grant agreement No. 732051. | por |
dc.language.iso | eng | por |
dc.publisher | Springer Verlag | por |
dc.rights | restrictedAccess | por |
dc.title | DDFlasks: Deduplicated very large scale data store | por |
dc.type | conferencePaper | por |
dc.peerreviewed | yes | por |
oaire.citationStartPage | 51 | por |
oaire.citationEndPage | 66 | por |
oaire.citationVolume | 10320 | por |
dc.date.updated | 2018-03-16T12:13:43Z | - |
dc.identifier.doi | 10.1007/978-3-319-59665-5_4 | por |
dc.description.publicationversion | info:eu-repo/semantics/publishedVersion | por |
dc.subject.wos | Science & Technology | por |
sdum.export.identifier | 4549 | - |
sdum.journal | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | por |
sdum.conferencePublication | DISTRIBUTED APPLICATIONS AND INTEROPERABLE SYSTEMS, DAIS 2017 | por |
Aparece nas coleções: |
Ficheiros deste registo:
Ficheiro | Descrição | Tamanho | Formato | |
---|---|---|---|---|
P-00M-V91.pdf Acesso restrito! | 234,86 kB | Adobe PDF | Ver/Abrir |