Publicação

Distributed exact deduplication for primary storage infrastructures

Ver documento

Detalhes bibliográficos
Resumo:Deduplication of primary storage volumes in a cloud computing environment is increasingly desirable, as the resulting space savings contribute to the cost effectiveness of a large scale multi-tenant infrastructure. However, traditional archival and backup deduplication systems impose prohibitive overhead for latency-sensitive applications deployed at these infrastructures while, current primary deduplication systems rely on special cluster filesystems, centralized components, or restrictive workload assumptions. We present DEDIS, a fully-distributed and dependable system that performs exact and cluster-wide background deduplication of primary storage. DEDIS does not depend on data locality and works on top of any unsophisticated storage backend, centralized or distributed, that exports a basic shared block device interface. The evaluation of an open-source prototype shows that DEDIS scales out and adds negligible overhead even when deduplication and intensive storage I/O run simultaneously.
Autores principais:Paulo, João
Outros Autores:Pereira, José
Assunto:Deduplication Storage systems Distributed systems Cloud computing
Ano:2014
País:Portugal
Tipo de documento:comunicação em conferência
Tipo de acesso:acesso aberto
Instituição associada:Universidade do Minho
Idioma:inglês
Origem:RepositóriUM - Universidade do Minho
_version_ 1866878107992981504
author Paulo, João
author2 Pereira, José
author2_role author
author_facet Paulo, João
Pereira, José
author_role author
contributor_name_str_mv Universidade do Minho
country_str PT
creators_json_txt [{\"Person.name\":\"Paulo, João\"},{\"Person.name\":\"Pereira, José\"}]
datacite.contributors.contributor.contributorName.fl_str_mv Universidade do Minho
datacite.creators.creator.creatorName.fl_str_mv Paulo, João
Pereira, José
datacite.date.Accepted.fl_str_mv 2014-01-01T00:00:00Z
datacite.date.available.fl_str_mv 2015-07-07T11:22:29Z
datacite.date.embargoed.fl_str_mv 2015-07-07T11:22:29Z
datacite.rights.fl_str_mv http://purl.org/coar/access_right/c_abf2
datacite.subjects.subject.fl_str_mv Deduplication
Storage systems
Distributed systems
Cloud computing
datacite.titles.title.fl_str_mv Distributed exact deduplication for primary storage infrastructures
dc.contributor.none.fl_str_mv Universidade do Minho
dc.creator.none.fl_str_mv Paulo, João
Pereira, José
dc.date.Accepted.fl_str_mv 2014-01-01T00:00:00Z
dc.date.available.fl_str_mv 2015-07-07T11:22:29Z
dc.date.embargoed.fl_str_mv 2015-07-07T11:22:29Z
dc.format.none.fl_str_mv application/pdf
dc.identifier.none.fl_str_mv https://hdl.handle.net/1822/35971
dc.language.none.fl_str_mv eng
dc.publisher.none.fl_str_mv Springer
dc.rights.none.fl_str_mv http://purl.org/coar/access_right/c_abf2
dc.subject.none.fl_str_mv Deduplication
Storage systems
Distributed systems
Cloud computing
dc.title.fl_str_mv Distributed exact deduplication for primary storage infrastructures
dc.type.none.fl_str_mv http://purl.org/coar/resource_type/c_5794
description Deduplication of primary storage volumes in a cloud computing environment is increasingly desirable, as the resulting space savings contribute to the cost effectiveness of a large scale multi-tenant infrastructure. However, traditional archival and backup deduplication systems impose prohibitive overhead for latency-sensitive applications deployed at these infrastructures while, current primary deduplication systems rely on special cluster filesystems, centralized components, or restrictive workload assumptions. We present DEDIS, a fully-distributed and dependable system that performs exact and cluster-wide background deduplication of primary storage. DEDIS does not depend on data locality and works on top of any unsophisticated storage backend, centralized or distributed, that exports a basic shared block device interface. The evaluation of an open-source prototype shows that DEDIS scales out and adds negligible overhead even when deduplication and intensive storage I/O run simultaneously.
dirty 0
eu_rights_str_mv openAccess
format conferencePaper
fulltext.url.fl_str_mv https://prod-dspace.uminho.pt/bitstreams/f129538d-0ba4-41c1-b868-d6569d886116/download
id rum_2b968ef8356340fefcb98685174f962d
identifier.url.fl_str_mv https://hdl.handle.net/1822/35971
instacron_str repositorium
institution Universidade do Minho
instname_str Universidade do Minho
language eng
network_acronym_str rum
network_name_str RepositóriUM - Universidade do Minho
oai_identifier_str oai:repositorium.uminho.pt:1822/35971
organization_str_mv urn:organizationAcronym:repositorium
person_str_mv Paulo, João
Pereira, José
publishDate 2014
publisher.none.fl_str_mv Springer
reponame_str RepositóriUM - Universidade do Minho
repository_id_str urn:repositoryAcronym:rum
service_str_mv urn:repositoryAcronym:rum
spelling engSpringerporDeduplication of primary storage volumes in a cloud computing environment is increasingly desirable, as the resulting space savings contribute to the cost effectiveness of a large scale multi-tenant infrastructure. However, traditional archival and backup deduplication systems impose prohibitive overhead for latency-sensitive applications deployed at these infrastructures while, current primary deduplication systems rely on special cluster filesystems, centralized components, or restrictive workload assumptions. We present DEDIS, a fully-distributed and dependable system that performs exact and cluster-wide background deduplication of primary storage. DEDIS does not depend on data locality and works on top of any unsophisticated storage backend, centralized or distributed, that exports a basic shared block device interface. The evaluation of an open-source prototype shows that DEDIS scales out and adds negligible overhead even when deduplication and intensive storage I/O run simultaneously.application/pdfporDistributed exact deduplication for primary storage infrastructuresPaulo, JoãoPereira, JoséHostingInstitutionOrganizationalUniversidade do Minhoe-mailmailto:repositorium@usdb.uminho.ptrepositorium@usdb.uminho.ptISBNIsPartOf978-3-662-43351-5ISSNIsPartOf0302-9743DOIIsPartOf10.1007/978-3-662-43352-2_52015-07-07T11:22:29Z20142014-01-01T00:00:00ZHandlehttps://hdl.handle.net/1822/35971http://purl.org/coar/access_right/c_abf2open accessDeduplicationStorage systemsDistributed systemsCloud computing296289 bytesother research producthttp://purl.org/coar/resource_type/c_5794conference paperhttp://purl.org/coar/access_right/c_abf2application/pdffulltexthttps://prod-dspace.uminho.pt/bitstreams/f129538d-0ba4-41c1-b868-d6569d886116/download
spellingShingle Distributed exact deduplication for primary storage infrastructures
Paulo, João
Deduplication
Storage systems
Distributed systems
Cloud computing
status SINGLETON
subject.fl_str_mv Deduplication
Storage systems
Distributed systems
Cloud computing
title Distributed exact deduplication for primary storage infrastructures
title_full Distributed exact deduplication for primary storage infrastructures
title_fullStr Distributed exact deduplication for primary storage infrastructures
title_full_unstemmed Distributed exact deduplication for primary storage infrastructures
title_short Distributed exact deduplication for primary storage infrastructures
title_sort Distributed exact deduplication for primary storage infrastructures
topic Deduplication
Storage systems
Distributed systems
Cloud computing
topic_facet Deduplication
Storage systems
Distributed systems
Cloud computing
url https://hdl.handle.net/1822/35971
visible 1