Publicação
Distributed exact deduplication for primary storage infrastructures
| Resumo: | Deduplication of primary storage volumes in a cloud computing environment is increasingly desirable, as the resulting space savings contribute to the cost effectiveness of a large scale multi-tenant infrastructure. However, traditional archival and backup deduplication systems impose prohibitive overhead for latency-sensitive applications deployed at these infrastructures while, current primary deduplication systems rely on special cluster filesystems, centralized components, or restrictive workload assumptions. We present DEDIS, a fully-distributed and dependable system that performs exact and cluster-wide background deduplication of primary storage. DEDIS does not depend on data locality and works on top of any unsophisticated storage backend, centralized or distributed, that exports a basic shared block device interface. The evaluation of an open-source prototype shows that DEDIS scales out and adds negligible overhead even when deduplication and intensive storage I/O run simultaneously. |
|---|---|
| Autores principais: | Paulo, João |
| Outros Autores: | Pereira, José |
| Assunto: | Deduplication Storage systems Distributed systems Cloud computing |
| Ano: | 2014 |
| País: | Portugal |
| Tipo de documento: | comunicação em conferência |
| Tipo de acesso: | acesso aberto |
| Instituição associada: | Universidade do Minho |
| Idioma: | inglês |
| Origem: | RepositóriUM - Universidade do Minho |
| _version_ | 1866878107992981504 |
|---|---|
| author | Paulo, João |
| author2 | Pereira, José |
| author2_role | author |
| author_facet | Paulo, João Pereira, José |
| author_role | author |
| contributor_name_str_mv | Universidade do Minho |
| country_str | PT |
| creators_json_txt | [{\"Person.name\":\"Paulo, João\"},{\"Person.name\":\"Pereira, José\"}] |
| datacite.contributors.contributor.contributorName.fl_str_mv | Universidade do Minho |
| datacite.creators.creator.creatorName.fl_str_mv | Paulo, João Pereira, José |
| datacite.date.Accepted.fl_str_mv | 2014-01-01T00:00:00Z |
| datacite.date.available.fl_str_mv | 2015-07-07T11:22:29Z |
| datacite.date.embargoed.fl_str_mv | 2015-07-07T11:22:29Z |
| datacite.rights.fl_str_mv | http://purl.org/coar/access_right/c_abf2 |
| datacite.subjects.subject.fl_str_mv | Deduplication Storage systems Distributed systems Cloud computing |
| datacite.titles.title.fl_str_mv | Distributed exact deduplication for primary storage infrastructures |
| dc.contributor.none.fl_str_mv | Universidade do Minho |
| dc.creator.none.fl_str_mv | Paulo, João Pereira, José |
| dc.date.Accepted.fl_str_mv | 2014-01-01T00:00:00Z |
| dc.date.available.fl_str_mv | 2015-07-07T11:22:29Z |
| dc.date.embargoed.fl_str_mv | 2015-07-07T11:22:29Z |
| dc.format.none.fl_str_mv | application/pdf |
| dc.identifier.none.fl_str_mv | https://hdl.handle.net/1822/35971 |
| dc.language.none.fl_str_mv | eng |
| dc.publisher.none.fl_str_mv | Springer |
| dc.rights.none.fl_str_mv | http://purl.org/coar/access_right/c_abf2 |
| dc.subject.none.fl_str_mv | Deduplication Storage systems Distributed systems Cloud computing |
| dc.title.fl_str_mv | Distributed exact deduplication for primary storage infrastructures |
| dc.type.none.fl_str_mv | http://purl.org/coar/resource_type/c_5794 |
| description | Deduplication of primary storage volumes in a cloud computing environment is increasingly desirable, as the resulting space savings contribute to the cost effectiveness of a large scale multi-tenant infrastructure. However, traditional archival and backup deduplication systems impose prohibitive overhead for latency-sensitive applications deployed at these infrastructures while, current primary deduplication systems rely on special cluster filesystems, centralized components, or restrictive workload assumptions. We present DEDIS, a fully-distributed and dependable system that performs exact and cluster-wide background deduplication of primary storage. DEDIS does not depend on data locality and works on top of any unsophisticated storage backend, centralized or distributed, that exports a basic shared block device interface. The evaluation of an open-source prototype shows that DEDIS scales out and adds negligible overhead even when deduplication and intensive storage I/O run simultaneously. |
| dirty | 0 |
| eu_rights_str_mv | openAccess |
| format | conferencePaper |
| fulltext.url.fl_str_mv | https://prod-dspace.uminho.pt/bitstreams/f129538d-0ba4-41c1-b868-d6569d886116/download |
| id | rum_2b968ef8356340fefcb98685174f962d |
| identifier.url.fl_str_mv | https://hdl.handle.net/1822/35971 |
| instacron_str | repositorium |
| institution | Universidade do Minho |
| instname_str | Universidade do Minho |
| language | eng |
| network_acronym_str | rum |
| network_name_str | RepositóriUM - Universidade do Minho |
| oai_identifier_str | oai:repositorium.uminho.pt:1822/35971 |
| organization_str_mv | urn:organizationAcronym:repositorium |
| person_str_mv | Paulo, João Pereira, José |
| publishDate | 2014 |
| publisher.none.fl_str_mv | Springer |
| reponame_str | RepositóriUM - Universidade do Minho |
| repository_id_str | urn:repositoryAcronym:rum |
| service_str_mv | urn:repositoryAcronym:rum |
| spelling | engSpringerporDeduplication of primary storage volumes in a cloud computing environment is increasingly desirable, as the resulting space savings contribute to the cost effectiveness of a large scale multi-tenant infrastructure. However, traditional archival and backup deduplication systems impose prohibitive overhead for latency-sensitive applications deployed at these infrastructures while, current primary deduplication systems rely on special cluster filesystems, centralized components, or restrictive workload assumptions. We present DEDIS, a fully-distributed and dependable system that performs exact and cluster-wide background deduplication of primary storage. DEDIS does not depend on data locality and works on top of any unsophisticated storage backend, centralized or distributed, that exports a basic shared block device interface. The evaluation of an open-source prototype shows that DEDIS scales out and adds negligible overhead even when deduplication and intensive storage I/O run simultaneously.application/pdfporDistributed exact deduplication for primary storage infrastructuresPaulo, JoãoPereira, JoséHostingInstitutionOrganizationalUniversidade do Minhoe-mailmailto:repositorium@usdb.uminho.ptrepositorium@usdb.uminho.ptISBNIsPartOf978-3-662-43351-5ISSNIsPartOf0302-9743DOIIsPartOf10.1007/978-3-662-43352-2_52015-07-07T11:22:29Z20142014-01-01T00:00:00ZHandlehttps://hdl.handle.net/1822/35971http://purl.org/coar/access_right/c_abf2open accessDeduplicationStorage systemsDistributed systemsCloud computing296289 bytesother research producthttp://purl.org/coar/resource_type/c_5794conference paperhttp://purl.org/coar/access_right/c_abf2application/pdffulltexthttps://prod-dspace.uminho.pt/bitstreams/f129538d-0ba4-41c1-b868-d6569d886116/download |
| spellingShingle | Distributed exact deduplication for primary storage infrastructures Paulo, João Deduplication Storage systems Distributed systems Cloud computing |
| status | SINGLETON |
| subject.fl_str_mv | Deduplication Storage systems Distributed systems Cloud computing |
| title | Distributed exact deduplication for primary storage infrastructures |
| title_full | Distributed exact deduplication for primary storage infrastructures |
| title_fullStr | Distributed exact deduplication for primary storage infrastructures |
| title_full_unstemmed | Distributed exact deduplication for primary storage infrastructures |
| title_short | Distributed exact deduplication for primary storage infrastructures |
| title_sort | Distributed exact deduplication for primary storage infrastructures |
| topic | Deduplication Storage systems Distributed systems Cloud computing |
| topic_facet | Deduplication Storage systems Distributed systems Cloud computing |
| url | https://hdl.handle.net/1822/35971 |
| visible | 1 |