Publicação

A survey of distributed data aggregation algorithms

Ver documento

Detalhes bibliográficos
Resumo:Distributed data aggregation is an important task, allowing the decentralized determination of meaningful global properties, which can then be used to direct the execution of other applications. The resulting values are derived by the distributed computation of functions like COUNT, SUM, and AVERAGE. Some application examples deal with the determination of the network size, total storage capacity, average load, majorities and many others. In the last decade, many different approaches have been proposed, with different trade-offs in terms of accuracy, reliability, message and time complexity. Due to the considerable amount and variety of aggregation algorithms, it can be difficult and time consuming to determine which techniques will be more appropriate to use in specific settings, justifying the existence of a survey to aid in this task. This work reviews the state of the art on distributed data aggregation algorithms, providing three main contributions. First, it formally defines the concept of aggregation, characterizing the different types of aggregation functions. Second, it succinctly describes the main aggregation techniques, organizing them in a taxonomy. Finally, it provides some guidelines toward the selection and use of the most relevant techniques, summarizing their principal characteristics.
Autores principais:Jesus, Paulo Alexandre Marques
Outros Autores:Baquero, Carlos; Almeida, Paulo Sérgio
Assunto:Distributed algorithms Data aggregation Performance trade-offs Fault-tolerance
Ano:2015
País:Portugal
Tipo de documento:artigo
Tipo de acesso:acesso aberto
Instituição associada:Universidade do Minho
Idioma:inglês
Origem:RepositóriUM - Universidade do Minho
_version_ 1867437980819390464
author Jesus, Paulo Alexandre Marques
author2 Baquero, Carlos
Almeida, Paulo Sérgio
author2_role author
author
author_facet Jesus, Paulo Alexandre Marques
Baquero, Carlos
Almeida, Paulo Sérgio
author_role author
contributor_name_str_mv RepositóriUM - Universidade do Minho
country_str PT
creators_json_txt [{\"Person.name\":\"Jesus, Paulo Alexandre Marques\"},{\"Person.name\":\"Baquero, Carlos\"},{\"Person.name\":\"Almeida, Paulo Sérgio\"}]
datacite.contributors.contributor.contributorName.fl_str_mv RepositóriUM - Universidade do Minho
datacite.creators.creator.creatorName.fl_str_mv Jesus, Paulo Alexandre Marques
Baquero, Carlos
Almeida, Paulo Sérgio
datacite.date.Accepted.fl_str_mv 2015-01-01T00:00:00Z
datacite.date.available.fl_str_mv 2018-03-05T11:40:52Z
datacite.date.embargoed.fl_str_mv 2018-03-05T11:40:52Z
datacite.rights.fl_str_mv http://purl.org/coar/access_right/c_abf2
datacite.subjects.subject.fl_str_mv Distributed algorithms
Data aggregation
Performance trade-offs
Fault-tolerance
datacite.titles.title.fl_str_mv A survey of distributed data aggregation algorithms
dc.contributor.none.fl_str_mv RepositóriUM - Universidade do Minho
dc.creator.none.fl_str_mv Jesus, Paulo Alexandre Marques
Baquero, Carlos
Almeida, Paulo Sérgio
dc.date.Accepted.fl_str_mv 2015-01-01T00:00:00Z
dc.date.available.fl_str_mv 2018-03-05T11:40:52Z
dc.date.embargoed.fl_str_mv 2018-03-05T11:40:52Z
dc.format.none.fl_str_mv application/pdf
dc.identifier.none.fl_str_mv https://hdl.handle.net/1822/51509
dc.language.none.fl_str_mv eng
dc.publisher.none.fl_str_mv Institute of Electrical and Electronics Engineers (IEEE)
dc.rights.none.fl_str_mv http://purl.org/coar/access_right/c_abf2
dc.subject.none.fl_str_mv Distributed algorithms
Data aggregation
Performance trade-offs
Fault-tolerance
dc.title.fl_str_mv A survey of distributed data aggregation algorithms
dc.type.none.fl_str_mv http://purl.org/coar/resource_type/c_6501
description Distributed data aggregation is an important task, allowing the decentralized determination of meaningful global properties, which can then be used to direct the execution of other applications. The resulting values are derived by the distributed computation of functions like COUNT, SUM, and AVERAGE. Some application examples deal with the determination of the network size, total storage capacity, average load, majorities and many others. In the last decade, many different approaches have been proposed, with different trade-offs in terms of accuracy, reliability, message and time complexity. Due to the considerable amount and variety of aggregation algorithms, it can be difficult and time consuming to determine which techniques will be more appropriate to use in specific settings, justifying the existence of a survey to aid in this task. This work reviews the state of the art on distributed data aggregation algorithms, providing three main contributions. First, it formally defines the concept of aggregation, characterizing the different types of aggregation functions. Second, it succinctly describes the main aggregation techniques, organizing them in a taxonomy. Finally, it provides some guidelines toward the selection and use of the most relevant techniques, summarizing their principal characteristics.
dirty 0
eu_rights_str_mv openAccess
format article
fulltext.url.fl_str_mv https://repositorium.uminho.pt/bitstreams/2f91078d-0e98-4b04-8064-1aa747b7227d/download
id rum_4523b80da07a3cf797ee00e2f8bcb4e4
identifier.url.fl_str_mv https://hdl.handle.net/1822/51509
instacron_str repositorium
institution Universidade do Minho
instname_str Universidade do Minho
language eng
network_acronym_str rum
network_name_str RepositóriUM - Universidade do Minho
oai_identifier_str oai:repositorium.uminho.pt:1822/51509
organization_str_mv urn:organizationAcronym:repositorium
person_str_mv Jesus, Paulo Alexandre Marques
Baquero, Carlos
Almeida, Paulo Sérgio
publishDate 2015
publisher.none.fl_str_mv Institute of Electrical and Electronics Engineers (IEEE)
reponame_str RepositóriUM - Universidade do Minho
repository_id_str urn:repositoryAcronym:rum
service_str_mv urn:repositoryAcronym:rum
spelling engInstitute of Electrical and Electronics Engineers (IEEE)porDistributed data aggregation is an important task, allowing the decentralized determination of meaningful global properties, which can then be used to direct the execution of other applications. The resulting values are derived by the distributed computation of functions like COUNT, SUM, and AVERAGE. Some application examples deal with the determination of the network size, total storage capacity, average load, majorities and many others. In the last decade, many different approaches have been proposed, with different trade-offs in terms of accuracy, reliability, message and time complexity. Due to the considerable amount and variety of aggregation algorithms, it can be difficult and time consuming to determine which techniques will be more appropriate to use in specific settings, justifying the existence of a survey to aid in this task. This work reviews the state of the art on distributed data aggregation algorithms, providing three main contributions. First, it formally defines the concept of aggregation, characterizing the different types of aggregation functions. Second, it succinctly describes the main aggregation techniques, organizing them in a taxonomy. Finally, it provides some guidelines toward the selection and use of the most relevant techniques, summarizing their principal characteristics.application/pdfporA survey of distributed data aggregation algorithmsJesus, Paulo Alexandre MarquesBaquero, CarlosAlmeida, Paulo SérgioHostingInstitutionOrganizationalRepositóriUM - Universidade do Minhoe-mailmailto:repositorium@usdb.uminho.ptrepositorium@usdb.uminho.ptISSNIsPartOf1553-877XDOIIsPartOf10.1109/COMST.2014.23543982018-03-05T11:40:52Z20152018-02-14T15:53:08Z2015-01-01T00:00:00ZHandlehttps://hdl.handle.net/1822/51509http://purl.org/coar/access_right/c_abf2open accessDistributed algorithmsData aggregationPerformance trade-offsFault-tolerance2128155 bytesliteraturehttp://purl.org/coar/resource_type/c_6501journal articlehttp://purl.org/coar/access_right/c_abf2application/pdffulltexthttps://repositorium.uminho.pt/bitstreams/2f91078d-0e98-4b04-8064-1aa747b7227d/download
spellingShingle A survey of distributed data aggregation algorithms
Jesus, Paulo Alexandre Marques
Distributed algorithms
Data aggregation
Performance trade-offs
Fault-tolerance
status SINGLETON
subject.fl_str_mv Distributed algorithms
Data aggregation
Performance trade-offs
Fault-tolerance
title A survey of distributed data aggregation algorithms
title_full A survey of distributed data aggregation algorithms
title_fullStr A survey of distributed data aggregation algorithms
title_full_unstemmed A survey of distributed data aggregation algorithms
title_short A survey of distributed data aggregation algorithms
title_sort A survey of distributed data aggregation algorithms
topic Distributed algorithms
Data aggregation
Performance trade-offs
Fault-tolerance
topic_facet Distributed algorithms
Data aggregation
Performance trade-offs
Fault-tolerance
url https://hdl.handle.net/1822/51509
visible 1