Publicação
A survey of distributed data aggregation algorithms
| Resumo: | Distributed data aggregation is an important task, allowing the decentralized determination of meaningful global properties, which can then be used to direct the execution of other applications. The resulting values are derived by the distributed computation of functions like COUNT, SUM, and AVERAGE. Some application examples deal with the determination of the network size, total storage capacity, average load, majorities and many others. In the last decade, many different approaches have been proposed, with different trade-offs in terms of accuracy, reliability, message and time complexity. Due to the considerable amount and variety of aggregation algorithms, it can be difficult and time consuming to determine which techniques will be more appropriate to use in specific settings, justifying the existence of a survey to aid in this task. This work reviews the state of the art on distributed data aggregation algorithms, providing three main contributions. First, it formally defines the concept of aggregation, characterizing the different types of aggregation functions. Second, it succinctly describes the main aggregation techniques, organizing them in a taxonomy. Finally, it provides some guidelines toward the selection and use of the most relevant techniques, summarizing their principal characteristics. |
|---|---|
| Autores principais: | Jesus, Paulo Alexandre Marques |
| Outros Autores: | Baquero, Carlos; Almeida, Paulo Sérgio |
| Assunto: | Distributed algorithms Data aggregation Performance trade-offs Fault-tolerance |
| Ano: | 2015 |
| País: | Portugal |
| Tipo de documento: | artigo |
| Tipo de acesso: | acesso aberto |
| Instituição associada: | Universidade do Minho |
| Idioma: | inglês |
| Origem: | RepositóriUM - Universidade do Minho |
| _version_ | 1867437980819390464 |
|---|---|
| author | Jesus, Paulo Alexandre Marques |
| author2 | Baquero, Carlos Almeida, Paulo Sérgio |
| author2_role | author author |
| author_facet | Jesus, Paulo Alexandre Marques Baquero, Carlos Almeida, Paulo Sérgio |
| author_role | author |
| contributor_name_str_mv | RepositóriUM - Universidade do Minho |
| country_str | PT |
| creators_json_txt | [{\"Person.name\":\"Jesus, Paulo Alexandre Marques\"},{\"Person.name\":\"Baquero, Carlos\"},{\"Person.name\":\"Almeida, Paulo Sérgio\"}] |
| datacite.contributors.contributor.contributorName.fl_str_mv | RepositóriUM - Universidade do Minho |
| datacite.creators.creator.creatorName.fl_str_mv | Jesus, Paulo Alexandre Marques Baquero, Carlos Almeida, Paulo Sérgio |
| datacite.date.Accepted.fl_str_mv | 2015-01-01T00:00:00Z |
| datacite.date.available.fl_str_mv | 2018-03-05T11:40:52Z |
| datacite.date.embargoed.fl_str_mv | 2018-03-05T11:40:52Z |
| datacite.rights.fl_str_mv | http://purl.org/coar/access_right/c_abf2 |
| datacite.subjects.subject.fl_str_mv | Distributed algorithms Data aggregation Performance trade-offs Fault-tolerance |
| datacite.titles.title.fl_str_mv | A survey of distributed data aggregation algorithms |
| dc.contributor.none.fl_str_mv | RepositóriUM - Universidade do Minho |
| dc.creator.none.fl_str_mv | Jesus, Paulo Alexandre Marques Baquero, Carlos Almeida, Paulo Sérgio |
| dc.date.Accepted.fl_str_mv | 2015-01-01T00:00:00Z |
| dc.date.available.fl_str_mv | 2018-03-05T11:40:52Z |
| dc.date.embargoed.fl_str_mv | 2018-03-05T11:40:52Z |
| dc.format.none.fl_str_mv | application/pdf |
| dc.identifier.none.fl_str_mv | https://hdl.handle.net/1822/51509 |
| dc.language.none.fl_str_mv | eng |
| dc.publisher.none.fl_str_mv | Institute of Electrical and Electronics Engineers (IEEE) |
| dc.rights.none.fl_str_mv | http://purl.org/coar/access_right/c_abf2 |
| dc.subject.none.fl_str_mv | Distributed algorithms Data aggregation Performance trade-offs Fault-tolerance |
| dc.title.fl_str_mv | A survey of distributed data aggregation algorithms |
| dc.type.none.fl_str_mv | http://purl.org/coar/resource_type/c_6501 |
| description | Distributed data aggregation is an important task, allowing the decentralized determination of meaningful global properties, which can then be used to direct the execution of other applications. The resulting values are derived by the distributed computation of functions like COUNT, SUM, and AVERAGE. Some application examples deal with the determination of the network size, total storage capacity, average load, majorities and many others. In the last decade, many different approaches have been proposed, with different trade-offs in terms of accuracy, reliability, message and time complexity. Due to the considerable amount and variety of aggregation algorithms, it can be difficult and time consuming to determine which techniques will be more appropriate to use in specific settings, justifying the existence of a survey to aid in this task. This work reviews the state of the art on distributed data aggregation algorithms, providing three main contributions. First, it formally defines the concept of aggregation, characterizing the different types of aggregation functions. Second, it succinctly describes the main aggregation techniques, organizing them in a taxonomy. Finally, it provides some guidelines toward the selection and use of the most relevant techniques, summarizing their principal characteristics. |
| dirty | 0 |
| eu_rights_str_mv | openAccess |
| format | article |
| fulltext.url.fl_str_mv | https://repositorium.uminho.pt/bitstreams/2f91078d-0e98-4b04-8064-1aa747b7227d/download |
| id | rum_4523b80da07a3cf797ee00e2f8bcb4e4 |
| identifier.url.fl_str_mv | https://hdl.handle.net/1822/51509 |
| instacron_str | repositorium |
| institution | Universidade do Minho |
| instname_str | Universidade do Minho |
| language | eng |
| network_acronym_str | rum |
| network_name_str | RepositóriUM - Universidade do Minho |
| oai_identifier_str | oai:repositorium.uminho.pt:1822/51509 |
| organization_str_mv | urn:organizationAcronym:repositorium |
| person_str_mv | Jesus, Paulo Alexandre Marques Baquero, Carlos Almeida, Paulo Sérgio |
| publishDate | 2015 |
| publisher.none.fl_str_mv | Institute of Electrical and Electronics Engineers (IEEE) |
| reponame_str | RepositóriUM - Universidade do Minho |
| repository_id_str | urn:repositoryAcronym:rum |
| service_str_mv | urn:repositoryAcronym:rum |
| spelling | engInstitute of Electrical and Electronics Engineers (IEEE)porDistributed data aggregation is an important task, allowing the decentralized determination of meaningful global properties, which can then be used to direct the execution of other applications. The resulting values are derived by the distributed computation of functions like COUNT, SUM, and AVERAGE. Some application examples deal with the determination of the network size, total storage capacity, average load, majorities and many others. In the last decade, many different approaches have been proposed, with different trade-offs in terms of accuracy, reliability, message and time complexity. Due to the considerable amount and variety of aggregation algorithms, it can be difficult and time consuming to determine which techniques will be more appropriate to use in specific settings, justifying the existence of a survey to aid in this task. This work reviews the state of the art on distributed data aggregation algorithms, providing three main contributions. First, it formally defines the concept of aggregation, characterizing the different types of aggregation functions. Second, it succinctly describes the main aggregation techniques, organizing them in a taxonomy. Finally, it provides some guidelines toward the selection and use of the most relevant techniques, summarizing their principal characteristics.application/pdfporA survey of distributed data aggregation algorithmsJesus, Paulo Alexandre MarquesBaquero, CarlosAlmeida, Paulo SérgioHostingInstitutionOrganizationalRepositóriUM - Universidade do Minhoe-mailmailto:repositorium@usdb.uminho.ptrepositorium@usdb.uminho.ptISSNIsPartOf1553-877XDOIIsPartOf10.1109/COMST.2014.23543982018-03-05T11:40:52Z20152018-02-14T15:53:08Z2015-01-01T00:00:00ZHandlehttps://hdl.handle.net/1822/51509http://purl.org/coar/access_right/c_abf2open accessDistributed algorithmsData aggregationPerformance trade-offsFault-tolerance2128155 bytesliteraturehttp://purl.org/coar/resource_type/c_6501journal articlehttp://purl.org/coar/access_right/c_abf2application/pdffulltexthttps://repositorium.uminho.pt/bitstreams/2f91078d-0e98-4b04-8064-1aa747b7227d/download |
| spellingShingle | A survey of distributed data aggregation algorithms Jesus, Paulo Alexandre Marques Distributed algorithms Data aggregation Performance trade-offs Fault-tolerance |
| status | SINGLETON |
| subject.fl_str_mv | Distributed algorithms Data aggregation Performance trade-offs Fault-tolerance |
| title | A survey of distributed data aggregation algorithms |
| title_full | A survey of distributed data aggregation algorithms |
| title_fullStr | A survey of distributed data aggregation algorithms |
| title_full_unstemmed | A survey of distributed data aggregation algorithms |
| title_short | A survey of distributed data aggregation algorithms |
| title_sort | A survey of distributed data aggregation algorithms |
| topic | Distributed algorithms Data aggregation Performance trade-offs Fault-tolerance |
| topic_facet | Distributed algorithms Data aggregation Performance trade-offs Fault-tolerance |
| url | https://hdl.handle.net/1822/51509 |
| visible | 1 |