Publicação

DataHub e Apache Atlas: uma análise comparativa de ferramentas de catalogação de dados

Ver documento

Detalhes bibliográficos
Resumo:Big Data introduces a significant increase of complexity to projects, in which, the use of inadequate data will inevitably produce inadequate and incorrect analysis. Data Catalogs centralize the system’s metadata into one place, providing a global view of the stored data, so it is essential to use appropriate data catalog tools. The choice of the tool that best suits the needs of the projects must be well-founded. This paper uses the OSSpal methodology, usually used for comparing open-source technologies, to do a comparative analysis of two tools: DataHub and Apache Atlas.
Autores principais:Rodrigues, Diogo
Outros Autores:Almeida, Mariana; Guimarães, Pedro; Santos, Maribel Yasmina
Assunto:Apache Atlas Comparative analysis Data catalog DataHub OSSpal methodology Catalogação de dados Análise comparativa Metodologia OSSpal
Ano:2022
País:Portugal
Tipo de documento:comunicação em conferência
Tipo de acesso:acesso aberto
Instituição associada:Universidade do Minho
Idioma:português
Origem:RepositóriUM - Universidade do Minho
Descrição
Resumo:Big Data introduces a significant increase of complexity to projects, in which, the use of inadequate data will inevitably produce inadequate and incorrect analysis. Data Catalogs centralize the system’s metadata into one place, providing a global view of the stored data, so it is essential to use appropriate data catalog tools. The choice of the tool that best suits the needs of the projects must be well-founded. This paper uses the OSSpal methodology, usually used for comparing open-source technologies, to do a comparative analysis of two tools: DataHub and Apache Atlas.