Publication
Mining GitHub software repositories to look for programming language cocktails
| Abstract: | In light of specific development needs, it is common to apply different technologies concurrently to build complex applications. Given that lowering risks, costs, and other negative factors, while improving their positive counterparts, is paramount to a better development environment, it becomes relevant to find out which technologies work best for each intended purpose in a project. To reach these findings, it is necessary to analyse and study the technologies applied in these projects and how they interconnect and relate to each other. The theory behind Programming Cocktails (meaning the set of programming technologies, or Ingredients, that are used to develop complex systems) can support this analysis. However, due to the sheer amount of data required to construct and analyse these Cocktails, obtaining them manually is unsustainable. From the desire to accelerate this process comes the need for a tool that automates the data collection and its conversion into an appropriate format for analysis. As such, the project proposed in this paper revolves around the development of a web-scraping application that can generate Cocktail Identity Cards (CICs) from source code repositories hosted on GitHub. These CICs contain the Ingredients (programming languages, libraries and frameworks) used in the corresponding GitHub repository and follow the ontology previously established in a larger research project to model each Programming Cocktail. This paper presents a survey of current Source Version Control Systems (SVCSs) and web-scraping technologies, an overview of Programming Cocktails and their current foundations, and the design of a tool that can automate the gathering of CICs from GitHub repositories. |
|---|---|
| Main author: | Loureiro, João |
| Other authors: | Costa Neto, Alvaro; Pereira, Maria João; Henriques, Pedro Rangel |
| Subject: | Software repository mining; Source version control; GitHub scraping; Programming cocktails |
| Year: | 2025 |
| Country: | Portugal |
| Document type: | conference paper |
| Access type: | open access |
| Associated institution: | Instituto Politécnico de Bragança |
| Language: | English |
| Source: | Biblioteca Digital do IPB |
Related records
Adoção de métricas ágeis integradas com a plataforma GITHUB
by: Santos, Bruna Amorim dos
Published: (2022)
Sahub - stackoverflow and comments integrations
by: Oliveira, André
Published: (2024)
An Ontology to Understand Programming Cocktails
by: Neto Costa, Alvaro
Published: (2024)
Versus: a Web Data Repository with Time Support
by: Campos, João P.
Published: (2003)
Application of programming cocktails identity cards to development complexity analysis
by: Costa Neto, Alvaro
Published: (2024)
Business plan: Cocktail and mocktail robot
by: Fortes, Daniel Simões Teles
Published: (2023)
Programming cocktail analysis based on the cognitive load theory, a first approach
by: Costa Neto, Alvaro
Published: (2025)
Looking back at the EUNIS repository data using text mining techniques
by: Cardoso, E.
Published: (2016)
Effective reduction of Salmonella Enteritidis in broiler chickens using the UPWr_S134 phage cocktail
by: Kuzminska-Bajor, Marta
Published: (2023)
Unstructuring the sequentiality of commits into a semantic network with higher informational and functional quality
by: Patricio, Manuel
Published: (2024)
Development of an anti-Acinetobacter baumannii biofilm phage cocktail: Genomic adaptation to the Host
by: Blasco, L.
Published: (2022)
Development of a Phage Cocktail to Prevent Catheter-Associated Urinary Tract Infections
by: Amorim, Catarina Domingues
Published: (2017)
The investigation on the workability of fibre cocktail reinforced self-compacting high performance concrete
by: Ding Yining
Published: (2008)
Development of a phage cocktail to control Proteus mirabilis catheter-associated urinary tract infections
by: Melo, Luís Daniel Rodrigues
Published: (2016)
The investigation on strength and flexural toughness of fibre cocktail reinforced self-compacting high performance concrete
by: Ding Yining
Published: (2009)
Next generation repositories: scaling up repositories to a global knowledge commons
by: Shearer, Kathleen
Published: (2018)
The Modelery: a model-based software development repository
by: Couto, Rui
Published: (2015)
RepositóriUM: the implementation of an Institutional Repository
by: Ferreira, Miguel
Published: (2007)
High-pressure processing and lytic bacteriophage cocktail phageguard S as a synergistic hurdle system towards salmonella inactivation in egg white
by: Maciel, Cláudia
Published: (2021)
Improving usage statistics for RepositóriUM
by: Rodrigues, Eloy
Published: (2007)
Development of usage statistics for RepositóriUM
by: Rodrigues, Eloy
Published: (2006)
Meet RODA, a full-fledged digital repository for long-term preservation
by: Castro, Rui
Published: (2011)
Saccharification of different sugarcane bagasse varieties by enzymatic cocktails produced by Mycothermus thermophilus and Trichoderma reesei RP698 cultures in agro-industrial residues
by: Scarcella, Ana Sílvia de Almeida
Published: (2021)
Entrapment of a phage cocktail and cinnamaldehyde on sodium alginate emulsion-based films to fight food contamination by Escherichia coli and Salmonella Enteritidis
by: Alves, Diana Filipa Barros
Published: (2020)
Symbiotic honey beverages: a matrix which tells a story of survival and protection of human health from a gastronomic and industrial perspective
by: Pinto Neto, Walter de Paula
Published: (2025)
Social ideas in unsocial environments: bringing social software concepts to digital repositories
by: Ferreira, Miguel
Published: (2007)
Guidelines for legacy repository migration
by: Ferreira, Miguel
Published: (2013)
Repository conversations
by: Baptista, Ana Alice
Published: (2006)
Institutional repositories and increased use in times of a pandemic: true image or mirage?
by: Melo, Luiza Baptista
Published: (2022)
A tale of two repositories: linking the local and the global
by: Chan, Leslie
Published: (2006)
A secure personal health record repository
by: Pedrosa, Tiago
Published: (2012)
Establishment of in vitro gastrointestinal models platform for human food and animal feed ingredients development
by: Carvalho, Nelson Fernando Mota de
Published: (2024)
The modelery: a collaborative web based repository
by: Couto, Rui
Published: (2014)
OpenAIRE dashboard for repository managers: from repositories for repositories
by: Príncipe, Pedro
Published: (2017)
Estágio Curricular nos Departamentos de Data Science e Software Development no Sport Lisboa e Benfica
by: Alcarva, Rodrigo Heliodoro Vilaça Santos
Published: (2021)
The behavior of academic investigators using an institutional repository
by: Sarmento e Souza, M. F.
Published: (2004)
Ingest with RODA: the present and the future of repository ingest
by: Faria, Luís
Published: (2011)
An OpenEHR repository based on a native XML database
by: Velte, Linda
Published: (2012)
Up-scalable approaches for yeast mannan oligosaccharides (MOS) production: characterization and immunomodulatory properties
by: Faustino, Margarida
Published: (2024)
Potential prebiotic effect of Caatinga bee honeys from the Pajeú hinterland (Pernambuco, Brazil) on synbiotic alcoholic beverages fermented by Saccharomyces boulardii CNCM I-745
by: Pinto-Neto, Walter de Paula
Published: (2025)