Publicação
Transfer learning with audioSet to voice pathologies identification in continuous speech
| Resumo: | The classification of pathological diseases with the implementation of concepts of Deep Learning has been increasing considerably in recent times. Among the works developed there are good results for the classification in sustained speech with vowels, but few related works for the classification in continuous speech. This work uses the German Saarbrücken Voice Database with the phrase “Guten Morgen, wie geht es Ihnen?” to classify four classes: dysphonia, laryngitis, paralysis of vocal cords and healthy voices. Transfer learning concepts were used with the AudioSet database. Two models were developed based on Long-Short-Term-Memory and Convolutional Network for classification of extracted embeddings and comparison of the best results, using cross-validation. The final results allowed to obtaining 40% of f1-score for the four classes, 66% f1-score for Dysphonia x Healthy, 67% for Laryngitis x healthy and 80% for Paralysis x Healthy. |
|---|---|
| Autores principais: | Guedes, Victor |
| Outros Autores: | Teixeira, Felipe; Oliveira, Alessa Anjos de; Fernandes, Joana Filipa Teixeira; Silva, Letícia; Candido Junior, Arnaldo; Teixeira, João Paulo |
| Assunto: | Long short term memory Convolutional neural network SVD Deep learning Voice pathologies diagnose |
| Ano: | 2019 |
| País: | Portugal |
| Tipo de documento: | comunicação em conferência |
| Tipo de acesso: | acesso aberto |
| Instituição associada: | Instituto Politécnico de Bragança |
| Idioma: | inglês |
| Origem: | Biblioteca Digital do IPB |
Registos relacionados
article Algorithm for Jitter and Shimmer Measurement in Pathologic Voices
por: Teixeira, João Paulo
Publicado em: (2016)
por: Teixeira, João Paulo
Publicado em: (2016)
groups Low band continuous speech system for voice pathologies identification
por: Cordeiro, Hugo
Publicado em: (2018)
por: Cordeiro, Hugo
Publicado em: (2018)
article Long short term memory on chronic laryngitis classification
por: Guedes, Victor
Publicado em: (2018)
por: Guedes, Victor
Publicado em: (2018)
article Clustering of voice pathologies based on sustained voice parameters
por: Oliveira, Alessa Anjos de
Publicado em: (2020)
por: Oliveira, Alessa Anjos de
Publicado em: (2020)
groups Voice Pathologies Identification Speech signals, features and classifiers evaluation
por: Cordeiro, Hugo
Publicado em: (2015)
por: Cordeiro, Hugo
Publicado em: (2015)
groups Spectral features of healthy and pathological voices: results comparison between two databases
por: Cordeiro, Hugo
Publicado em: (2019)
por: Cordeiro, Hugo
Publicado em: (2019)
groups Low band spectral tilt analysis for pathological voice discrimination
por: Cordeiro, Hugo
Publicado em: (2019)
por: Cordeiro, Hugo
Publicado em: (2019)
article Voice pathologies : the most comum features and classification tools
por: Fernandes, Joana Filipa Pinto
Publicado em: (2021)
por: Fernandes, Joana Filipa Pinto
Publicado em: (2021)
article Statistical analysis of voice parameters in healthy subjects and with vocal pathologies - HNR
por: André, Débora Cucubica
Publicado em: (2022)
por: André, Débora Cucubica
Publicado em: (2022)
article First version of a support system for the medical diagnosis of pathologies in the larynx
por: Fernandes, Joana
Publicado em: (2023)
por: Fernandes, Joana
Publicado em: (2023)
article Clustering pathologic voice with kohonen SOM and hierarchical clustering
por: Teixeira, João Paulo
Publicado em: (2021)
por: Teixeira, João Paulo
Publicado em: (2021)
article On the Capacity of Nonlinear Massive MIMO-OFDM Systems
por: Fernandes, Pedro
Publicado em: (2016)
por: Fernandes, Pedro
Publicado em: (2016)
groups On the Assessment of Nonlinear Distortion Effects in MIMO-OFDM Systems
por: Guerreiro, João
Publicado em: (2016)
por: Guerreiro, João
Publicado em: (2016)
article Geothermal heat exchanger’s temperature input sensor prediction based on deep learning modelling technique
por: Oliveira, P.
Publicado em: (2024)
por: Oliveira, P.
Publicado em: (2024)
article Hierarchical classification and system combination for automatically identifying physiological and neuromuscular laryngeal pathologies
por: Cordeiro, Hugo
Publicado em: (2017)
por: Cordeiro, Hugo
Publicado em: (2017)
article Attention and emotion shape self-voice prioritization in speech processing
por: Pinheiro, Ana P.
Publicado em: (2023)
por: Pinheiro, Ana P.
Publicado em: (2023)
article A deep learning approach to forecast the influent flow in wastewater treatment plants
por: Oliveira, Pedro
Publicado em: (2020)
por: Oliveira, Pedro
Publicado em: (2020)
article Accuracy Optimization in Speech Pathology Diagnosis with Data Preprocessing Techniques
por: Fernandes, Joana Filipa Teixeira
Publicado em: (2024)
por: Fernandes, Joana Filipa Teixeira
Publicado em: (2024)
groups Using transfer learning for classification of gait pathologies
por: Verlekar, T. T.
Publicado em: (2018)
por: Verlekar, T. T.
Publicado em: (2018)
article Smart data driven system for pathological voices classification
por: Fernandes, Joana
Publicado em: (2022)
por: Fernandes, Joana
Publicado em: (2022)
article "Speak from every mouth – the speech, a poem": conflicting voices, discourses and identities in the poetry of Robert Browning
por: Guimarães, Paula Alexandra
Publicado em: (2011)
por: Guimarães, Paula Alexandra
Publicado em: (2011)
article Determination of harmonic parameters in pathological voices-efficient algorithm
por: Fernandes, Joana Filipa Teixeira
Publicado em: (2023)
por: Fernandes, Joana Filipa Teixeira
Publicado em: (2023)
school Psychological features of functional voice disorders
por: Andrea, Mafalda Bordalo
Publicado em: (2018)
por: Andrea, Mafalda Bordalo
Publicado em: (2018)
article Vocal acoustic analysis: ANN versos SVM in classification of dysphonic voices and vocal cords paralysis
por: Teixeira, João Paulo
Publicado em: (2020)
por: Teixeira, João Paulo
Publicado em: (2020)
school Glioblastoma : diferenças entre sobreviventes curtos e longos numa série do Hospital de Santa Maria
por: Spac, Irina
Publicado em: (2022)
por: Spac, Irina
Publicado em: (2022)
school Deep learning aplicado a classificação de patologias da voz
por: Guedes, Victor
Publicado em: (2019)
por: Guedes, Victor
Publicado em: (2019)
groups Parâmetros espectrais de vozes saudáveis e patológicas: Comparação de resultados entre duas base de dados
por: Cordeiro, Hugo
Publicado em: (2019)
por: Cordeiro, Hugo
Publicado em: (2019)
article Development of head and neck pathology in Europe
por: Hellquist, Henrik
Publicado em: (2022)
por: Hellquist, Henrik
Publicado em: (2022)
school Emotional contagion in voice-to-voice service encounters: A dynamic approach to the influence of customers on employees’ behavior and welfare
por: Lopes, Maria Rita Rueff Negrão Mendonça
Publicado em: (2015)
por: Lopes, Maria Rita Rueff Negrão Mendonça
Publicado em: (2015)
article A mixed approach to the heterogeneity of the short-term rentals’ regulation in Spain
por: Viana-Lora, Alba
Publicado em: (2025)
por: Viana-Lora, Alba
Publicado em: (2025)
school Predictors of early readmission in chronic heart failure : REFERENCE (pREdictors oF Early REadmission iN Chronic hEart failure)
por: Barbosa, Mário Augusto Rodrigues Teixeira, 1980-
Publicado em: (2019)
por: Barbosa, Mário Augusto Rodrigues Teixeira, 1980-
Publicado em: (2019)
article A cognitive neuroscience view of voice-processing abnormalities in schizophrenia: a window into auditory verbal hallucinations?
por: Conde, Tatiana
Publicado em: (2016)
por: Conde, Tatiana
Publicado em: (2016)
groups An automatic voice pleasantness classification system based on prosodic and acoustic patterns of voice preference
por: Coelho, L.
Publicado em: (2011)
por: Coelho, L.
Publicado em: (2011)
article Podcast e Vodcast : o potencial da ferramenta VoiceThread
por: Bottentuit Junior, João Batista
Publicado em: (2009)
por: Bottentuit Junior, João Batista
Publicado em: (2009)
article A Markov chain analysis of emotional exchange in voice-to-voice communication: testing for the mimicry hypothesis of emotional contagion
por: Lopes, M.
Publicado em: (2015)
por: Lopes, M.
Publicado em: (2015)
groups The impact of a country's economic factors on housing prices: the case of Portugal
por: Rehman, Saira
Publicado em: (2020)
por: Rehman, Saira
Publicado em: (2020)
article Risk factors of peri-implant pathology
por: Nobre, Miguel de Araújo
Publicado em: (2015)
por: Nobre, Miguel de Araújo
Publicado em: (2015)
article Ex vivo exposure to titanium dioxide and silver nanoparticles mildly affect sperm of gilthead seabream (Sparus aurata) - A multiparameter spermiotoxicity approach
por: Carvalhais, A.
Publicado em: (2022)
por: Carvalhais, A.
Publicado em: (2022)
article Short-term Feature Space and Music Genre Classification
por: Marques, Gonçalo
Publicado em: (2011)
por: Marques, Gonçalo
Publicado em: (2011)
groups Constitutive models for numerical analysis of the short- and long-term behavior of geosynthetics and mechanical damage
por: Lombardi, Giovani
Publicado em: (2022)
por: Lombardi, Giovani
Publicado em: (2022)
Registos relacionados
-
article Algorithm for Jitter and Shimmer Measurement in Pathologic Voices
por: Teixeira, João Paulo
Publicado em: (2016) -
groups Low band continuous speech system for voice pathologies identification
por: Cordeiro, Hugo
Publicado em: (2018) -
article Long short term memory on chronic laryngitis classification
por: Guedes, Victor
Publicado em: (2018) -
article Clustering of voice pathologies based on sustained voice parameters
por: Oliveira, Alessa Anjos de
Publicado em: (2020) -
groups Voice Pathologies Identification Speech signals, features and classifiers evaluation
por: Cordeiro, Hugo
Publicado em: (2015)