Publicação

Voice Pathologies Identification Speech signals, features and classifiers evaluation

Ver documento

Detalhes bibliográficos
Resumo:Voice pathology identification using speech processing methods can be used as a preliminary diagnosis. This study implements a set of identification systems to screen voice pathologies using voice signal features from the sustained vowel /a/ and continuous speech. The two signals tasks are evaluated using three acoustic features applied to four classifiers. Three main classes are identified: physiological disorders; neuromuscular disorders; and healthy subjects. The main objective of this work is to evaluate which voice signal is more reliable for voice pathology diagnosis, which acoustic feature has more pathology information and which is the best classifier to carry out this task. The best overall system accuracy is 77.9%, obtained with Mel-Line Spectrum Frequencies (MLSF) feature extracted from continuous speech and applied to a Gaussian Mixture Models (GMM) classifier.
Autores principais:Cordeiro, Hugo
Outros Autores:Fonseca, José; Guimarães, Isabel; Meneses, Carlos
Assunto:Component Voice Pathologies Identification Continuous Speech Sustained Vowel MLSF GMM
Ano:2015
País:Portugal
Tipo de documento:documento de conferência
Tipo de acesso:acesso restrito
Instituição associada:Instituto Politécnico de Lisboa
Idioma:inglês
Origem:Repositório Científico do Instituto Politécnico de Lisboa
Descrição
Resumo:Voice pathology identification using speech processing methods can be used as a preliminary diagnosis. This study implements a set of identification systems to screen voice pathologies using voice signal features from the sustained vowel /a/ and continuous speech. The two signals tasks are evaluated using three acoustic features applied to four classifiers. Three main classes are identified: physiological disorders; neuromuscular disorders; and healthy subjects. The main objective of this work is to evaluate which voice signal is more reliable for voice pathology diagnosis, which acoustic feature has more pathology information and which is the best classifier to carry out this task. The best overall system accuracy is 77.9%, obtained with Mel-Line Spectrum Frequencies (MLSF) feature extracted from continuous speech and applied to a Gaussian Mixture Models (GMM) classifier.