A New Method Using Autocorrelation for Extracting the Fundamental Frequency in Voice Signals
DOI:
https://doi.org/10.5540/tema.2007.08.02.0191
Abstract
This article describes the algorithm for extracting the fundamental frequency of the voice signal used in the implementation of the P-NAV program (Programa Neuro Analizador Vocal), by Brandão (2006). The proposed method builds on the algorithm described by Boersma (1993), which uses the autocorrelation method, and develops four algorithms, thereby yielding a more robust method for correctly marking the periods of the voice signal, even in severely perturbed and diplophonic segments.
References
P. Boersma, Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound, IFA Proceedings, 17 (1993), 97-110.
A. Brandão, F.R. Leta, Usando redes neurais para classificação de padrões de voz, in “XXVII CNMAC - Congresso Nacional de Matemática Aplicada e Computacional”, SBMAC, 2005.
A. Brandão, “Classificação de Vozes Naturais e de Vozes Sintetizadas através de Modelos Mecânicos de Laringe e de Trato Vocal usando Redes Neurais”, Master's dissertation, Universidade Federal Fluminense, Niterói, RJ, 2006.
A. Brandão, E. Cataldo, R. Sampaio, “Análise e Processamento de Sinais”, Course notes, SBMAC, 2005.
J. Cernocky, “Speech Processing Using Automatically Derived Segmental Units”, PhD Thesis, ESIEE, France, 1998.
M.P. Karnell, Laryngeal perturbation analysis: minimum length of analysis window, Journal of Speech and Hearing Research, 34 (1991), 544-548.
A.P. Klapuri, Multiple fundamental frequency estimation based on harmonicity and spectral smoothness, IEEE Transactions on Speech and Audio Processing, 11, No. 6 (2003).
P. Lieberman, Perturbation in vocal pitch, Journal of the Acoustical Society of America, 33 (1961), 597-603.
P. Motlíček, L. Burget, “Reliability Improvement of Speech Pitch Detection Using Paths”, Institute of Radio Electronics, Faculty of Electrical Engineering, TU Brno, 2000.
L.R. Rabiner, et al., A comparative performance study of several pitch detection algorithms, IEEE Transactions on Acoustics, Speech, and Signal Processing, ASSP-24, No. 5 (1976).
D. Talkin, A robust algorithm for pitch tracking (RAPT), in “Speech Coding and Synthesis”, Elsevier, New York, 1995.
D. Wong, R. Lange, I. Titze, C.G. Guo, Mechanisms of Jitter-Induced Shimmer in a driven model of vocal fold vibration, in “NCVS Status and Progress Report”, pp. 33-41, 1995.
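Illustrative sketch
The abstract refers to the autocorrelation method for extracting the fundamental frequency. As a rough illustration of that general idea only, the sketch below picks the lag with the highest normalized autocorrelation inside a plausible pitch range. It is not the four-algorithm method of the article, nor the full procedure of Boersma (1993); the function name `estimate_f0`, the parameters `f0_min` and `f0_max`, and their default values are assumptions made for this example.

```python
import math

def estimate_f0(frame, sample_rate, f0_min=75.0, f0_max=500.0):
    """Return an F0 estimate in Hz for one frame, or None if no peak is found.

    Hypothetical helper: plain normalized autocorrelation peak picking,
    without the windowing corrections or period marking of the cited work.
    """
    n = len(frame)
    mean = sum(frame) / n
    x = [s - mean for s in frame]          # remove the DC offset
    energy = sum(s * s for s in x)         # autocorrelation at lag 0
    if energy == 0.0:
        return None                        # silent frame: no pitch
    lag_min = int(sample_rate / f0_max)    # shortest period considered
    lag_max = min(int(sample_rate / f0_min), n - 1)
    best_lag, best_r = None, 0.0
    for lag in range(lag_min, lag_max + 1):
        # Normalized autocorrelation at this lag
        r = sum(x[i] * x[i + lag] for i in range(n - lag)) / energy
        if r > best_r:
            best_lag, best_r = lag, r
    if best_lag is None:
        return None
    return sample_rate / best_lag

# Usage: a synthetic 200 Hz tone sampled at 8 kHz (50 ms frame)
sr = 8000
frame = [math.sin(2 * math.pi * 200 * i / sr) for i in range(400)]
print(estimate_f0(frame, sr))
```

On the synthetic tone the estimate should come out close to 200 Hz. A sketch this simple is prone to octave errors on perturbed or diplophonic voices, which is the kind of difficulty the article's method is meant to address.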
Copyright
Authors of articles published in the journal Trends in Computational and Applied Mathematics retain the copyright of their work. The journal uses Creative Commons Attribution (CC-BY) in published articles. The authors grant the TCAM journal the right to first publish the article.
Intellectual Property and Terms of Use
The content of the articles is the exclusive responsibility of the authors. The Creative Commons Attribution (CC-BY) license used by the journal allows published articles to be reused without permission for any purpose, as long as the original work is correctly cited.
The journal encourages authors to self-archive their accepted manuscripts by publishing them on personal blogs, institutional repositories, and social media, as long as a full citation to the version on the journal's website is included.