TY - GEN
T1 - Neural network classifiers and principal component analysis for blind signal to noise ratio estimation of speech signals
AU - Marbach, Matthew
AU - Ondusko, Russell
AU - Ramachandran, Ravi P.
AU - Head, Linda M.
N1 - Copyright:
Copyright 2013 Elsevier B.V., All rights reserved.
PY - 2009
Y1 - 2009
N2 - A blind approach for estimating the signal to noise ratio (SNR) of a speech signal corrupted by additive noise is proposed. The method is based on a pattern recognition paradigm using various linear predictive based features, a neural network classifier and estimation combination. Blind SNR estimation is very useful in speaker identification systems in which a confidence metric is determined along with the speaker identity. The confidence metric is partially based on the mismatch between the training and testing conditions of the speaker identification system and SNR estimation is very important in evaluating the degree of this mismatch. The aim is to correctly estimate SNR values from 0 to 30 dB, a range that is both practical and crucial for speaker identification systems. Speech corrupted by additive white Gaussian noise, pink noise and two types of bandpass channel noise are investigated. The best individual feature is the vector of line spectral frequencies. Combination of the estimates of 3 features lowers the estimation error to an average of 3.69 dB for the four types of noise.
AB - A blind approach for estimating the signal to noise ratio (SNR) of a speech signal corrupted by additive noise is proposed. The method is based on a pattern recognition paradigm using various linear predictive based features, a neural network classifier and estimation combination. Blind SNR estimation is very useful in speaker identification systems in which a confidence metric is determined along with the speaker identity. The confidence metric is partially based on the mismatch between the training and testing conditions of the speaker identification system and SNR estimation is very important in evaluating the degree of this mismatch. The aim is to correctly estimate SNR values from 0 to 30 dB, a range that is both practical and crucial for speaker identification systems. Speech corrupted by additive white Gaussian noise, pink noise and two types of bandpass channel noise are investigated. The best individual feature is the vector of line spectral frequencies. Combination of the estimates of 3 features lowers the estimation error to an average of 3.69 dB for the four types of noise.
UR - http://www.scopus.com/inward/record.url?scp=70350157251&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70350157251&partnerID=8YFLogxK
U2 - 10.1109/ISCAS.2009.5117694
DO - 10.1109/ISCAS.2009.5117694
M3 - Conference contribution
AN - SCOPUS:70350157251
SN - 9781424438280
T3 - Proceedings - IEEE International Symposium on Circuits and Systems
SP - 97
EP - 100
BT - 2009 IEEE International Symposium on Circuits and Systems, ISCAS 2009
T2 - 2009 IEEE International Symposium on Circuits and Systems, ISCAS 2009
Y2 - 24 May 2009 through 27 May 2009
ER -