Abstract
The pitch-predictor contributes greatly to the efficiency of current analysis-by-synthesis speech coders by mapping the past reconstructed signal into the present. However, for good performance, it is required that its parameters are updated often (once every 2.5-7.5 ms). A slower update rate of the pitch-predictor delay results in time misalignment between the original signal and the pitch-predictor contribution to the reconstructed signal. We introduce a new procedure, that allows a slow update rate of the pitch-predictor parameters without this problem. In this method the original signal is modified in a closed-loop fashion such that the parameter values obtained by interpolation of open-loop estimates form the optimal encoding of the modified signal. This new paradigm is a generalization of the familiar analysis-by-synthesis principle. The generalized analysis-by-synthesis principle can be used for interpolation of both the pitch-predictor delay and gain. We compare, by means of a subjective test, speech signals encoded with different versions of the code-excited linear predictor (CELP) coder. The comparison shows that a pitch predictor exploiting our interpolation strategy, with an update rate of 50 Hz, provides a subjective speech quality similar to a conventional pitch predictor where the parameters are updated for every pitch cycle.
Original language | English (US) |
---|---|
Pages (from-to) | 42-54 |
Number of pages | 13 |
Journal | IEEE Transactions on Speech and Audio Processing |
Volume | 2 |
Issue number | 1 |
DOIs | |
State | Published - Jan 1994 |
Externally published | Yes |
All Science Journal Classification (ASJC) codes
- Software
- Acoustics and Ultrasonics
- Computer Vision and Pattern Recognition
- Electrical and Electronic Engineering