Speech Recognition Method Using Multiple Fenemic Baseforms of HMM
Original Publication Date: 1987-Nov-01
Included in the Prior Art Database: 2005-Feb-02
This article describes a modified speech recognition method of the fenemic baseform type that can reflect context-dependent speech fluctuation at each frame of a word with only a small increase in computation time. The basic idea is to prepare multiple time-aligned fenemic baseforms for each word and to use them in combination at decoding time. To combine the baseforms, one of the following two methods is applied at each frame: 1) the parameters of the corresponding fenemic phone machines are averaged, or 2) the maximum probability for the observed label among the corresponding fenemic phone machines is used. The proposed system consists of a training session and a decoding session. In the training session, as shown in Fig. 1, multiple baseforms are constructed for each word from utterances of that word.
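The two per-frame combination rules described above can be illustrated with a minimal Python sketch. It is not the article's implementation. It assumes that each fenemic phone machine is reduced to a single output distribution over labels, that transition probabilities are ignored, and that exactly one observed label corresponds to each baseform position. The phone names, probability values, and function names are hypothetical and chosen only for illustration.

```python
import math

# Hypothetical output distributions: phone id -> P(label | phone) for labels 0..2.
PHONE_OUTPUT = {
    "A": [0.7, 0.2, 0.1],
    "B": [0.1, 0.6, 0.3],
    "C": [0.2, 0.2, 0.6],
}

def combine_average(dists, label):
    """Method 1 (simplified): average the output probabilities of the
    corresponding phone machines, then read off the observed label."""
    return sum(d[label] for d in dists) / len(dists)

def combine_max(dists, label):
    """Method 2: take the maximum probability of the observed label
    over the corresponding phone machines."""
    return max(d[label] for d in dists)

def score_word(baseforms, labels, combine, phone_output=PHONE_OUTPUT):
    """Log-score of a label sequence against several time-aligned baseforms.

    baseforms: list of equal-length lists of phone ids (one list per baseform).
    labels:    observed label sequence, assumed to have the same length.
    combine:   combine_average or combine_max.
    """
    if any(len(b) != len(labels) for b in baseforms):
        raise ValueError("baseforms must be time-aligned with the labels")
    total = 0.0
    for t, label in enumerate(labels):
        # Collect the phone machines that all baseforms place at frame t.
        dists = [phone_output[b[t]] for b in baseforms]
        # Floor the probability so that log() never receives zero.
        total += math.log(max(combine(dists, label), 1e-12))
    return total

# Two hypothetical baseforms for one word; they differ at the middle frame.
baseforms = [["A", "B", "C"], ["A", "C", "C"]]
labels = [0, 2, 2]

avg_score = score_word(baseforms, labels, combine_average)
max_score = score_word(baseforms, labels, combine_max)
```

In this sketch the extra cost over a single baseform is one additional table lookup per baseform per frame, which is in line with the small increase in computation time claimed above. Under these assumptions the maximum rule never scores lower than the averaging rule, since a maximum is at least as large as a mean.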