Browse Prior Art Database

Speech Recognition Method Using Multiple Fenemic Baseforms of HMM

IP.com Disclosure Number: IPCOM000040386D
Original Publication Date: 1987-Nov-01
Included in the Prior Art Database: 2005-Feb-02

Publishing Venue

IBM

Related People

Authors:
Nishimura, M Toshioka, K [+details]

Abstract

This article describes modified speech recognition of a fenemic baseform type, which can reflect context-dependent speech fluctuation at each frame in a word with a small increase of computation time. The basic idea of this approach is to prepare multiple time-aligned fenemic baseforms and use them in a combined way at decoding time. For combining the multiple baseforms, one of the following two methods is applied at each frame. 1) Use of averaged parameters of corresponding fenemic phone machines. 2) Use of the maximum probability for the observed label. The system of the proposed method consists of a training session and a decoding session. In the training session, as shown in Fig. 1, multiple baseforms are constructed from utterances for each word.