Speech Compression by Phoneme Recognition
Original Publication Date: 1982-Nov-01
Included in the Prior Art Database: 2005-Feb-10
This invention is directed to a method for lossy batch mode speech wave compression in which an encoding device is trained to establish best match criteria between each wave pattern in a reference speech wave and each phoneme. There is also generated a Markov model defining states and transitions with respect to the reference speech wave. Next, subsequently applied speech waves are parsed into segments based upon global utterance optimizations and comparisons correlated to information stored in the trained encoder. Lastly, the speech segments are converted into a phoneme sequence in corresponding energy/time factors which are then recorded. The selection of the phoneme as the audio measure of interest and the compression of the subsequent speech wave by way of recognition are the critical observations.