Speech Compression by Phoneme Recognition

IP.com Disclosure Number: IPCOM000050545D
Original Publication Date: 1982-Nov-01
Included in the Prior Art Database: 2005-Feb-10

Publishing Venue

IBM

Related People

Authors:
Choy, DM; Jih, CR

Abstract

This invention is directed to a method for lossy, batch-mode compression of speech waves. First, an encoding device is trained to establish best-match criteria between each wave pattern in a reference speech wave and each phoneme. A Markov model defining states and transitions with respect to the reference speech wave is also generated. Next, subsequently applied speech waves are parsed into segments based upon global utterance optimizations and comparisons correlated to the information stored in the trained encoder. Lastly, the speech segments are converted into a phoneme sequence with corresponding energy/time factors, which are then recorded. The critical observations are the selection of the phoneme as the audio measure of interest and the compression of the subsequent speech wave by way of recognition.
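
The steps in the abstract can be loosely illustrated with a toy sketch. This is not the disclosed method, which the abstract does not specify in detail. Everything concrete here is an assumption made for illustration: the one-dimensional per-frame feature, the three phoneme templates in `TEMPLATES`, the flat `SWITCH_COST` standing in for Markov transition costs, and the function names `viterbi_labels` and `compress`. The sketch labels every frame by minimizing a cost over the whole utterance (a Viterbi-style dynamic program), then records each run of identical labels as a phoneme with a duration and a mean energy.

```python
from itertools import groupby

# Hypothetical "trained" templates: one reference feature value per phoneme.
TEMPLATES = {"AH": 1.0, "S": 5.0, "T": 9.0}
# Assumed flat penalty for changing phoneme between adjacent frames; a
# stand-in for the transition costs a real Markov model would supply.
SWITCH_COST = 2.0


def viterbi_labels(features):
    """Label each frame so that the total cost over the utterance is minimal."""
    if not features:
        return []
    names = list(TEMPLATES)
    # cost[p]: lowest total cost of any labelling of the frames so far ending in p
    cost = {p: (features[0] - TEMPLATES[p]) ** 2 for p in names}
    back = []  # back[t][p]: best predecessor label for p at frame t + 1
    for x in features[1:]:
        new_cost, pointers = {}, {}
        for p in names:
            def step(q, p=p):
                return cost[q] + (SWITCH_COST if q != p else 0.0)
            prev = min(names, key=step)
            new_cost[p] = step(prev) + (x - TEMPLATES[p]) ** 2
            pointers[p] = prev
        cost = new_cost
        back.append(pointers)
    # Trace the cheapest path backwards from the best final label.
    path = [min(names, key=cost.get)]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return path


def compress(features, energies):
    """Return (phoneme, duration_in_frames, mean_energy) for each segment."""
    records, start = [], 0
    for phoneme, run in groupby(viterbi_labels(features)):
        length = len(list(run))
        mean_energy = sum(energies[start:start + length]) / length
        records.append((phoneme, length, mean_energy))
        start += length
    return records


if __name__ == "__main__":
    feats = [1.1, 0.9, 1.0, 5.2, 4.8, 9.1, 9.0, 8.9]
    energy = [0.5, 0.5, 0.5, 0.25, 0.25, 0.75, 0.75, 0.75]
    print(compress(feats, energy))
```

In this toy setting eight frames reduce to three records, which is the sense in which recognition acts as compression. The result is lossy because only the phoneme identity, duration, and energy of each segment survive.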