Browse Prior Art Database

Subsampled Prototypes and Weighted Clustering

IP.com Disclosure Number: IPCOM000038506D
Original Publication Date: 1987-Jan-01
Included in the Prior Art Database: 2005-Jan-31

Publishing Venue

IBM

Related People

Authors:
Cohen, JR Nadas, A [+details]

Abstract

The present invention relates to the characterization of the support of the distribution of speech by prototypical points obtained through vector quantization biased to represent a uniform distribution on the support. One method of characterizing speech is by prototypical points (in Euclidean space) determined from a clustering algorithm. The present invention proposes two algorithms which seek to cover the support of speech (like a phonetician) without speech-theoretic notions or biasses. The algorithms are constructed to automatically produce a set of prototypes whose distribution represents, not the complex bumpy distribution of real speech, but the uniform distribution on the support of real speech.