Subsampled Prototypes and Weighted Clustering
Original Publication Date: 1987-Jan-01
Included in the Prior Art Database: 2005-Jan-31
The present invention relates to the characterization of the support of the distribution of speech by prototypical points obtained through vector quantization biased to represent a uniform distribution on the support. One method of characterizing speech is by prototypical points (in Euclidean space) determined from a clustering algorithm. The present invention proposes two algorithms which seek to cover the support of speech (like a phonetician) without speech-theoretic notions or biasses. The algorithms are constructed to automatically produce a set of prototypes whose distribution represents, not the complex bumpy distribution of real speech, but the uniform distribution on the support of real speech.