Voice Identification Based Programmable Speech Data Filter

IP.com Disclosure Number: IPCOM000018949D
Original Publication Date: 2003-Aug-21
Included in the Prior Art Database: 2003-Aug-21

Publishing Venue

IBM

Abstract

Speech products, from cell phones to Automatic Speech Recognition (ASR) applications, require high-quality audio input to achieve optimal performance. Applications where the user is some distance from the microphone, known as far-field microphone applications, are turning to microphone arrays to achieve the necessary audio quality, specifically a high Signal-to-Noise Ratio (SNR). To optimize SNR, the microphone array's Digital Signal Processor (DSP) adjusts its parameters to change the array's sensitivity profile. In essence, a high-sensitivity beam is directed at the intended speaker, and low-sensitivity areas, or nulls, are directed toward detected noise sources. When the speaker of interest is one of several people in proximity to the array, as in a conference room setting, it is not always clear at whom the array's beam should be aimed. One way to determine which of several speakers is the speaker of interest, and therefore the direction in which to steer the beam, is through the use of Automatic Speaker Identification (ASI).
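The steering idea described above can be illustrated with a minimal sketch. It is not the disclosed implementation: it assumes a uniform linear array, a simple delay-and-sum beamformer, and a hypothetical ASI front end that has already labeled each detected source direction with a speaker identity and a confidence score. The function names, the candidate tuple layout, the sign convention for the delays, and the example values are all assumptions made for illustration.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, approximate value at room temperature


def pick_beam_direction(candidates, target_speaker):
    """Return the angle (degrees) of the best-scoring source that the
    hypothetical ASI stage attributed to target_speaker, or None.

    candidates: list of (angle_deg, speaker_id, score) tuples (assumed format).
    """
    matches = [c for c in candidates if c[1] == target_speaker]
    if not matches:
        return None
    return max(matches, key=lambda c: c[2])[0]


def steering_delays(num_mics, spacing_m, angle_deg):
    """Per-microphone delays (seconds) that align a plane wave arriving from
    angle_deg (0 = broadside) on a uniform linear array, as used by a
    delay-and-sum beamformer. Delays are shifted so the smallest is zero.
    """
    theta = math.radians(angle_deg)
    raw = [m * spacing_m * math.sin(theta) / SPEED_OF_SOUND
           for m in range(num_mics)]
    base = min(raw)
    return [d - base for d in raw]


# Example: two sources were attributed to "alice"; steer at the more confident one.
candidates = [(-40.0, "alice", 0.91), (10.0, "bob", 0.87), (35.0, "alice", 0.42)]
angle = pick_beam_direction(candidates, "alice")
delays = steering_delays(num_mics=4, spacing_m=0.05, angle_deg=angle)
```

In this sketch the ASI result only selects the look direction; null placement toward noise sources, which the abstract also mentions, would require an adaptive beamformer and is not shown.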