A Multi-Lingual Speaker-Independent Voice Name Dialing System
Original Publication Date: 2004-Nov-25
Included in the Prior Art Database: 2004-Nov-25
New technologies such as grapheme-to-phoneme conversion (G2P) make it possible to access the phone book or address book of a mobile phone by voice without any training (say-in). The user presses the push-to-speak (PTS) button and speaks the name of a contact; once speech recognition has run, a phone call to the respective person is established. This functionality is called "speaker-independent voice name dialing" (SI-VND): no say-in, i.e. no training of the names to be recognized, is necessary, and voice access to the names is available out of the box.

The underlying technology of SI-VND is G2P. G2P is often implemented as a set of rules or as a neural network trained to convert words, or parts of words, into a phonemic description. This conversion is also called "transcription". The resulting phonemic description gives the speech recognizer the information it needs to recognize the name.

Because G2P generates a phonemic representation of each name, SI-VND is highly language-dependent (LD). In state-of-the-art implementations, the speech recognition (and/or the dialogue between user and device in general) is designed to take place in one particular language, such as German. For a German system, this means that the name "Peter" is automatically transcribed into the SAMPA (Speech Assessment Methods Phonetic Alphabet) phoneme string /p e: t 6/.
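The rule-based variant of G2P mentioned above can be illustrated with a minimal sketch. The rule table and the function name `transcribe` are hypothetical and cover only the letters needed for the "Peter" example; a real G2P component would use a far larger, context-sensitive rule set or a trained neural network. Each language would need its own table, which is why the transcription is language-dependent.

```python
# Toy German grapheme-to-phoneme rules, written in SAMPA.
# Hypothetical and deliberately incomplete: just enough for "Peter".
# Each rule is (grapheme, phoneme, applies_only_at_word_end).
# Rules are tried in order, so longer graphemes come first.
GERMAN_RULES = [
    ("er", "6", True),    # word-final "er" is vocalized to /6/
    ("p", "p", False),
    ("t", "t", False),
    ("e", "e:", False),   # simplification: every other "e" is long
]


def transcribe(name, rules=GERMAN_RULES):
    """Convert a written name into a space-separated SAMPA phoneme string."""
    word = name.lower()
    phonemes = []
    i = 0
    while i < len(word):
        for grapheme, phoneme, final_only in rules:
            at_word_end = i + len(grapheme) == len(word)
            if word.startswith(grapheme, i) and (at_word_end or not final_only):
                phonemes.append(phoneme)
                i += len(grapheme)
                break
        else:
            # No rule matched the current position.
            raise ValueError(f"no rule for {word[i]!r} in {name!r}")
    return " ".join(phonemes)


print(transcribe("Peter"))  # p e: t 6
```

The resulting string is what would be handed to the speech recognizer as the pronunciation of the contact name, so no say-in by the user is required.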