
A Complementary GUI System using Web Browsers for Voice Application

IP.com Disclosure Number: IPCOM000015540D
Original Publication Date: 2002-Feb-01
Included in the Prior Art Database: 2003-Jun-20
Document File: 4 page(s) / 114K

Publishing Venue

IBM

Abstract

Disclosed is a system for improving the usability of VoiceXML applications by adding a Web browser GUI as a complementary user interface. Such a visual user interface is convenient for some kinds of voice applications, such as a form application with complicated input fields that are difficult to fill in through voice dialogues. For such input fields, keyboard entry or mouse selection is preferable to voice recognition. More specifically, the disclosed method enables users to use complementary Web forms for arbitrary input fields in voice applications without modifying the original VoiceXML documents.

This is the abbreviated version, containing approximately 49% of the total text.


Disclosed is a system for improving the usability of VoiceXML applications by adding a Web browser GUI as a complementary user interface. Such a visual user interface is convenient for some kinds of voice applications, such as a form application with complicated input fields that are difficult to fill in through voice dialogues. For such input fields, keyboard entry or mouse selection is preferable to voice recognition. More specifically, the disclosed method enables users to use complementary Web forms for arbitrary input fields in voice applications without modifying the original VoiceXML documents.

Fig. 1. Typical System Diagram of VoiceXML Applications

Fig. 1 shows a typical system for voice applications that use VoiceXML documents. It consists of a VoiceXML generator (101) acting as the voice application, a voice browser (103), a telephony system (104), and a user's phone (105). First, the user calls the telephony system from the phone, and the telephony system invokes the voice browser. The voice browser determines which application the user wants, for example from the telephone number, from DTMF (dual-tone multi-frequency) button input, or by voice recognition, and sends an application request to the URL (uniform resource locator) of that voice application. The application processes the request, and the VoiceXML generator creates a VoiceXML document. The generated document (102) is sent back to the voice browser, which interprets it and interacts with the user by voice through the telephony system.
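As an illustration of the generator step described above, the following is a minimal sketch of how a VoiceXML generator might build a document containing one form. The function name `generate_voicexml`, the form id, and the field names and prompts are hypothetical and do not come from the disclosure; only the `vxml`, `form`, `field`, and `prompt` elements are standard VoiceXML.

```python
import xml.etree.ElementTree as ET


def generate_voicexml(form_id, fields):
    """Build a minimal VoiceXML document with one form.

    `fields` maps a field name to the prompt text spoken for it.
    (Hypothetical helper, not the disclosed generator.)
    """
    vxml = ET.Element("vxml", {"version": "2.0"})
    form = ET.SubElement(vxml, "form", {"id": form_id})
    for name, prompt_text in fields.items():
        field = ET.SubElement(form, "field", {"name": name})
        prompt = ET.SubElement(field, "prompt")
        prompt.text = prompt_text
    # Serialize to text, as it would be returned to the voice browser.
    return ET.tostring(vxml, encoding="unicode")


# Hypothetical application: a two-field address form.
document = generate_voicexml(
    "address",
    {"city": "Which city?", "street": "Please say your street address."},
)
print(document)
```

A field such as `street` is the kind of input that is awkward to capture by voice recognition, which is the case the disclosure targets.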


[Fig. 2 (image suppressed). Surviving labels: VoiceXML Application, Voice Browser, Telephony System, Intermediary System, VoiceXML, Modified VoiceXML, Phone Line (Voice), Network (Web), Update. Reference numerals: 101, 102, 103, 104, 105, 201, 203, 204.]

The additional intermediary system (201) in Fig. 2 is an implementation of the disclosed system that enables complementary GUI operations without modifying the original application. The system sits between the voice browser (103) and the voice application (101), and provides Web server functions for the Web browsers with which the phones (105) are equipped. The system (201) consists of two subsystems: a transcoder for VoiceXML documents (203) and a Web server for the phone browser (202). The transcoder (203) receives the original VoiceXML documents (102) from the voice application (101), modifies them so that the voice browser (103) can synchronize with the Web server (202), and sends the modified VoiceXML documents (204) to the voice browser. The Web server (202) serves the complementary Web form for the VoiceXML field currently being processed, and updates the voice browser to import the data entered in the Web form. For such synchronizati...
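The Web server's role of serving a complementary form for the field currently being processed could be sketched roughly as follows. This is only an illustration under stated assumptions: the helper `web_form_for_field`, the sample document, and the `/update` action URL are hypothetical, and the sketch covers form generation only, not the synchronization between the voice browser and the Web server.

```python
import html
import xml.etree.ElementTree as ET

# Hypothetical original VoiceXML document (102) from the voice application.
SAMPLE_VXML = """<vxml version="2.0">
  <form id="address">
    <field name="city"><prompt>Which city?</prompt></field>
    <field name="street"><prompt>Please say your street address.</prompt></field>
  </form>
</vxml>"""


def web_form_for_field(vxml_text, field_name, action_url):
    """Return an HTML form for one VoiceXML field, or None if it is absent."""
    root = ET.fromstring(vxml_text)
    for field in root.iter("field"):
        if field.get("name") == field_name:
            # Reuse the spoken prompt as the visible label.
            prompt = field.findtext("prompt", default=field_name).strip()
            return (
                f'<form method="post" action="{html.escape(action_url, quote=True)}">'
                f"<label>{html.escape(prompt)} "
                f'<input type="text" name="{html.escape(field_name, quote=True)}">'
                "</label>"
                '<input type="submit" value="Send">'
                "</form>"
            )
    return None


print(web_form_for_field(SAMPLE_VXML, "street", "/update"))
```

Deriving the form from the original document at serving time is one way to leave the voice application itself unchanged, which matches the stated goal of the intermediary system.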