Browse Prior Art Database

Mechanism to add speech recognition to image data for image sequence control

IP.com Disclosure Number: IPCOM000013121D
Original Publication Date: 2003-Jun-13
Included in the Prior Art Database: 2003-Jun-13

Publishing Venue

IBM

Abstract

Image data file and Video formats are enhanced with finite speech grammars. The speech grammars allow the user to control the navigation of a remote camera that can either be real or virtual. For a real camera, the speech grammars capture user's commands to position or move the camera when processed by an Automated Speech Recognizer (ASR) system. For a virtual camera, the speech grammars capture the user's commands to move through a sequence of images stored in a remote database and representing different locations in a virtual world.