2015-Jun-29
This disclosure proposes a method and system for exploiting visual face information from images or videos, which can be easily captured by a camera installed on a device such as a cell phone or mounted at a fixed position in a space, e.g. on the ceiling of a room. The identity of the speaker can be inferred by detecting and recognizing his or her face. Leveraging this identity information, the speech recognition method and system can be enhanced in both the training and testing stages, which ultimately improves recognition accuracy.
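
As an illustration only, the idea can be sketched as follows, assuming that a face recognizer has already produced a speaker label (or nothing, if no face was recognized). The class name, method names, and model identifiers are hypothetical and are not taken from the disclosure. In the training stage, utterances are grouped by the face-derived identity so that speaker-specific models could be built; in the testing stage, the same identity selects a speaker-specific model, falling back to a speaker-independent one.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class SpeakerAwareRecognizer:
    """Hypothetical sketch: routes speech data and models by face-derived identity."""

    # Model identifiers are plain strings here; a real system would hold model objects.
    default_model: str = "speaker_independent"
    speaker_models: Dict[str, str] = field(default_factory=dict)
    training_data: Dict[str, List[str]] = field(default_factory=dict)

    def add_training_utterance(self, face_id: Optional[str], utterance: str) -> None:
        # Training stage: group utterances by the identity inferred from the face.
        # Utterances with no recognized face go into a shared "unknown" pool.
        key = face_id if face_id is not None else "unknown"
        self.training_data.setdefault(key, []).append(utterance)

    def select_model(self, face_id: Optional[str]) -> str:
        # Testing stage: prefer a speaker-specific model when the face is recognized
        # and such a model exists; otherwise use the speaker-independent model.
        if face_id is None:
            return self.default_model
        return self.speaker_models.get(face_id, self.default_model)


recognizer = SpeakerAwareRecognizer(speaker_models={"alice": "alice_adapted"})
recognizer.add_training_utterance("alice", "utt_001.wav")
recognizer.add_training_utterance(None, "utt_002.wav")
chosen_for_alice = recognizer.select_model("alice")
chosen_for_stranger = recognizer.select_model("bob")
```

The fallback to a speaker-independent model is a design assumption of this sketch, so that recognition still works when no face is detected or the speaker is not enrolled.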

