"What did the speaker say?" Button

IP.com Disclosure Number: IPCOM000245505D
Publication Date: 2016-Mar-13
Document File: 2 page(s) / 121K

Publishing Venue

The IP.com Prior Art Database

Abstract

Disclosed is a mechanism that provides video users with a "What did the speaker say?" button. This button returns playback to the point in the video just before the speaker’s last phrase, so that the user can both hear the misunderstood speech again and see closed captioning for it.

This text was extracted from a PDF file.
This is the abbreviated version, containing approximately 51% of the total text.

While watching a video, a user might not hear exactly what the speaker said because of background noise, a language difference or accent, or a low voice. The user might then attempt to rewind the video in order to hear the sound bite again. The problem is that current methods for returning to a specific segment of the video, such as a sliding bar or arrows, are not precise enough, so the user repeatedly moves too far in one direction or the other and misses the sound bite. This wastes the user's time and makes the clip or movie take longer to watch. Another problem is that even with repeated rewinds, a listener may still be unable to understand the spoken words, or may need more than one rewind to do so.

The novel contribution is a mechanism that provides video users with a "What did the speaker say?" button. This button enables users to return to the place in the video just prior to the speaker's last phrase. The mechanism skips back in the video at increments defined not by static time lengths, but by sentence breaks, changes in speakers, or screen changes. The capability detects who the last speaker was and returns to the immediately previous sentence. In addition, the system may automatically enable closed captions (if not previously enabled) for the duration of the repeated phrase or up to the previous point of playback, or conditionally enable closed captioning during a rewind based on detecting that a user may have trouble understanding spoken words in the audio.
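The rewind behavior described above can be illustrated with a minimal sketch. It assumes that a separate detection step (not shown) has already produced a sorted list of breakpoint times in seconds. The names plan_rewind, RewindPlan, and the grace parameter are hypothetical and introduced only for illustration. The grace window is likewise an assumption: a press that occurs very shortly after a breakpoint is treated as a request for the phrase that just ended.

```python
from bisect import bisect_left
from dataclasses import dataclass


@dataclass
class RewindPlan:
    seek_to: float         # time (s) at which playback resumes
    captions_until: float  # captions stay enabled until this time (s)


def plan_rewind(breakpoints: list[float], position: float,
                grace: float = 0.5) -> RewindPlan:
    """Pick the breakpoint to return to when the button is pressed.

    breakpoints: sorted times (s) of sentence breaks, speaker changes,
                 or scene changes; how they are detected is out of scope.
    position:    playback time (s) at the moment of the button press.
    grace:       hypothetical tolerance; a breakpoint reached less than
                 `grace` seconds ago is skipped so that the phrase that
                 just ended is replayed instead of the one just begun.
    """
    # Index of the first breakpoint at or after (position - grace).
    i = bisect_left(breakpoints, position - grace)
    # The breakpoint before that index starts the phrase to replay;
    # fall back to the start of the video if there is none.
    seek_to = breakpoints[i - 1] if i > 0 else 0.0
    # Captions cover the repeated span, up to the earlier playback point.
    return RewindPlan(seek_to=seek_to, captions_until=position)


plan = plan_rewind([0.0, 4.2, 9.8, 12.0], position=12.3)
print(plan.seek_to, plan.captions_until)
```

In this example the button is pressed 0.3 seconds after the breakpoint at 12.0, so playback returns to 9.8 rather than 12.0, and captions remain on until the original position of 12.3 is reached again.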

Instead of using 5- or 10-second increments for rewind, this mechanism rewinds using breakpoints determined by natural pauses in speech, scene changes, and other transition points in the video and audio stream. (The methods to detect scene changes, speaker changes, or sentence breaks are enabling art not claimed in th...