Browse Prior Art Database

Pause Duration Control for Japanese Text-To-Speech System

IP.com Disclosure Number: IPCOM000037174D
Original Publication Date: 1989-Nov-01
Included in the Prior Art Database: 2005-Jan-29
Document File: 2 page(s) / 47K

Publishing Venue

IBM

Related People

Suzuki, K: AUTHOR

Abstract

This article describes a method for controlling pause duration in spoken sentences synthesized by a text-to-speech system. This method is based on analysis of spoken sentences and can produce natural pauses.

This text was extracted from a PDF file.
At least one non-text object (such as an image or picture) has been suppressed.
This is the abbreviated version, containing approximately 80% of the total text.

Page 1 of 2

Pause Duration Control for Japanese Text-To-Speech System

This article describes a method for controlling pause duration in spoken sentences synthesized by a text-to-speech system. This method is based on analysis of spoken sentences and can produce natural pauses.

In Japanese, pause duration is very important for communicating the syntactic and semantic structure of the sentence. Consequently pause duration control is a key to synthesizing natural-sounding spoken sentences.

Conventional methods: There are two typical methods for controlling pause duration. The first method is based solely on punctuation marks; pause duration P is given by: P0 (No punctuation mark)

P =

P1 (Punctuation mark),

where P0 and P1 are constant values of pause duration.

(Image Omitted)

The other method controls pause duration solely by using breath- group length before the pause (L1); pause duration P is given by:

P = a' + b'* L1,

where a' and b' are parameters given by regression

analysis.

However, pause durations generated by using these methods bear little relation to those of natural utterance data. Pause durations assigned by these rules do not contribute to the naturalness of the synthesized speech.

New method: In this new control method, pause duration is calculated by the length of both the breath-groups before and after the pause (L1, L2); pause duration P is given by P = a" + b"( L1 + c"* L2 ), where a", b" and c" are parameters given by regression analysis.

Fig. 2 shows the re...