Browse Prior Art Database

Sentence-end Detector For Language Processing

IP.com Disclosure Number: IPCOM000095444D
Original Publication Date: 1964-Jan-01
Included in the Prior Art Database: 2005-Mar-07
Document File: 2 page(s) / 32K

Publishing Venue

IBM

Related People

Craft, JL: AUTHOR [+2]

Abstract

Language translating machines process the text sentence by sentence. The sentence-end detector is capable of automatically determining whether or not a punctuation mark, detected in a stream of text, indicates the end of a sentence or has some other significance.

This text was extracted from a PDF file.
At least one non-text object (such as an image or picture) has been suppressed.
This is the abbreviated version, containing approximately 100% of the total text.

Page 1 of 2

Sentence-end Detector For Language Processing

Language translating machines process the text sentence by sentence. The sentence-end detector is capable of automatically determining whether or not a punctuation mark, detected in a stream of text, indicates the end of a sentence or has some other significance.

Input data are scanned for terminal punctuation marks at the last stage of a shift register before they are fed to a data store. When such a mark is detected, the remaining data symbols of the shift register are transferred into the store. Other input data are prevented from entering the shift register. A table look-up device contains entries representing every possible form in which a terminal mark appears between data symbols, and for each form indicates with a flag bit whether the mark means end of sentence.

When a succession of symbols in the data store is matched in a compare circuit with an entry in the table look-up device, control C responds to the flag bit. Such is by either feeding the sentence from the data store to the sentence by sentence translator or feeding further input data to the shift register until the next terminal mark is detected.

1

Page 2 of 2

2

[This page contains 1 picture or other non-text object]