Browse Prior Art Database

Document Format Definition Method

IP.com Disclosure Number: IPCOM000016040D
Original Publication Date: 2002-Nov-01
Included in the Prior Art Database: 2003-Jun-21
Document File: 3 page(s) / 71K

Publishing Venue

IBM

Abstract

[Title] OCR Form Definition Method [Outline] On an OCR form definition program, it was a quite burden to input field names of OCR (Optical Character Recognition) forms with keyboad.

This text was extracted from a PDF file.
At least one non-text object (such as an image or picture) has been suppressed.
This is the abbreviated version, containing approximately 59% of the total text.

Page 1 of 3

Document Format Definition Method

[Title]

OCR Form Definition Method

[Outline]

On an OCR form definition program, it was a quite burden to input field
names of OCR (Optical Character Recognition) forms with keyboad.

By recognizing characters in a form image, in this method, the burden can
be reduced.

[Problem to be solved]

Reducing the burden of field name input operations for OCR form definition.

[The method]

Field name input operations are supported by recognizing characters in a
form image.

1. Search and recognize character groups in a form image and evaluate
them. And make the most suitable one be a field name.
(A character group includes one or more words.)
2. Also display the other candidates in a list box. The listed candidates
are also displayed in a form image with some markings.

   Operator can select a appropriate candidate from the list box or image
markings.

3. When the appropriate one is not listed, a new character group in the
image can be selected with a mouse or something. It is recognized and
inputted as a field name.
4. By choosing plural candidates continuously, a compound field name can
also be inputted easily.

[A sample case]

1. Character groups near the field which is going to be defined is
searched, and the found character groups are recognized. And the found
candidates are surrounded by red frames. (Fig. 1)
2. The most suitable candidate is displayed on a field name input area.
(Fig. 2)
3. If the field name in the field name input area is appropreate, it wil...