Browse Prior Art Database

Structure of the Dictionary for Multi-Term Post-Process

IP.com Disclosure Number: IPCOM000113730D
Original Publication Date: 1994-Sep-01
Included in the Prior Art Database: 2005-Mar-27
Document File: 2 page(s) / 47K

Publishing Venue

IBM

Related People

Kita, Y: AUTHOR

Abstract

Disclosed is a structure of the dictionary used for post-processing the multi-term words in the Optical Character Recognition system. It has the TRIE structure. In other words it has the tree-structure whose nodes contain some single characters. The root node contains all the initial letters of the words contained in the dictionary. By tracing this tree and picking up one character in each node, one path gives one word. This structure can reduce the disk space for the dictionary containing pretty many words compared with the flat structure that contains each word as itself.

This text was extracted from an ASCII text file.
This is the abbreviated version, containing approximately 100% of the total text.

Structure of the Dictionary for Multi-Term Post-Process

      Disclosed is a structure of the dictionary used for
post-processing the multi-term words in the Optical Character
Recognition system.  It has the TRIE structure.  In other words it
has the tree-structure whose nodes contain some single characters.
The root node contains all the initial letters of the words contained
in the dictionary.  By tracing this tree and picking up one character
in each node, one path gives one word.  This structure can reduce the
disk space for the dictionary containing pretty many words compared
with the flat structure that contains each word as itself.