Segmenting and Classifying Connected Variable Width Characters
Original Publication Date: 1983-Jan-01
Included in the Prior Art Database: 2005-Feb-06
This invention relates to a method for segmenting and classifying connected variable-width characters. It is premised on the statistical likelihood of there being at least one non-joined character in the document that is included in a prototype library. The method operates on a character stream from left to right and compares a window segment of the first character with the library. The comparison is repeated with a window segment of increasing size until a mismatch occurs, which indicates a character boundary. The same small-to-large window comparisons are then performed on the next character. If no match can be made, a new prototype is added to the library.
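The growing-window comparison described above can be illustrated with a minimal Python sketch. It is not the disclosed implementation, and several details are assumptions made only for illustration: the character stream is modeled as a list of integer column codes, a code of 0 (GAP) marks a blank column between non-joined characters, matching is exact equality rather than a similarity measure, and an unmatched span is stored as a new prototype up to the next blank column. The names longest_match, segment, and GAP are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Prototype = Tuple[int, ...]   # assumed representation: one integer code per pixel column
GAP = 0                       # assumed code for a blank column between non-joined characters


def longest_match(stream: List[int], start: int,
                  library: Dict[str, Prototype]) -> Optional[Tuple[str, int]]:
    """Grow a window from `start` one column at a time.

    Returns (label, width) of the widest prototype fully matched before the
    window stops agreeing with every prototype, or None if nothing matched.
    """
    best = None
    width = 1
    while start + width <= len(stream):
        window = tuple(stream[start:start + width])
        # Prototypes whose leading columns still agree with the window.
        alive = [(label, p) for label, p in library.items() if p[:width] == window]
        if not alive:
            break  # mismatch: the boundary is at the last full match
        for label, p in alive:
            if len(p) == width:
                best = (label, width)
        width += 1
    return best


def segment(stream: List[int], library: Dict[str, Prototype]) -> List[str]:
    """Scan left to right, emitting one label per segmented character."""
    labels = []
    pos = 0
    while pos < len(stream):
        if stream[pos] == GAP:
            pos += 1
            continue
        match = longest_match(stream, pos, library)
        if match is not None:
            label, width = match
        else:
            # Assumed policy: no prototype fits, so the span up to the next
            # blank column becomes a new prototype in the library.
            end = pos
            while end < len(stream) and stream[end] != GAP:
                end += 1
            width = end - pos
            label = f"proto{len(library)}"
            library[label] = tuple(stream[pos:end])
        labels.append(label)
        pos += width
    return labels


library = {"a": (1, 2, 1), "b": (3, 3)}
stream = [1, 2, 1, 3, 3, 0, 4, 5, 0, 4, 5, 1, 2, 1]
print(segment(stream, library))
```

In this example the joined pair "a" and "b" is split at the column where the growing window stops agreeing with any prototype. The isolated shape (4, 5) is learned as a new prototype and is then recognized when it reappears joined to "a". A practical system would presumably replace exact equality with a tolerance-based comparison.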