Browse Prior Art Database

A PARSER FOR OPTICAL DOCUMENT LAYOUT

IP.com Disclosure Number: IPCOM000027343D
Original Publication Date: 1996-Apr-30
Included in the Prior Art Database: 2004-Apr-07

Publishing Venue

Xerox Disclosure Journal

Abstract

Proposed is a method for document processing using an attributed ambiguous grammar, as a representation, and preferences to determine the best or optimal results. The grammar possess finite-state machines and the preference logic compares values in two different possibilities to indicate which possibility is better. The proposed method comprises the steps of: describing the possible resultant document structures (such as layout) using the ambiguous grammar; defining preferences among the the possible results; and parsing the input according to the ambiguous grammar to determine possible results, while using the preferences to eliminate less desirable or sub-optimal results.