Browse Prior Art Database

Conversion of final form data, such as AFP, to XML

IP.com Disclosure Number: IPCOM000013719D
Original Publication Date: 2001-Apr-07
Included in the Prior Art Database: 2003-Jun-18
Document File: 5 page(s) / 66K

Publishing Venue

IBM

Abstract

Described herein are 2 methods of converting final form data into XML. The advantage is that a final data format is converted into a flexible intermediate data format. Advanced Function Presentation (AFP) data is final format, ready to print or archive. AFP is not considered available for further downstream processes as XML data is. Converting AFP to XML opens the data in the AFP data stream to additional uses in eBusiness, publishing, etc. Currently there are a number of businesses that generate AFP for printing that would also like to produce XML out of this data for eBusiness applications. There are 2 different mechanisms that can be employed to accomplish the above: 1. Manual Mapping of AFP Physical Areas to XML Elements.

This text was extracted from a PDF file.
This is the abbreviated version, containing approximately 42% of the total text.

Page 1 of 5

Conversion of final form data, such as AFP, to XML

Described herein are 2 methods of converting final form data into XML. The advantage is that a final data format is converted into a flexible intermediate data format.

Advanced Function Presentation (AFP) data is final format, ready to print or archive. AFP is not considered available for further downstream processes as XML data is. Converting AFP to XML opens the data in the AFP data stream to additional uses in eBusiness, publishing, etc. Currently there are a number of businesses that generate AFP for printing that would also like to produce XML out of this data for eBusiness applications.

There are 2 different mechanisms that can be employed to accomplish the above:

1. Manual Mapping of AFP Physical Areas to XML Elements.

In this implementation, an AFP document exists with data to be extracted into an XML document. For example, the AFP document might look like the example below:

---------------------------

Bill Smith

IBM

----------------------------

The goal would be to get the Name and the Company out of the AFP document and put it into a convenient structure, suitable for re-purposing. XML is such a structure and if this information is extracted it can be put it into an XML document of the form:

<document>

<Name> Bill Smith </Name>

<Company> IBM </Company> </document>

Implementation of this procedure is accomplished using software that works as described below:

------------------------------------------------------------------------------------------------------

1

Page 2 of 5

----

First: ------------------------------------------------------------------------------------------------------ ----

A.) The user would work with a GUI, that could view an AFP document. Areas of the document containing data or graphics would be highlighted by the user with the mouse by selection or rubberbanding. The areas would be processed one at a time. The areas could also be indicated in a more manual way by recording within a file, etc.

B.) These identified areas would be mapped into tag names. In the above example, if the user highlighted Bill Smith, they would then enter the tag name "Name". "Bill Smith" would be the data associated with the tag "Name". The position of the data "Bill Smith" would be recorded in a file and it would associate in that file with the tag name "Name".

C.) If the user went through the entire document a file could be built up that associates each location (position and size) with an XML tag. In some cases perhaps tags would be repeated, that is, 2 locations would use the same tag, resulting in an XML structure like:

<document>

<Name> Bill Smith </Name>

<Company> IBM </Company>

<Company> ABC </Company> </document>

From a page that looks like:

---------------------------

Bill Smith

IBM

ABC

----------------------------

But that should not be any problem.

D.) Now each physical location in the AFP document is mapped to a location in the XML document hierarchy.

2

Page 3 of...