A method to process XML documents with preserving their representation

IP.com Disclosure Number: IPCOM000015025D
Original Publication Date: 2001-Oct-22
Included in the Prior Art Database: 2003-Jun-20
Document File: 1 page(s) / 41K

Publishing Venue

IBM

Abstract

Disclosed is a method for processing certain parts of an XML document without breaking the representation of the rest of the document.

This is the abbreviated version, containing approximately 57% of the total text.

An XML document is usually processed by an XML processor. However, the processor may not preserve the document's representation, such as whitespace, quotation marks, and the order of attributes, so its use may cause problems for certain applications. For example, a digital signature on an XML document will be broken if the document contains information items that the processor removes.
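The effect can be reproduced with a short sketch. It assumes Python's standard `xml.etree.ElementTree` as the XML processor and uses a SHA-256 digest of the serialized bytes as a stand-in for a digital signature; the sample document and the `digest` helper are illustrative and do not come from the disclosure.

```python
import hashlib
import xml.etree.ElementTree as ET

# Illustrative document with representation details a processor may not keep:
# a double space, single-quoted attribute, and a space before '>' in the end tag.
original = "<doc  b='2' a=\"1\"><item>text</item ></doc>"


def digest(text: str) -> str:
    """Stand-in for a signature: a hash over the serialized bytes."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


# Round trip through the processor: parse, then serialize again.
root = ET.fromstring(original)
reserialized = ET.tostring(root, encoding="unicode")

print(original)
print(reserialized)
print("same bytes:", original == reserialized)
print("same digest:", digest(original) == digest(reserialized))
```

The parsed content is unchanged, but the serialized form differs, so any check computed over the original bytes no longer matches.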

To avoid this problem, XML Signature [XML-Signature] specifies several canonicalization methods. However, the decision to use them is left to users, so it is possible that none of them is applied. In that case, a digital signature may still be broken even if only reversible processing (e.g., XML Encryption [XML-Encryption]) is performed after signing.
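As a rough illustration of what canonicalization buys, the sketch below uses `xml.etree.ElementTree.canonicalize` (Python 3.8 or later). Note the assumption: that function implements Canonical XML 2.0, which is not necessarily one of the methods listed in XML Signature, so it is only a stand-in here. The two sample strings are illustrative.

```python
import xml.etree.ElementTree as ET

# Two serializations of the same content with different representations.
signed_form = "<doc  b='2' a=\"1\"><item>text</item ></doc>"
received_form = '<doc a="1" b="2"><item>text</item></doc>'

# Canonicalization maps both to one normalized serialization.
canonical_signed = ET.canonicalize(xml_data=signed_form)
canonical_received = ET.canonicalize(xml_data=received_form)

print("raw forms equal:", signed_form == received_form)
print("canonical forms equal:", canonical_signed == canonical_received)
```

A signature computed over the canonical form survives re-serialization, but only if the signer chose to canonicalize in the first place.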

Our method uses an XML processor to process certain parts of an XML document but does not break the representation of the rest of the document. The method assumes that the processor can provide "range information": for each information item of the document (e.g., an element or attribute), a pair of starting and ending positions within the serialized form of the document. Given the processing that the processor already performs, this assumption is quite reasonable. With the as...
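A minimal sketch of the idea, not the disclosed implementation: it assumes Python's `xml.parsers.expat` as the processor and derives element ranges from its `CurrentByteIndex` attribute. The helper name `element_ranges` and the sample document are hypothetical, and the sketch handles only elements written with separate start and end tags (no `<a/>` forms, no attribute ranges).

```python
import xml.parsers.expat


def element_ranges(data: bytes):
    """Return (name, start, end) byte ranges for elements in `data`.

    Assumption: every element has an explicit end tag; empty-element
    tags such as <a/> are not handled by this sketch.
    """
    ranges = []
    open_starts = []  # start offsets of the elements currently open
    parser = xml.parsers.expat.ParserCreate()

    def on_start(name, attrs):
        # During the callback, CurrentByteIndex is the offset of the '<'
        # that opens the start tag.
        open_starts.append(parser.CurrentByteIndex)

    def on_end(name):
        start = open_starts.pop()
        # CurrentByteIndex is the offset of '</'; the range ends just
        # after the '>' that closes the end tag.
        end = data.index(b">", parser.CurrentByteIndex) + 1
        ranges.append((name, start, end))

    parser.StartElementHandler = on_start
    parser.EndElementHandler = on_end
    parser.Parse(data, True)
    return ranges


# Illustrative document with unusual spacing and quoting outside the target.
doc = b"<order  id='7'>\n  <item >pen</item>\n  <card>1234</card>\n</order >"

ranges = element_ranges(doc)
name, start, end = next(r for r in ranges if r[0] == "card")

# Replace only the bytes in the target range; everything else is copied as-is.
patched = doc[:start] + b"<card>****</card>" + doc[end:]
print(patched.decode("ascii"))
```

Because the bytes outside the selected range are copied rather than re-serialized, the double space, single quotes, and trailing space in `</order >` survive the edit.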