Dismiss
InnovationQ will be updated on Sunday, Oct. 22, from 10am ET - noon. You may experience brief service interruptions during that time.
Browse Prior Art Database

Canonical XML Version 1.0 (RFC3076)

IP.com Disclosure Number: IPCOM000005270D
Original Publication Date: 2001-Mar-01
Included in the Prior Art Database: 2001-Aug-20
Document File: 29 page(s) / 64K

Publishing Venue

Internet Society Requests For Comment (RFCs)

Related People

J. Boyer: AUTHOR

Abstract

Any XML (Extensible Markup Language) document is part of a set of XML documents that are logically equivalent within an application context, but which vary in physical representation based on syntactic changes permitted by XML 1.0 and Namespaces in XML. This specification describes a method for generating a physical representation, the canonical form, of an XML document that accounts for the permissible changes. Except for limitations regarding a few unusual cases, if two documents have the same canonical form, then the two documents are logically equivalent within the given application context. Note that two documents may have differing canonical forms yet still be equivalent in a given context based on application-specific equivalence rules for which no generalized XML specification could account.

This text was extracted from a ASCII document.
This is the abbreviated version, containing approximately 5% of the total text.

Network Working Group J. Boyer Request for Comments: 3076 PureEdge Solutions Inc. Category: Informational March 2001

Canonical XML Version 1.0

Status of this Memo

This memo provides information for the Internet community. It does not specify an Internet standard of any kind. Distribution of this memo is unlimited.

Copyright Notice

Copyright (C) The Internet Society (2001). All Rights Reserved.

Abstract

Any XML (Extensible Markup Language) document is part of a set of XML documents that are logically equivalent within an application context, but which vary in physical representation based on syntactic changes permitted by XML 1.0 and Namespaces in XML. This specification describes a method for generating a physical representation, the canonical form, of an XML document that accounts for the permissible changes. Except for limitations regarding a few unusual cases, if two documents have the same canonical form, then the two documents are logically equivalent within the given application context. Note that two documents may have differing canonical forms yet still be equivalent in a given context based on application-specific equivalence rules for which no generalized XML specification could account.

Boyer Informational [Page 1]

RFC 3076 Canonical XML March 2001

Table of Contents

1. Introduction............................................... 2 1.1 Terminology............................................... 3 1.2 Applications.............................................. 4 1.3 Limitations............................................... 4 2. XML Canonicalization....................................... 6 2.1 Data Model................................................ 6 2.2 Document Order............................................ 10 2.3 Processing Model.......................................... 10 2.4 Document Subsets.......................................... 13 3. Examples of XML Canonicalization........................... 14 3.1 PIs, Comments, and Outside of Document Element............ 14 3.2 Whitespace in Document Content............................ 15 3.3 Start and End Tags........................................ 16 3.4 Character Modifications and Character References.......... 17 3.5 Entity References......................................... 19 3.6 UTF-8 Encoding............................................ 19 3.7 Document Subsets.......................................... 20 4. Resolutions................................................ 21 4.1 No XML Declaration........................................ 21 4.2 No Character Model Normalization.......................... 21 4.3 Handling of Whitespace Outside Document Element........... 22 4.4 No Namespace Prefix Rewriting............................. 22 4.5 Order of Namespace Declarations and Attributes............ 23 4.6 Superfluous Namespace Declarations........................ 23 4.7 Propagation of...