System for Automatically and More Efficiently Splitting XML Documents

IP.com Disclosure Number: IPCOM000234745D
Publication Date: 2014-Jan-31
Document File: 3 page(s) / 29K

Publishing Venue

The IP.com Prior Art Database

Abstract

Disclosed are a method and system to use software to automatically split a large document per business needs, before sending it across a network. The software splits an Extensible Markup Language (XML) document in a smart way, without having to split the functional knowledge of the XML entities, resulting in faster and more efficient processing.

This is the abbreviated version, containing approximately 38% of the total text.

Sending large files over the network can be problematic, possibly straining the system to the point of failure. Sometimes it is advantageous to split a file into smaller chunks and then separately send the chunks. This requires re-building the file on the target system.
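As a point of comparison, the simplest form of chunking and rebuilding can be sketched in a few lines of Python. This is only an illustration: the function names `split_bytes` and `rebuild` are hypothetical and do not come from the disclosure. The sketch cuts at fixed byte offsets, so a cut may fall in the middle of an XML element, which is the kind of blind splitting the disclosed method is meant to avoid.

```python
from typing import List


def split_bytes(payload: bytes, chunk_size: int) -> List[bytes]:
    """Cut a payload into fixed-size pieces (the last piece may be shorter)."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    return [payload[i:i + chunk_size] for i in range(0, len(payload), chunk_size)]


def rebuild(chunks: List[bytes]) -> bytes:
    """Rebuild the original payload on the target by concatenating chunks in order."""
    return b"".join(chunks)


if __name__ == "__main__":
    original = b"<order><lineItem>a</lineItem><lineItem>b</lineItem></order>"
    pieces = split_bytes(original, 16)
    # Individual pieces are not well-formed XML, but the rebuilt payload is intact.
    print(len(pieces), rebuild(pieces) == original)
```

The target can do nothing useful with an individual piece until every piece has arrived, because no piece is a valid document on its own.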

One solution, for example in the case of purchase order Extensible Markup Language (XML) files over a certain size, is to split those files. This specific solution requires software on both the source and the target to understand specifics about the file in order to determine the number of splits, where to split, and whether splitting is necessary at all. This can work in a specific case; however, it is not applicable in all instances, because it requires significant changes to the software to meet the needs of each client. The logic, although functional, is bound to the application and cannot be extended to other systems that generate large XML files.

Software is needed that can split the XML intelligently, without splitting apart the functional knowledge carried by the XML entities. Ideally, splitting should be done only as necessary, with the chunks created based on logic and thresholds.

The novel contribution is a method and system that uses software to automatically split a large document according to business needs before sending it across a network. In addition, the method is generic; it is not built for any one application. The method does not need specific rules baked into the software for splitting up and rebuilding a document. The solution is dynamic in the sense that XML files adhering to the design can be created and split differently, depending on the intelligence of the system and the user's configuration.

The solution is a generic approach (i.e., not built for any particular application) in which the schemas are built only once, at design time, and the software automatically determines the splitting points based on the schema. Although the schema designers know the allowable places for a split, the software breaks off pieces only as necessary (e.g., when a threshold is exceeded) and only in allowable places.
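The idea of splitting only when a threshold is exceeded, and only at allowable places, can be illustrated with a minimal Python sketch. Several things here are assumptions rather than details from the disclosure: the set `SPLITTABLE` stands in for whatever split markers the schema would declare, the element names `order`, `header`, and `lineItem` are invented, the threshold is a simple byte count, and the allowable split points are assumed to be direct children of the root. Non-splittable children are copied into every chunk so that each chunk remains a self-contained document.

```python
import copy
import xml.etree.ElementTree as ET
from typing import List

# Hypothetical stand-in for schema-declared split points (not from the disclosure).
SPLITTABLE = {"lineItem"}


def size_of(elem: ET.Element) -> int:
    """Serialized size of an element in UTF-8 bytes."""
    return len(ET.tostring(elem, encoding="unicode").encode("utf-8"))


def split_document(xml_text: str, max_bytes: int) -> List[str]:
    """Split only if the document exceeds max_bytes, and only at SPLITTABLE children."""
    if len(xml_text.encode("utf-8")) <= max_bytes:
        return [xml_text]  # threshold not exceeded: send the document unchanged
    root = ET.fromstring(xml_text)
    fixed = [c for c in root if c.tag not in SPLITTABLE]
    movable = [c for c in root if c.tag in SPLITTABLE]
    if not movable:
        return [xml_text]  # no allowable split point exists

    def new_shell() -> ET.Element:
        # Simplification: fixed children are placed ahead of the movable ones.
        shell = ET.Element(root.tag, root.attrib)
        shell.text = root.text
        shell.extend(copy.deepcopy(c) for c in fixed)
        return shell

    chunks, current, held = [], new_shell(), 0
    for item in movable:
        piece = copy.deepcopy(item)
        current.append(piece)
        held += 1
        # Start a new chunk when this piece pushes the current one over the limit.
        if held > 1 and size_of(current) > max_bytes:
            current.remove(piece)
            chunks.append(current)
            current = new_shell()
            current.append(piece)
            held = 1
    chunks.append(current)
    return [ET.tostring(c, encoding="unicode") for c in chunks]


if __name__ == "__main__":
    doc = ("<order><header>H</header><lineItem>a</lineItem>"
           "<lineItem>b</lineItem><lineItem>c</lineItem></order>")
    for chunk in split_document(doc, 80):
        print(chunk)
```

In this sketch a single splittable element that is larger than the threshold is still sent whole, since no allowable split point exists inside it. A real implementation would read the split points from the schema instead of a hard-coded set.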

This method allows thousands of files to be sent across the network with no function-specific splitting logic beyond the generic solution described herein. The generic solution addresses the example purchase order issue without logic that understands a company's specific purchase orders, and the same method applies to other situations in which large XML files must be sent.

The method employs a set of schema tags, together with the intelligence to use the new tags efficiently, to split the XML file automatically in the places appropriate for the business in cases where a multitude of thresholds is exceeded. The creation of the tags is simple, while building the system intelligence to use the tags in the b...