Dismiss
InnovationQ will be updated on Sunday, Oct. 22, from 10am ET - noon. You may experience brief service interruptions during that time.
Browse Prior Art Database

System and method to compare file content regardless of syntax

IP.com Disclosure Number: IPCOM000236744D
Publication Date: 2014-May-14
Document File: 2 page(s) / 44K

Publishing Venue

The IP.com Prior Art Database

Abstract

Disclosed is a method to compare two documents that have the same content, but different syntax.

This text was extracted from a PDF file.
This is the abbreviated version, containing approximately 52% of the total text.

Page 01 of 2

System and method to compare file content regardless of syntax

File comparison tools can compare two or more files and identify differences . However, such tools are inefficient in comparing files that contain syntax changes when the data content might be the same. Current tools allow the user to customize which parts of the file to compare and/or ignore. However, those sections of the file being compared are expected to follow the same structure in order to provide an accurate comparison. Some Extensible Markup Language (XML) comparison tools allow a certain degree of differences to determine whether files are 'similar'. However, it is not useful if the syntax of the file has changed significantly because it does not make a distinction between syntax changes versus content changes.

The novel contribution is a method to compare two documents that have the same content , but different syntax.

The novel contribution is a smarter file comparison method that detects when two files contain the same content even though the structure or syntax of the data file has significantly changed. The system can analyze the compared files and detect where structure changes have occurred, and thus more accurately compare content.

The advantage of using this method over existing solutions is the ability to compare content even though the file syntax or structure is different. Many times when comparing data files like XML, a user is not interested in syntax or formatting changes in the data file; instead, the main interest is in changes in the dat...