Method on detect duplicate files based on different encoding
Publication Date: 2014-Apr-30
The IP.com Prior Art Database
The data can be stored in different encoding scheme like Unicode, EBCDIC, ASCII, etc. If the data in different encoding scheme are stored in database. It leads to a duplication of storage,and more importantly it will affect the performance and outcome accuracy when searching in the database. This duplication can easily be avoided by checking all the data's encoding scheme before storing into the database.