Browse Prior Art Database

Utilization of Unicode Conversion Table in UCS-2 format into UCS-4 format

IP.com Disclosure Number: IPCOM000215861D
Publication Date: 2012-Mar-14
Document File: 1 page(s) / 38K

Publishing Venue

The IP.com Prior Art Database

Abstract

Disclosed is an extended utilization of Unicode Conversion Table in UCS-2 format (Basic Multilingual Plane) into UCS-4 format (Plane 1~16) for code conversion programs, without changing the basic structure of the table. The way to refer to the table by code values made it possible.

This text was extracted from a PDF file.
This is the abbreviated version, containing approximately 52% of the total text.

Page 01 of 1

Utilization of Unicode Conversion Table in UCS -2 format into UCS-4 format

Disclosed is an extended utilization of Unicode Conversion Table in UCS-2 format (Basic Multilingual Plane) into UCS-4 format (Plane 1~16) for code conversion programs, without changing the basic structure of the table. The way to refer to the table by code values made it possible.


- The problem solved by this invention
It has not been easy to implement Unicode conversion for the range of Plane 1 to 16 (0x10000 - 0x10FFFF) when the coverage of Unicode used in a system was expanded from Basic Multilingual Plane (BMP), which is 0xFFFF or less.


- Known solutions


We usually consider modifying the Unicode conversion table itself if the range of Unicode used in a system was expanded from BMP. e.g. Modifying the table from 0xFFFF (for JISX0208) to 0x2FFFF (for JISX0213).

However, it requires too much modifications such as changing the table structure itself, the logic of creating the table and programming interface which refers to the table, since the binary BMP Unicode conversion table uses the value range within 0xFFFF for index creation and mapping layout.


- How the invention works


Applying the characteristic of UCS-4 format, Unicode conversion tables will be created per planes in UCS-2 format and distinguished by file names. And it makes extended use in UCS-4 format possible with minimum changes. Because it requires changing neither the binary format of conversion table nor the interface of code conversion program which...