Browse Prior Art Database

Supporting User-Defined Characters During Data Set Translation

IP.com Disclosure Number: IPCOM000043556D
Original Publication Date: 1984-Sep-01
Included in the Prior Art Database: 2005-Feb-05
Document File: 3 page(s) / 16K

Publishing Venue

IBM

Related People

Aiken, JA: AUTHOR [+2]

Abstract

The disclosed method for converting some Japanese Business/Personal Computer (JPC) data sets into and from documents that can be processed by a Japanese Word Processor (JWP) on a small system takes into account the differing data stream representations to provide a conversion that is as faithful as possible. In particular, the existence of user-defined characters in JPC and JWP is supported as faithfully as possible. User-defined characters are translated so that no information is lost (if the users adhere to some simple rules when defining user characters), and so that the output resembles the input as far as possible, even though the data stream of JPC does not support the same number of user-defined characters as the JWP data stream.

This text was extracted from a PDF file.
This is the abbreviated version, containing approximately 39% of the total text.

Page 1 of 3

Supporting User-Defined Characters During Data Set Translation

The disclosed method for converting some Japanese

Business/Personal Computer (JPC) data sets into and from documents that can be processed by a Japanese Word Processor (JWP) on a small system takes into account the differing data stream representations to provide a conversion that is as faithful as possible. In particular, the existence of user-defined characters in JPC and JWP is supported as faithfully as possible. User-defined characters are translated so that no information is lost (if the users adhere to some simple rules when defining user characters), and so that the output resembles the input as far as possible, even though the data stream of JPC does not support the same number of user-defined characters as the JWP data stream. A Japanese small business computer/personal computer may support several applications tailored to the particular user set. Business computer applications require a general-purpose system which is very flexible and which may be used under a large variety of situations. Users are often familiar with computer systems and programming concepts. JPC data sets, in particular, are designed to be able to support a very large variety of business or personal applications. This means that the data set and data stream architectures should be very simple and extendable to many applications. Word processing applications present special performance and function requirements. To provide optimum function for a word processing user, the system and application programming may be tailored specifically to the unique JWP requirements. Data set and data stream architectures, in particular, may be very complex and designed specifically for the JWP needs. Word processing operators as a class tend to be less sophisticated in their knowledge of computer concepts than general JPC users. This is not necessarily a disadvantage, since the JWP applications generally perform all functions associated with data streams and data set architectures.

The JWP operator thus does not know at all about the complexity of the underlying structures of a document. The Japanese written language consists of many thousands of ideographic characters (Kanji). The number of Kanji characters actually in use is unknown, but is in excess of 40,000. A small system could not reasonably support that many characters, even if they could all be defined in a manner generally acknowledged. Therefore, the ability for system users to define new characters is required on Japanese systems. This allows JWP and JPC to support between 7,000 and 8,000 system-defined characters, while users can tailor the system to support their particular needs. While both JPC and JWP support user-definable characters, the manner in which each does so is different. In both data stream architectures, though, a certain range of code points in the Kanji code range has been reserved for user-defined characters. These characters...