Dismiss
InnovationQ will be updated on Sunday, Oct. 22, from 10am ET - noon. You may experience brief service interruptions during that time.
Browse Prior Art Database

A Character Input Method Featured Sub-setting of Character Sets and its Display Method

IP.com Disclosure Number: IPCOM000123656D
Original Publication Date: 1999-Feb-01
Included in the Prior Art Database: 2005-Apr-05
Document File: 3 page(s) / 111K

Publishing Venue

IBM

Related People

Enomoto, Y: AUTHOR [+4]

Abstract

A program is disclosed that character input methods filter the characters generated from by referring to the database that correlates each character with the attribute indicating that it is useable or not in a system, in an application, or by an end user. A program is also disclosed that display methods of intermediate characters ease to differentiate the characters filtered by the character input methods from the characters originally contained in the dictionaries of character input methods.

This text was extracted from an ASCII text file.
This is the abbreviated version, containing approximately 52% of the total text.

A Character Input Method Featured Sub-setting of Character Sets and
its Display Method

   A program is disclosed that character input methods filter
the characters generated from by referring to the database that
correlates each character with the attribute indicating that it is
useable or not in a system, in an application, or by an end user.  A
program is also disclosed that display methods of intermediate
characters ease to differentiate the characters filtered by the
character input methods from the characters originally contained in
the dictionaries of character input methods.

   Most conventional character input methods have a kind of
dictionary that correlates Japanese phonetic symbols with Kanji
characters (ideographs).  They allow all the characters to be
inputted and used in a system or an application, which may be
harmful to some application users who want to restrict use of
certain characters in order to keep data base integrity or to
prevent certain characters from being displayed corrupted in
heterogeneous systems.  IBM or vendor selected characters are an
example to be restricted in electrical mails.

   As the number of characters increases with the advent of
Universal Multiple-Octet Coded Character Set, standardized in
ISO/IEC 10646-1, the function to limit unnecessary characters and
allow necessary characters to be used in a system is desired.  Our
program provides an easy solution to the problem.
  1) Structure of usable character set database

   In our methods, character input methods have the usable
character set database that contains information on which characters
can be allowed to be used or not.  The database registers usable
characters by the character names (or code points of characters)
under certain character set categories as follows:
  Character set category: Character name(or code point of
  character)

   Example:
  JIS X 0201 Roman character set:  LA01, LA02, LA03, ...
  JIS X 0208 Kanji character set:  JIS0101, JIS0102, JIS0103, ...

   The characters such as LA01 (Japanese Roman alphabet A) are
categorized under JIS X 0201 Roman character set and they are
allowed to be used.  The characters such as JIS0101 (Japanese Kanji
ward 1 and point 1) are categorized under JIS X 0208 Kanji character
set and they are allowed to be used.
  2) Character set filtering function

   Character input methods select candidates from the
characters registered to a Kana-Kanji conversion dictionary in the
conventional way.  In our method, character input methods
additionally refer to the usable character set database ex...