Dismiss
IP.com applications will be updated on Sunday, March 5, from 11 am to 2 pm ET, to add new functionality and content. You may experience brief service interruptions during this period. We apologize for any inconvenience.
Browse Prior Art Database

ISO-2022-JP-2: Multilingual Extension of ISO-2022-JP (RFC1554)

IP.com Disclosure Number: IPCOM000002387D
Original Publication Date: 1993-Dec-01
Included in the Prior Art Database: 2000-Sep-12
Document File: 8 page(s) / 11K

Publishing Venue

Internet Society Requests For Comment (RFCs)

Related People

M. Ohta: AUTHOR [+2]

Abstract

This memo describes a text encoding scheme: "ISO-2022-JP-2", which is used experimentally for electronic mail [RFC822] and network news [RFC1036] messages in several Japanese networks. The encoding is a multilingual extension of "ISO-2022-JP", the existing encoding for Japanese [2022JP]. The encoding is supported by an Emacs based multilingual text editor: MULE [MULE].

This text was extracted from a ASCII Text document.
This is the abbreviated version, containing approximately 28% of the total text.

Network Working Group M. Ohta

Request for Comments: 1554 Tokyo Institute of Technology

Category: Informational K. Handa

ETL

December 1993

ISO-2022-JP-2: Multilingual Extension of ISO-2022-JP

Status of this Memo

This memo provides information for the Internet community. This memo

does not specify an Internet standard of any kind. Distribution of

this memo is unlimited.

Introduction

This memo describes a text encoding scheme: "ISO-2022-JP-2", which is

used experimentally for electronic mail [RFC822] and network news

[RFC1036] messages in several Japanese networks. The encoding is a

multilingual extension of "ISO-2022-JP", the existing encoding for

Japanese [2022JP]. The encoding is supported by an Emacs based

multilingual text editor: MULE [MULE].

The name, "ISO-2022-JP-2", is intended to be used in the "charset"

parameter field of MIME headers (see [MIME1] and [MIME2]).

Description

The text with "ISO-2022-JP-2" starts in ASCII [ASCII], and switches

to other character sets of ISO 2022 [ISO2022] through limited

combinations of escape sequences. All the characters are encoded

with 7 bits only.

At the beginning of text, the existence of an announcer sequence:

"ESC 2/0 4/1 ESC 2/0 4/6 ESC 2/0 5/10" is (though omitted) assumed.

Thus, characters of 94 character sets are designated to G0 and

invoked as GL. C1 control characters are represented with 7 bits.

Characters of 96 character sets are designated to G2 and invoked with

SS2 (single shift two, "ESC 4/14" or "ESC N").

For example, the escape sequence "ESC 2/4 2/8 4/3" or "ESC $ ( C"

indicates that the bytes following the escape sequence are Korean KSC

characters, which are encoded in two bytes each. The escape sequence

"ESC 2/14 4/1" or "ESC . A" indicates that ISO 8859-1 is designated

to G2. After the designation, the single shifted sequence "ESC 4/14

4/1" or "ESC N A" is interpreted to represent a character "A with

acute".

The following table gives the escape sequences and the character sets

used in "ISO-2022-JP-2" messages. The reg# is the registration number

in ISO's registry [ISOREG].

94 character sets

reg# character set ESC sequence designated to

------------------------------------------------------------------

6 ASCII ESC 2/8 4/2 ESC ( B G0

42 JIS X 0208-1978 ESC 2/4 4/0 ESC $ @ G0

87 JIS X 0208-1983 ESC 2/4 4/2 ESC $ B G0

14 JIS X 0201-Roman ESC 2/8 4/10 ESC ( J G0

58 GB2312-1980 ESC 2/4 4/1 ESC $ A G0

149 KSC5601-1987 ESC 2/4 2/8 4/3 ESC $ ( C G0

159 JIS X 0212-1990 ESC 2/4 2/8 4/4 ESC $ ( D G0

...