Dismiss
InnovationQ will be updated on Sunday, Oct. 22, from 10am ET - noon. You may experience brief service interruptions during that time.
Browse Prior Art Database

8-Bit CHARACTER Encoding for Multiple Languages

IP.com Disclosure Number: IPCOM000046347D
Original Publication Date: 1983-Jul-01
Included in the Prior Art Database: 2005-Feb-07
Document File: 1 page(s) / 11K

Publishing Venue

IBM

Related People

Mayfield, VA: AUTHOR

Abstract

An 8-bit character encoding technique is described which will support a supplementary spelling dictionary in a text processing system for eleven or more languages. The character encoding technique provides all required diacritic and upper case combinations in seven bits per character. The eighth bit is used as a flag to define whether a hyphen point follows the character.

This text was extracted from a PDF file.
This is the abbreviated version, containing approximately 100% of the total text.

Page 1 of 1

8-Bit CHARACTER Encoding for Multiple Languages

An 8-bit character encoding technique is described which will support a supplementary spelling dictionary in a text processing system for eleven or more languages. The character encoding technique provides all required diacritic and upper case combinations in seven bits per character. The eighth bit is used as a flag to define whether a hyphen point follows the character.

The character encoding transform is structured as follows:

BIT 8 - HYPHENATION FLAG

BIT 7 - CAPITALIZATION FLAG

BIT 6 - ACCENT FLAG

BITS 5 Z 1 - CHARACTER CODE WHEN BIT 6 = 0

BITS 5 Z 4 - ACCENT CODE WHEN BIT 6 = 1

BITS 3 Z 1 - CHARACTER CODE WHEN BIT 6 = 1

Characters are processed by testing the high-order flag bits. The low-order bits contain the character code sequence or accented character with the specific accent. Under this encoding technique, all characters can be translated to their previous forms.

1