Message Storage and Transmission Using Mixed Alphanumeric and Word Codes

IP.com Disclosure Number: IPCOM000047082D
Original Publication Date: 1983-Sep-01
Included in the Prior Art Database: 2005-Feb-07
Document File: 2 page(s) / 31K

Publishing Venue

IBM

Related People

Price, CA: AUTHOR

Abstract

This compression algorithm uses a unique code for each of a large number of words comprising a vocabulary. A default procedure is also provided whereby standard EBCDIC codes can be used for short words that do not justify the special encoding, as well as for words not in the specially coded vocabulary. Switching between the special-code and EBCDIC-code groups of bytes is signalled by two-byte codes which indicate spaces and also characterize the significance of the bytes which follow.

This is the abbreviated version, containing approximately 53% of the total text.

This compression algorithm uses a unique code for each of a large number of words comprising a vocabulary. A default procedure is also provided whereby standard EBCDIC codes can be used for short words that do not justify the special encoding, as well as for words not in the specially coded vocabulary. Switching between the special-code and EBCDIC-code groups of bytes is signalled by two-byte codes which indicate spaces and also characterize the significance of the bytes which follow. The following table gives an example of such space code designations:

  Code*
  'B0' - EBCDIC code follows until next 'Bx' code.
  'B1' - Special 3-byte code for word follows. Lower case.
  'B2' - Special 3-byte code for word follows. Upper case all letters.
  'B3' - Special 3-byte code for word follows. Upper case first letter.
  'B4' - Punctuation mark with preceding space.
  'B5' - Punctuation mark without preceding space.
  'B6' - Same as 'B1' without preceding space.
  'B7' - Same as 'B2' without preceding space.
  'B8' - Same as 'B3' without preceding space.

Using a special code set of three bytes per language word, a very large vocabulary could be accommodated. Words of no more than three letters need not be specially coded, since they could just as well be spelled out in EBCDIC form.

An example of the phrase SINCE THE ENGLISH LANGUAGE IS, coded as indicated above, would be:

  'B2' (3-byte code for since)
  'B0' (EBCDIC code for THE)
  'B2' (3-byte code for english)
  'B2' (3-byte code for language)
  'B0' (EBCDIC code for IS)

For a smaller vocabulary, a special two-byte code set could be used, in wh...
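
The switching between word codes and EBCDIC described above can be illustrated with a minimal sketch. It is not the disclosed implementation, and several things in it are assumptions: tokens are kept as (space code, payload) pairs rather than packed bytes, the three-entry vocabulary and its 3-byte values are invented, Python's "cp037" codec stands in for EBCDIC, a 'B0' token is emitted for every EBCDIC word, mixed-case words fall back to EBCDIC, and the punctuation and no-preceding-space codes ('B4' to 'B8') are left out.

```python
# Sketch of the mixed word-code / EBCDIC scheme (illustrative only).
# Assumptions: symbolic (space_code, payload) tokens, an invented vocabulary,
# and the "cp037" codec as the EBCDIC code page.

VOCAB = {  # hypothetical vocabulary: lower-case word -> invented 3-byte code
    "since": b"\x00\x00\x01",
    "english": b"\x00\x00\x02",
    "language": b"\x00\x00\x03",
}
REVERSE = {code: word for word, code in VOCAB.items()}


def encode_word(word):
    """Return one (space_code, payload) token for a single word."""
    code = VOCAB.get(word.lower())
    # Words of three letters or fewer are never specially coded.
    if len(word) > 3 and code is not None:
        if word.islower():
            return ("B1", code)   # lower case
        if word.isupper():
            return ("B2", code)   # upper case, all letters
        if word[0].isupper() and word[1:].islower():
            return ("B3", code)   # upper case, first letter only
    # Short, unknown, or mixed-case words are spelled out in EBCDIC.
    return ("B0", word.encode("cp037"))


def encode(text):
    """Encode space-separated words; punctuation is not handled here."""
    return [encode_word(w) for w in text.split()]


def decode(tokens):
    """Rebuild the text, using each space code to restore the word's case."""
    words = []
    for space_code, payload in tokens:
        if space_code == "B0":
            words.append(payload.decode("cp037"))
        else:
            word = REVERSE[payload]
            if space_code == "B2":
                word = word.upper()
            elif space_code == "B3":
                word = word.capitalize()
            words.append(word)
    return " ".join(words)


tokens = encode("SINCE THE ENGLISH LANGUAGE IS")
print([code for code, _ in tokens])  # ['B2', 'B0', 'B2', 'B2', 'B0']
print(decode(tokens))                # SINCE THE ENGLISH LANGUAGE IS
```

Under these assumptions the sequence of space codes matches the worked example for SINCE THE ENGLISH LANGUAGE IS. Because case is carried by the space code, one vocabulary entry serves the lower-case, upper-case, and capitalized forms of a word.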