Browse Prior Art Database

IMPROVED COMPRESSED TEXT MESSAGE RECEPTION WHEN MISSING MESSAGE FRAGMENTS

IP.com Disclosure Number: IPCOM000009169D
Original Publication Date: 1999-Jun-01
Included in the Prior Art Database: 2002-Aug-12
Document File: 1 page(s) / 76K

Publishing Venue

Motorola

Related People

Karl Weiss: AUTHOR

Abstract

Over the air compression of text information is usually accomplished by choosing tokens which represent the most commonly used phrases, words, or parts of words. The smallest size tokens (in num- ber of bits) are assigned to those phrases, words, or parts of words which occur most often in the domain for which the text compression is being used and defined. This leads to tokens which have differ- ent sizes and which are not integer multiples of each other. Therefore the start location of token bound- aries is not known up front by the receiving decom- pression routine and is a function of previous received tokens. This is unlike using standard ASCII 7 bit values where the token boundary occurs every 7 bits.

This text was extracted from a PDF file.
At least one non-text object (such as an image or picture) has been suppressed.
This is the abbreviated version, containing approximately 50% of the total text.

Page 1 of 1

MOTOROLA Technical Developments

@

IMPROVED COMPRESSED TEXT MESSAGE RECEPTION WHEN MISSING MESSAGE FRAGMENTS

by Karl Weiss

INTRODUCTION

  Over the air compression of text information is usually accomplished by choosing tokens which represent the most commonly used phrases, words, or parts of words. The smallest size tokens (in num- ber of bits) are assigned to those phrases, words, or parts of words which occur most often in the domain for which the text compression is being used and defined. This leads to tokens which have differ- ent sizes and which are not integer multiples of each other. Therefore the start location of token bound- aries is not known up front by the receiving decom- pression routine and is a function of previous received tokens. This is unlike using standard ASCII 7 bit values where the token boundary occurs every 7 bits.

  Because the decompression routine does not have prior knowledge of where the token boundaries occur, errors received in the message can cause tokens to be misinterpreted. These misinterpreted or errored tokens may also cause subsequent tokens to be misread because the token boundary for these tokens may now be incorrect. A mechanism to help resolve this issue exists where a default token size is chosen and every time the compression encoder inserts a token of the default size, the compression encoder will add pad bits before this default token to ensure that this token is on a boundary which is a multiple of the default token length. This will help the decompression routine to realign itself to a default symbol boundary when errors are encountered.

OPERATION

  In protocols where messages can be sen...