Browse Prior Art Database

Validation of Double-Byte Character Sets Text for Prompting in a Language-Sensitive Editing System

IP.com Disclosure Number: IPCOM000114036D
Original Publication Date: 1994-Nov-01
Included in the Prior Art Database: 2005-Mar-27
Document File: 4 page(s) / 137K

Publishing Venue

IBM

Related People

Storisteanu, A: AUTHOR

Abstract

SO SI DBCS SBCS EOL True False ______ ______ ______ ______ ______ ______ ______ Sn Sn Sn Sn+1 Sn TRUE - - Snt1 - - - - - FALSE Sn Ss Sst2 FALSE Sst1 Ss TRUE - - Sst1 - - - - - FALSE. Ss Sst2 - - - - - FALSE. SOinfo:=vector(i) Sd Sd Sdt2 Sdt1 Sdt3 FALSE. FALSE. - - Sdt1 - - - - - FALSE. Ss Sdt2 - - - - - FALSE. Sd Sdt3 - - - - - FALSE. Sd

This text was extracted from an ASCII text file.
This is the abbreviated version, containing approximately 36% of the total text.

Validation of Double-Byte Character Sets Text for Prompting in a
Language-Sensitive Editing System

   SO      SI      DBCS    SBCS    EOL     True    False
      ______  ______  ______  ______  ______  ______  ______
 Sn     Sn      Sn      Sn+1    Sn      TRUE    -       -
 Snt1   -       -       -       -       -       FALSE   Sn
 Ss     Sst2    FALSE   Sst1    Ss      TRUE    -       -
 Sst1   -       -       -       -       -       FALSE.  Ss
 Sst2   -       -       -       -       -       FALSE.
SOinfo:=vector(i)
                                                       Sd
 Sd     Sdt2    Sdt1    Sdt3    FALSE.  FALSE.  -       -
 Sdt1   -       -       -       -       -       FALSE.  Ss
 Sdt2   -       -       -       -       -       FALSE.  Sd
 Sdt3   -       -       -       -       -       FALSE.  Sd

      Typical problems in writing programs that support Double-Byte
Character Sets (DBCS) are to avoid the splitting of the DBCS byte
pairs for most text handling and rendering operations, to ensure that
only certain fields receive DBCS or mixed characters, and to maintain
correct DBCS sequences.  This disclosure proposes a finite-state
machine based method and algorithm to carry out all the text
validations for prompting (editing templates) purposes in a
language-sensitive editing system.

This method addresses the complete prompting validation of text for
both:
  o  systems that support and maintain (emulated) Shift Out (SO) and
      Shift In (SI) delimiters around DBCS characters in the text
      (here, SOSI mode).  When the prompt dialog entry fields are
      prefilled with DBCS text, emulated SO and SI delimiters are
      stripped, and restored when the text line is reconstructed in
the
      source; and
  o  systems that handle DBCS characters identified as such by their
      code point, where the SO and SI characters are treated like any
      other controls, with no special DBCS meaning (here, NO SOSI
      mode).

      The text line is scanned to verify it conforms to the
definition of the entry fields that appear in the particular prompt.
Prompting, here, implies any splitting of the text into fields for
individual editing, followed by the recreation of the text, where
each field has its own characteristics:  a starting position in the
line, a certain length, whether it accepts SBCS characters only, or
mixed characters.  Such editing templates are common in the source
editing of fixed-format column-sensitive programming languages, like
RPG.

      The validation function is invoked when it is establ...