Construct for Enrolling System Components in PROBLEM Management Services

IP.com Disclosure Number: IPCOM000036889D
Original Publication Date: 1989-Nov-01
Included in the Prior Art Database: 2005-Jan-29
Document File: 6 page(s) / 110K

Publishing Venue

IBM

Related People

Johnston, DL: AUTHOR

Abstract

A construct called a reference code translate table (RCTT) is a table used to store information about the set of detected errors of a system component and the set of actions required to resolve each error. Each detected error is known by an encoded name called a reference code. This table is used by problem management services in the operating system to translate (decode) a reference code representing a detected error condition into the set of problem isolation procedures, problem resolution actions, and post-resolution verification procedures that must be done by the system or a system user to resolve the problem. The reference code and the name of the RCTT in which to look it up are logged in an error record by the detecting component at the time of failure.

This is the abbreviated version, containing approximately 39% of the total text.

An overview of the RCTT is shown in Fig. 1. The table is made up of a header 1 and a body 2, which contains the data. Header 1 contains the length of the table, its name, and information identifying the hardware or software component the table supports and the maintenance level of that component. Body 2 contains reference code description records, resolution action description records, and an index to each set of records.
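
The layout just described can be pictured with a small data-structure sketch. It is an illustration in Python only, not the format of the actual table: the class and field names (RcttHeader, component_id, and so on) are assumptions, the sample values are hypothetical, and a dictionary keyed by code stands in for the index that the body keeps for each set of records.

```python
from dataclasses import dataclass, field


@dataclass
class RcttHeader:
    """Header 1: identifies the table and the component it supports."""
    table_length: int       # length of the table (unit is an assumption)
    table_name: str         # name logged in error records to find this table
    component_id: str       # hardware or software component the table supports
    maintenance_level: str  # maintenance level of that component


@dataclass
class RcttBody:
    """Body 2: two sets of records, each reachable through an index."""
    # Assumption: a dict keyed by code plays the role of each index.
    reference_code_records: dict = field(default_factory=dict)
    resolution_action_records: dict = field(default_factory=dict)


@dataclass
class Rctt:
    header: RcttHeader
    body: RcttBody


# Hypothetical example values, for illustration only.
table = Rctt(
    header=RcttHeader(table_length=0, table_name="DISKRCTT",
                      component_id="DISK-UNIT", maintenance_level="01"),
    body=RcttBody(),
)
```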

There is a reference code description record for each detected error. The records contain problem isolation, problem fix verification, and problem recovery procedure names, resolution action pointers, text pointers, and data constants. The problem isolation procedure is the name of the program to be called to complete problem isolation for problems not isolated at the point of failure. The verification procedure is the name of the program to be called to verify that a problem resolution action has fixed the problem. The problem recovery procedure is the name of the program to be called to assist the user to continue useful work. The resolution actions are identified in these records by four-character resolution action codes, which point to resolution action description records. The text pointers are used to locate text residing in a message library that describes each error condition.
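
The translation of a logged reference code into its description record might be sketched as follows. This is a minimal illustration under stated assumptions: the names ReferenceCodeRecord, ErrorRecord, and decode are hypothetical, the sample codes and procedure names are invented for the example, and a nested dictionary stands in for the tables and their indexes.

```python
from dataclasses import dataclass, field


@dataclass
class ReferenceCodeRecord:
    isolation_procedure: str       # program that completes problem isolation
    verification_procedure: str    # program that verifies the fix
    recovery_procedure: str        # program that helps the user continue work
    resolution_action_codes: list  # four-character resolution action codes
    text_pointers: list            # locate descriptive text in a message library
    data_constants: dict = field(default_factory=dict)


@dataclass
class ErrorRecord:
    """What the detecting component logs at the time of failure."""
    reference_code: str
    rctt_name: str


def decode(error, tables):
    """Translate a logged reference code into its description record.

    `tables` maps an RCTT name to a dict of reference code -> record.
    Returns None when the table or the reference code is unknown.
    """
    table = tables.get(error.rctt_name)
    if table is None:
        return None
    return table.get(error.reference_code)


# Hypothetical table contents, for illustration only.
tables = {
    "DISKRCTT": {
        "A100": ReferenceCodeRecord(
            isolation_procedure="ISOLATE1",
            verification_procedure="VERIFY1",
            recovery_procedure="RECOVER1",
            resolution_action_codes=["RA01", "RA02"],
            text_pointers=["MSG0001"],
        )
    }
}

record = decode(ErrorRecord(reference_code="A100", rctt_name="DISKRCTT"), tables)
```

Because the error record carries both the reference code and the table name, the lookup needs no knowledge of the detecting component beyond what was logged.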

A resolution action description record contains information describing a resolution action. Examples are replacing a part, updating a module of code, or replacing media. There are ten kinds of resolution action records. Each contains a resolution action code, a resolution action type identifier, and resolution action description information, such as pointers used to locate text in a message library, failure cause identification information (part number, code module name, parameter name, etc.), and part location information (as applicable).
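
Following the four-character action codes to their description records might look like the sketch below. The field names, the type identifier strings, and the sample records are assumptions made for illustration; the ten record kinds are not enumerated in this text, so the type is kept as an opaque string.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ResolutionActionRecord:
    action_code: str    # four-character resolution action code
    action_type: str    # type identifier (the ten kinds are not listed here)
    text_pointer: str   # locates descriptive text in a message library
    failure_cause: str  # e.g. part number, code module name, parameter name
    part_location: Optional[str] = None  # present only where applicable


def resolve_actions(action_codes, action_records):
    """Follow each action code to its description record, keeping order.

    Codes with no matching record are skipped (an assumption of this sketch).
    """
    return [action_records[code] for code in action_codes
            if code in action_records]


# Hypothetical records, for illustration only.
action_records = {
    "RA01": ResolutionActionRecord("RA01", "REPLACE_PART", "MSG0101",
                                   failure_cause="part 1234567",
                                   part_location="slot 3"),
    "RA02": ResolutionActionRecord("RA02", "UPDATE_CODE", "MSG0102",
                                   failure_cause="module XYZ"),
}

actions = resolve_actions(["RA01", "RA02", "RA99"], action_records)
```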

Reference code description record information is used by the operating system problem management services to do the following:

Determine whether further problem isolation is required to define the minimum failure cause set.

Locate text describing the error condition.
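
The two uses listed above can be sketched as small helper functions. The rule used to decide on further isolation is an assumption of this sketch (the text does not state how the decision is made), and the function names, the message identifier, and the message text are hypothetical.

```python
def needs_further_isolation(isolated_at_failure, isolation_procedure):
    """Assumed rule: isolation is still needed when the problem was not
    isolated at the point of failure and a procedure is named to do it."""
    return (not isolated_at_failure) and bool(isolation_procedure)


def locate_text(text_pointers, message_library):
    """Return the message text for each pointer found in the library."""
    return [message_library[p] for p in text_pointers if p in message_library]


# Hypothetical message library and record fields, for illustration only.
message_library = {"MSG0001": "Disk unit reported a read error."}

further = needs_further_isolation(isolated_at_failure=False,
                                  isolation_procedure="ISOLATE1")
texts = locate_text(["MSG0001"], message_library)
```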
