Browse Prior Art Database

Predictive Failure Analysis of Hardware Errors

IP.com Disclosure Number: IPCOM000108787D
Original Publication Date: 1992-Jun-01
Included in the Prior Art Database: 2005-Mar-22
Document File: 2 page(s) / 46K

Publishing Venue

IBM

Related People

Hicken, MS: AUTHOR [+5]

Abstract

A method for alerting customers to hardware problems on direct-access storage devices (DASDs) is disclosed. The method is called hardware error rate predictive failure analysis (PFA).

This text was extracted from an ASCII text file.
This is the abbreviated version, containing approximately 99% of the total text.

Predictive Failure Analysis of Hardware Errors

       A method for alerting customers to hardware problems on
direct-access storage devices (DASDs) is disclosed.  The method is
called hardware error rate predictive failure analysis (PFA).

      By tracking the rate at which hardware errors occur, an
increase in the probability of hardware failure can be detected.

      By applying limits to the rate at which hardware errors occur,
the point at which this failure mechanism is established can be
reported.

      This method of predictive failure analysis is used to alert the
customer to hardware problems which may affect the device's ability
to function in the future.

      The hardware error log contains a record of the hardware errors
and the seek count.  The seek count is related to the usage of the
hardware components of the device, much like the odometer on an
automobile.

      Periodically, the contents of the hardware error log is
analyzed to determine if the rate of hardware errors exceeds the
replacement threshold.  The replacement threshold is based on the
minimum expected seek count per hardware error. It is important to
take a large enough sample to apply the replacement threshold over in
order to filter transient problems which may not be associated with
the device's hardware.

      If the replacement threshold is exceeded, the device is
recommended for replacement by reporting the condition to the
communicating device (see th...