Method and System for Managing Fatal Error Handling (FEH) in a Multi-node Server Environment

IP.com Disclosure Number: IPCOM000199710D
Publication Date: 2010-Sep-15
Document File: 2 page(s) / 44K

Publishing Venue

The IP.com Prior Art Database

Abstract

A method and system for managing Fatal Error Handling (FEH) in a multi-node server environment are disclosed. The method involves executing FEH processes at each node of the multi-node server and performing a power cycle (restart) operation at each node once every node of the multi-node server has completed its FEH processes.

This is the abbreviated version, containing approximately 51% of the total text.

Disclosed is a method and system for managing Fatal Error Handling (FEH) in a multi-node server environment. FEH processes are executed to handle fatal errors that occur in one or more nodes of a multi-node server complex. Once the FEH processes have completed at every node of the complex, a power cycle operation is performed at each node. A power cycle operation at a node consists of switching off the power supplied to the node and then switching it back on so that the node can resume normal operation. The method further involves coordinating the power cycle operations across all nodes: a primary node of the multi-node server complex waits for each secondary node to complete its FEH processes and thereafter performs the power cycle operation at each secondary node.
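The coordination described above can be sketched as a small simulation. This is only an illustrative sketch, not the disclosed implementation: the names Node, run_feh, power_cycle, and coordinate_power_cycle are hypothetical, and power cycling the primary node last is an assumption, since the text only states that the primary performs the operation at each secondary node.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical stand-in for one node of the multi-node server complex."""
    name: str
    feh_done: bool = False
    power_log: list = field(default_factory=list)

    def run_feh(self):
        # Placeholder for the node's fatal error handling work.
        self.feh_done = True

    def power_cycle(self):
        # A power cycle is modelled as power off followed by power on.
        self.power_log.extend(["off", "on"])


def coordinate_power_cycle(primary, secondaries):
    """Power cycle every node only if all nodes have finished FEH.

    Returns True if the power cycles were performed, False otherwise.
    """
    nodes = [primary, *secondaries]
    if not all(node.feh_done for node in nodes):
        return False  # at least one node is still running FEH; keep waiting
    for node in secondaries:
        node.power_cycle()
    # Assumption: the primary restarts itself after the secondaries.
    primary.power_cycle()
    return True
```

In this sketch a caller would invoke coordinate_power_cycle repeatedly (or on each completion notification) until it returns True; no node is restarted while any other node is still handling the fatal error.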

The disclosed method and system are illustrated as a flowchart in Fig. 1.

[Figure 1: flowchart of the disclosed method; image not reproduced]

As shown in Fig. 1, the multi-node server environment includes multiple nodes connected to one another to form a multi-node server complex. The multi-node server complex includes a primary node and multiple secondary nodes. Upon initialization of the multi-node server complex, the primary node determines the number of secondary nodes connected to it. This count may be saved in a software variable. Further, the primary node and each secondary node of the multi-node server complex include a Baseboard Management Controller (BMC) chipset and firmware for performing system management functions....
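The node count kept by the primary node can be illustrated with a short sketch. The class and method names below (PrimaryNode, initialize, report_feh_complete, all_secondaries_done) are hypothetical, and using the stored count to decide when every secondary node has finished FEH is an assumption made for illustration; the abbreviated text does not state how the count is used.

```python
class PrimaryNode:
    """Hypothetical model of the primary node's bookkeeping."""

    def __init__(self):
        # The "software variable" holding the number of secondary nodes.
        self.secondary_count = 0
        # Identifiers of secondary nodes that have reported FEH completion.
        self.reported = set()

    def initialize(self, connected_secondaries):
        # On initialization, count the secondary nodes connected to the primary.
        self.secondary_count = len(connected_secondaries)
        self.reported.clear()

    def report_feh_complete(self, node_id):
        # A set ignores duplicate reports from the same node.
        self.reported.add(node_id)

    def all_secondaries_done(self):
        # Assumption: the count is compared with the number of completion reports.
        return len(self.reported) == self.secondary_count
```

For example, after initialize(["n1", "n2"]) the primary would treat the complex as ready for a coordinated power cycle only once both "n1" and "n2" have reported.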