Browse Prior Art Database

Automatic Remote Disable of Processor using SMI

IP.com Disclosure Number: IPCOM000019255D
Original Publication Date: 2003-Sep-08
Included in the Prior Art Database: 2003-Sep-08

Publishing Venue

IBM

Abstract

A local SP in an SMP server monitors environmental and CPU PFA events. When an event occurs indicating a CPU is going to fail, the local SP informs a remote management server (or SP) which may authenticate & validate the need for the CPU to be taken offline. If so, the remote management server/SP responds to the local SP requesting an appropriate SMI in BIOS be invoked. In the SMI, BIOS transfers thread tasks running atop the faulting CPU to an available CPU, & brings down the faulting CPU to a stand-by or powered-off state. With BIOS handshaking to the local SP, a notice of the avoided fault may be sent back to the remote management server/SP. Several advantages to this technique: 1. A wider mapping of PFA & other system-wide information may be used to identify the CPU is about to fail (ie, thermal core overtempurature). 2. A policy on the remote system may be used to 2.1 analyze the reason the CPU appears to be faulting, to validate the request to take that CPU offline 2.2 allow a customer a choice in using this automatic fault-avoidance technique