A method for Managing Node Failures in a Storage System with Multiple Paths from Clients to Data
Original Publication Date: 2003-May-01
Included in the Prior Art Database: 2003-May-01
A method and program product is disclosed for policy-based optimization of "shared nothing" multi-node file servers. Multi-node file servers, eg, a NAS with several computational engines capable of serving files to clients, are often structured as "shared nothing" systems: each filesystem (or volume, if block services are provided) is available at any given time through at most one node. When that node suffers a failure, the filesystems it serves become unavailable until the multi-node file server finds a node that can pick up fileserving duties (the "failover node"). The invention solves the problem of finding that node in a way that balances unavailability against performance and the possibility of secondary failures due to increased computational load (stress) on the failover node(s).