Monitoring and Management Critical WEB Sites
Original Publication Date: 2002-Jan-27
Included in the Prior Art Database: 2003-Jun-20
Disclosed is a monitoring and management system to allow to do a proactive monitoring and to provide some management functions to recover or prevent possible dangerous situations of a Web site where availability, visibility and reliability are critical success factors. The application also provides mechanisms to manage the cache information published in the site. As happens in any big, mission critical Intranet, a secure and reliable monitoring and managing application was required for the Olympic internal web site (INFO). In addition to the general and centralized monitoring module designed for all the Olympic Systems (based on Tivoli), it was necessary to design and develop a specific monitoring application for taking into account the specific characterises of this Web Site. The system showed to the monitor the critical parameters, correlated information of the INFO Web Servers and the reports obtained from the analysis of the Web Servers logs. These logs were processed, in an incremental way, using the Web Detection Intrusion component of Tivoli Risk Manager in the monitoring workstation for avoiding impact the performance of the application. In this way, problems were detected not only as soon as they happened or when a threshold was reached, but doing a proactive monitoring and providing some management functions to recover or prevent possible dangerous situations. This application was operated from a workstation running a Web Browser and communicating with the SP nodes, running the INFO application, using SSL and SSH2 protocols for ensuring the integrity of the interchanged data between the different SP nodes or machines. The application was extremely useful for helping to identify application problems, Web servers intrusion and incorrect workstations configurations, as well as for watching the evolution of the load of the Web Servers, DB2 servers and application resources. In this way, it allowed the Olympic I/T system to be prepared for critical situations and it provided some basic functions to be able to repair any data or application problem.