Browse Prior Art Database

Method and System to Intelligently Identify Characteristics of a Log file for grouping and analysis

IP.com Disclosure Number: IPCOM000217433D
Publication Date: 2012-May-07
Document File: 3 page(s) / 47K

Publishing Venue

The IP.com Prior Art Database

Abstract

Disclosed is an automated method to group and categorize like sections of a log file to improve the usability and serviceability of the log. The core idea is to intelligently group the content in a file by using a Natural Language Processing (NLP) engine to categorize a file and identify like data as well as the beginning and end of that like data.

This text was extracted from a PDF file.
This is the abbreviated version, containing approximately 52% of the total text.

Page 01 of 3

Method and System to Intelligently Identify Characteristics of a Log file for grouping and analysis

The disclosed invention is a method to improve the serviceability of products and the productivity of users. The method introduces an application or plugin that takes either unstructured or structured data, such as that in a log file, categorizes it, and then intelligently parses, organizes, and groups the contents through a Natural Language Processing (NLP) engine. The result of this process is the identification of like data as well as the beginning and end of that like data. Furthermore, the disclosure seeks to persistently improve the success rate of the NLP engine by monitoring the action of a user when reviewing a particular file and taking into account certain attributes such as their age, job role, experience, etc.

Example Scenario #1


User A is reviewing a document on his smart phone. Given the length of the document and the small screen, it is not practical to read the entire document. Moreover, he is only interested in a small portion of the document. Using the concept in this disclosure User A confidently reads only the section he needs, clearly knowing where the section began and ended.

Example Scenario #2


User B works in Server Support for an enterprise application. A customer sends User B a log file that is 30,000 lines. User B spends 30 minutes trying to group the beginning and end of a certain request. As a result, User B tells the customer that analysis of the trace file will take some additional time and will need to call the customer back. If User B had a tool that intelligently identified the beginning and end of the particular thread or request, then User B would have been able to resolve the issue with the customer on the phone in only a few minutes.

Related Prior Art:


• System and method for configuring tracing and logging functions. [1]


• Computer software framework and method for logging messages. [2]


• Automatic analysis of the properties of a system based on runtime logs [3]

The idea disclosed herein is novel because it does not claim how a log or trace file is written. Rather, the invention identifies an automated method to group and categorize like sections of a log file to improve the usability and serviceability of the log.

The core idea is to intelligently group the content in a file by using an NLP engine to categorize a file and identify like data as well as the beginning and end of that like data. For example, if a trace log is 200MB in size and contains nearly 1,000,000 lines, it is difficult for a human to discern what they need from that file. However, since the content in a tr...