Browse Prior Art Database

Augmenting Indexing System

IP.com Disclosure Number: IPCOM000036945D
Original Publication Date: 1989-Nov-01
Included in the Prior Art Database: 2005-Jan-29
Document File: 2 page(s) / 14K

Publishing Venue

IBM

Related People

Larson, LE: AUTHOR

Abstract

Described is a process of augmenting indexes to items to improve the information retrieval characteristics of the index. The indexer achieves the improvement by modifying the index according to a specified set of rules. The specified set of rules constitute a system of logic that will make conclusions about the indexed item based upon clues contained within the index material of the item.

This text was extracted from a PDF file.
This is the abbreviated version, containing approximately 55% of the total text.

Page 1 of 2

Augmenting Indexing System

Described is a process of augmenting indexes to items to improve the information retrieval characteristics of the index. The indexer achieves the improvement by modifying the index according to a specified set of rules. The specified set of rules constitute a system of logic that will make conclusions about the indexed item based upon clues contained within the index material of the item.

The conclusions the augmenting indexing system reaches about the indexed item may result in the deletion of certain words. Almost every indexing environment would delete the word 'THE' from an item index. However the word 'ALL' should not be deleted from an item index involved with quantified logic. Another possible conclusion is the replacement of indexing terms in an item index.

Depending upon the environment, it may be beneficial to replace the collection of terms 'COMPILE' 'COMPILATION' and 'COMPILING' with the single root term '%COMPIL%'.

Pure addition of indexing terms may be required in some environments. An item index having the terms 'ALLOCATE', 'MEMORY', and 'SERVICE' may benefit by the addition of the term 'STORAGE_MANAGEMENT'.

These decisions are dependent upon the context in which they occur, and do not apply universally to every item.

The specification of the augmenting indexing system is simply a bounded length string of characters. The characters composing the string can be upper and lower case alphabetics, numerics, special characters (e.g., exclamation), and blank.

The characters in the string appear in two categories: separator, and word. The separator category is composed of any number of blank characters. A word is a single special character or a sequence of numerics and alphabetics. The following list of rules define the specification language for the indexing system: . Rule_Set ::= '' | Rule_Set Simple_Rule

This rule simply states that the set of rules defining the indexing system may be empty, resulting in no changes to any words used as an index. . Simple_Rule ::= ':' Not_Word_Lis...