
Extracting Salient Keywords in a Document that belong to a Specific Context

IP.com Disclosure Number: IPCOM000010453D
Original Publication Date: 2002-Dec-03
Included in the Prior Art Database: 2002-Dec-03

Publishing Venue

IBM

Abstract

Disclosed is a novel method for extracting keywords that are salient within a document and relevant to a specific context. The method builds a glossary of terms from a large collection of documents and recalculates the relevancy score of each glossary term for each context. The resulting context-biased glossary is then used to extract salient keywords from the documents.
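The three steps named in the abstract (build a glossary, re-score it per context, extract keywords with the biased glossary) can be illustrated with a minimal sketch. The abstract does not specify how relevancy scores are computed or recalculated, so everything below is an assumption made for illustration: the IDF-style base score, the relative-frequency ratio used as the context bias, and the hypothetical function names `build_glossary`, `bias_glossary`, and `extract_keywords`.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and split into alphabetic tokens (a simplification).
    return re.findall(r"[a-z]+", text.lower())


def build_glossary(corpus):
    # Assumed base score: a smoothed IDF over the whole collection.
    n_docs = len(corpus)
    doc_freq = Counter()
    for doc in corpus:
        doc_freq.update(set(tokenize(doc)))
    return {
        term: math.log((1 + n_docs) / (1 + df)) + 1.0
        for term, df in doc_freq.items()
    }


def bias_glossary(glossary, context_docs, corpus):
    # Assumed bias: scale each base score by how much more frequent the
    # term is in the context documents than in the collection overall.
    ctx_counts = Counter(t for d in context_docs for t in tokenize(d))
    all_counts = Counter(t for d in corpus for t in tokenize(d))
    ctx_total = sum(ctx_counts.values()) or 1
    all_total = sum(all_counts.values()) or 1
    biased = {}
    for term, base in glossary.items():
        ctx_rate = ctx_counts[term] / ctx_total
        all_rate = all_counts[term] / all_total  # > 0 for glossary terms
        biased[term] = base * (ctx_rate / all_rate)
    return biased


def extract_keywords(doc, biased_glossary, top_k=3):
    # Score each term by in-document frequency times its biased score.
    term_freq = Counter(tokenize(doc))
    scored = {t: c * biased_glossary.get(t, 0.0) for t, c in term_freq.items()}
    ranked = sorted(scored.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, score in ranked[:top_k] if score > 0]


corpus = [
    "the bank approved the loan and the interest rate",
    "the river bank flooded after the rain",
    "the loan interest was paid to the bank",
    "rain fell on the river and the forest",
]
glossary = build_glossary(corpus)
finance = bias_glossary(glossary, [corpus[0], corpus[2]], corpus)
nature = bias_glossary(glossary, [corpus[1], corpus[3]], corpus)

doc = "the bank near the river offered a loan"
print(extract_keywords(doc, finance))
print(extract_keywords(doc, nature))
```

The same document yields different keywords depending on which biased glossary is applied: terms that never occur in a context's documents receive a score of zero there and are dropped. A practical version would likely also filter stopwords and handle multi-word terms, neither of which this sketch attempts.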