Browse Prior Art Database

System and Technique for Automatically Detecting a Distributed Component Structure of Web Based Documents using a Cluster Crawler Analyzer

IP.com Disclosure Number: IPCOM000014891D
Original Publication Date: 2002-Jun-01
Included in the Prior Art Database: 2003-Jun-20

Publishing Venue

IBM

Abstract

Automatically Detecting a Distributed Component Structure of Web Based Documents using a Cluster Crawler Analyzer The system is related to the area of information retrieval technologies in the context of distributed documents and document/information classification techniques. We will describe: 1. Problem Statement 2. Proposed Solution