Taxonomized Web Search

IP.com Disclosure Number: IPCOM000118716D
Original Publication Date: 1997-May-01
Included in the Prior Art Database: 2005-Apr-01
Document File: 2 page(s) / 45K

Publishing Venue

IBM

Related People

Raghavan, P: AUTHOR

Abstract

Disclosed is a search interface (window) that combines keyword search with classified search, and a technique for machine classification of pages of the Web (and similar corpora).

This text was extracted from an ASCII text file.
This is the abbreviated version, containing approximately 62% of the total text.

      Disclosed is a search interface (window) that combines keyword
search with classified search, and a technique for machine
classification of pages of the Web (and similar corpora).

      Available products on the Web provide two types of search
mechanisms: Web directories and search engines.

      Web directories, such as the one provided by "Yahoo", give a
tree-like classification.  Moving along the tree, a user can access a
limited set of pages that were manually classified and added to the
tree.  Such a directory typically contains only a small fraction of
the Web.

      Search engines, such as Lycos, Alta Vista, or Yahoo's search
engine, claim to reach a large portion of the Web.  The drawback of
these search engines is that their search is syntactic only, i.e.,
the search returns a list of pages that include a given key (or
keys).  The common problem with using a search engine is that most
queries return either no pages or a long list of pages, all of which
include the given keys, but most of which are irrelevant.  The user
must then go through the titles manually, one after another, to find
the required page.
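
      The purely syntactic behavior described above can be
illustrated with a minimal sketch.  It assumes a toy in-memory
inverted index; the corpus, the URLs, and the function names are
hypothetical and are not taken from any of the products named.

```python
from collections import defaultdict

# Hypothetical toy corpus (page URL -> page text); not from the disclosure.
PAGES = {
    "zoo.example/jaguar": "the jaguar is a large cat native to the americas",
    "cars.example/jaguar": "jaguar unveils a new luxury car model",
    "os.example/jaguar": "jaguar was the code name of an operating system release",
}


def build_index(pages):
    """Map each word to the set of pages whose text contains it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in text.lower().split():
            index[word].add(url)
    return index


def keyword_search(index, keys):
    """Return the pages that contain every key, with no notion of topic."""
    results = None
    for key in keys:
        hits = index.get(key.lower(), set())
        results = hits if results is None else results & hits
    return sorted(results or [])


index = build_index(PAGES)
print(keyword_search(index, ["jaguar"]))
```

      In this sketch the single key "jaguar" matches all three pages
(an animal, a car, and an operating system), which is the long,
mostly irrelevant result list the text describes, while a key that
appears on no page returns an empty list.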

      The disclosed method is a search interface that allows queries
that combine context-free syntactic search with context-sensitive
search, guided by keywords.  In parallel to the crawler that builds
the database for the syntactic search, the disclosed method would run
a second crawler that automatically classifies pages...
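
      As a rough illustration of a combined query, the sketch below
filters keyword matches by a position in a classification tree.  It
assumes each page already carries a taxonomy path; how the second
crawler assigns that path is not shown here, and the data, category
names, and function names are all hypothetical.

```python
# Hypothetical data: page URL -> (page text, taxonomy path).  The paths
# are assumed to have been assigned by some classifier beforehand; the
# classification technique itself is not modeled in this sketch.
PAGES = {
    "zoo.example/jaguar": ("the jaguar is a large cat", ("Science", "Animals")),
    "cars.example/jaguar": ("jaguar unveils a new car", ("Business", "Automotive")),
    "vet.example/cats": ("caring for a pet cat", ("Science", "Animals", "Pets")),
}


def matches_keys(text, keys):
    """Syntactic part: every key must appear as a word in the text."""
    words = set(text.lower().split())
    return all(key.lower() in words for key in keys)


def in_category(path, category):
    """Classified part: the page's path lies at or below `category`."""
    return path[:len(category)] == tuple(category)


def combined_search(pages, keys, category):
    """Return pages that match all keys and fall under the category."""
    return sorted(
        url
        for url, (text, path) in pages.items()
        if matches_keys(text, keys) and in_category(path, category)
    )


print(combined_search(PAGES, ["jaguar"], ("Science",)))
```

      With the category restricted to "Science", the key "jaguar"
returns only the page about the animal; with an empty category the
query degenerates to the plain keyword search.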