Browse Prior Art Database

Integrating Heterogeneous Text Search Engines For A Content Management System

IP.com Disclosure Number: IPCOM000234823D
Publication Date: 2014-Feb-10
Document File: 3 page(s) / 30K

Publishing Venue

The IP.com Prior Art Database

Abstract

Disclosed is a method to integrate dual (or multiple) text search engines to a Content Management System (CMS) on a relational database in a manner which reduces limitations and improves performance.

This text was extracted from a PDF file.
This is the abbreviated version, containing approximately 36% of the total text.

Page 01 of 3

Integrating Heterogeneous Text Search Engines For A Content Management System

A content management system (CMS) supports relational databases, sometimes more than one at a time, and the CMS on each database is integrated with its own unique text search engine/server. There are two separate databases; hence, the CMS on one database is independent from another.

A CMS on a relational database integrates with only one text search engine, resulting in search engine related limitations for search performance, dependability, and search features such as proximity search, fuzzy search, etc. based on the characteristics and capabilities provided by this search engine of the text search server.

To minimize these limitations, a method for integration of dual (or multiple) text search engines to the CMS on a relational database is herein disclosed.

The relational database, as a client of the text search server, has evolved to support both the text search server and Enterprise Content Manager Text Search (ECMTS) servers for text search engines. The new ECMTS server can also be stand-alone as a remote server communicating with a relational database via Transmission Control Protocol/Internet Protocol (TCP/IP) while the text search server can only be integrated with a relational database on the same computer box. The ECMTS being remote makes it possible to maintain the same indexing schedule for both the text search server and ECMTS. The CMS system administrator can configure a CMS text searchable item type with text indexes on both text search engines, and then index on both search engines after the search contents are inserted into the text searchable attributes (columns) or the search documents are imported into the remote repository.

The advantages of the CMS integration with dual indexing/search engines are many. First, the text indexing is an extensive operation; it requires a high Central Processing Unit (CPU) consumption. Since the new ECMTS server can be remote (i.e. separated from the computer box where the text search server resides), the indexing schedule can be maintained the same for both the text search server and ECMTS. The method of dual text search engines disclosed herein allows the same contents under the same CMS item type to be text-indexed in at least two sets, one for text search server, and the other for ECMTS. If one set is down, then the CMS System Administrator can switch to turn on the active flag for the other engine so that all search applications are directed to the active search engine. Second, even when both search engines are running, switching the active flag from one to the other allows user search applications to switch the search from one set of text indexes to another. A relational database

allows one search engine to be active at a time. This allows search engines to complement one another and thus minimizes the search limitations resulting from each search engine. In addition, the method adapts to users'...