Browse Prior Art Database

System and Technique for Self-Maintaining Hyperlinks, embedded within Web-based documents

IP.com Disclosure Number: IPCOM000013264D
Original Publication Date: 2002-Jun-01
Included in the Prior Art Database: 2003-Jun-18
Document File: 7 page(s) / 85K

Publishing Venue

IBM

Abstract

Self-Maintaining Hyperlinks Embedded within Web-based Documents This idea is related to the fields of Web-based information management, and dynamic handling of changing data references. The seminal importance of the Internet as a generalized communication medium has been clear from the very outset. As the Web has surged in popularity, however, the number of Web documents has also grown exponentially. Many documents are now authored solely for the purpose of Web publishing. A browser with an HTML editor and some knowledge of HTML on the part of the human author are sufficient to create basic static Web documents. At a slightly advanced level, knowledge or Perl, Java and scripting languages such as JavaScript can facilitate the creation of dynamic Web content. The beauty of online documents is that they can use the functionality afforded by hypertext to contain "hyperlinks" to other Web documents. The term "hypertext" refers to the organization of information units into connected associations that a user can choose to make. A "hyperlink", "link", or "hypertext link" is a specific instance of such an association. The extensive use of Hypertext and Hyperlinks in Online documents has been a driving force behind the proposed solution of the World Wide Web, which is, after all, nothing more than an enormous amount of information content connected by an enormous number of hypertext links. This document comprises of 1) Problem Description

This text was extracted from a PDF file.
At least one non-text object (such as an image or picture) has been suppressed.
This is the abbreviated version, containing approximately 15% of the total text.

Page 1 of 7

  System and Technique for Self-Maintaining Hyperlinks, embedded within Web-based documents

Self-Maintaining Hyperlinks Embedded within Web-based Documents

This idea is related to the fields of Web-based information management, and dynamic handling of changing data references. The seminal importance of the Internet as a generalized communication medium has been clear from the very outset. As the Web has surged in popularity, however, the number of Web documents has also grown exponentially. Many documents are now authored solely for the purpose of Web publishing. A browser with an HTML editor and some knowledge of HTML on the part of the human author are sufficient to create basic static Web documents. At a slightly advanced level, knowledge or Perl, Java and scripting languages such as JavaScript can facilitate the creation of dynamic Web content.

The beauty of online documents is that they can use the functionality afforded by hypertext to contain "hyperlinks" to other Web documents. The term "hypertext" refers to the organization of information units into connected associations that a user can choose to make. A "hyperlink", "link", or "hypertext link" is a specific instance of such an association. The extensive use of Hypertext and Hyperlinks in Online documents has been a driving force behind the proposed solution of the World Wide Web, which is, after all, nothing more than an enormous amount of information content connected by an enormous number of hypertext links. This document comprises of

1) Problem Description
2) Proposed Solution
3) Overview System Architecture
4) Detailed Description of the System Architecture
5) Conclusion

Problem Description

With rapid growth in the number of hyperlink containing documents on the Web, however, new, previously unforeseen, problems have emerged on a large scale. In particular, the problem of keeping hyperlinks fresh has been a topic of active research for the past several years.

Typically browsers show a "404 - Not Found" error when the server that hosts the site cannot find the HTML document at the end of a hyperlinked URL. Although, this may often be a simple case of a miss-typed URL in the referring document, it may also indicate that the document no longer exists on the host server. If either case, the presence of such an error requires that the author of the referring Web document either manually delete the hyperlink (if the destination page no longer exists or is forbidden), or update the hyperlink (if the destination page has simply been relocated to a new URL, or if the old URL was simply typed incorrectly to begin with).

The problem we seek to address is that with the rapid increase in the number of hyperlink containing Web documents, and given the highly dynamic and ever changing nature of the World Wide Web, there is the compelling need for a technology that will facilitate the automatic maintenance and update of hyperlinks embedded within Web pages. Through our proposed solution,...