Browse Prior Art Database

A method to find the blanks where the texts are not displayed in the Web object

IP.com Disclosure Number: IPCOM000014061D
Original Publication Date: 2000-Dec-01
Included in the Prior Art Database: 2003-Jun-19
Document File: 2 page(s) / 107K

Publishing Venue

IBM

Abstract

Described is a method to find the blanks ("white space") where text is not displayed in a Web object by searching for text occupying specific coordinates from the end of each line in the Web DOM representation of the object. The method will also detect where new paragraphs begin. As shown in Figure 1 "White space" means places where text is not displayed in a Web object. In this context, a Web object is a thing which is defined by HTML tags, such as an image, form, input item, button, etc., that is displayed on a Web page with some associated text. The procedure is shown in Figure 2. The details of the procedure is explained as follows. [Step 1] As Figure 2 (a) is shown, starting from the right side, scan to the left for the specific coordinates of the character in the target line in order to find a white space at the right side in a Web object, and [Step 2] When the character is found, checking the HTML file (Figure 2 (b)) to judge where this character which is found in [Step 1] is used or it is the white space for the line. [Step 3] Calculate the coordinates of the white space using the two endpoints detected in [Step 1] and [Step 2] and the coordinates of the edges of the target of the line. This method is implemented as a JavaScript program embedded in the HTML. This method detects operation events by defining event handlers for each HTML element, and acquires the sizes, locations, and hierarchy of objects from the properties of the objects of Dynamic HTML. "D ove" is a symbol of Japanese peace. "H A TO is a w ord that

This text was extracted from a PDF file.
At least one non-text object (such as an image or picture) has been suppressed.
This is the abbreviated version, containing approximately 54% of the total text.

Page 1 of 2

A method to find the blanks where the texts are not displayed in the Web object

   Described is a method to find the blanks ("white space") where text is not displayed in a Web object by searching for text occupying specific coordinates from the end of each line in the Web DOM representation of the object. The method will also detect where new paragraphs begin. As shown in Figure 1 "White space" means places where text is not displayed in a Web object. In this context, a Web object is a thing which is defined by HTML tags, such as an image, form, input item, button, etc., that is displayed on a Web page with some associated text. The procedure is shown in Figure 2. The details of the procedure is explained as follows.

[Step 1] As Figure 2 (a) is shown, starting from the right side, scan to the left for the specific coordinates of the character in the target line in order to find a white space at the right side in a Web object, and [Step 2] When the character is found, checking the HTML file (Figure 2 (b)) to judge where this character which is found in [Step 1] is used or it is the white space for the line. [Step 3] Calculate the coordinates of the white space using the two endpoints detected in [Step 1] and [Step 2] and the coordinates of the edges of the target of the line. This method is implemented as a JavaScript program embedded in the HTML. This method detects operation events by defining event handlers for each HTML element, and acquires the sizes, locations...