Dismiss
InnovationQ will be updated on Sunday, Oct. 22, from 10am ET - noon. You may experience brief service interruptions during that time.
Browse Prior Art Database

Lexicographer's Tool Kit

IP.com Disclosure Number: IPCOM000109614D
Original Publication Date: 1992-Sep-01
Included in the Prior Art Database: 2005-Mar-24
Document File: 2 page(s) / 89K

Publishing Venue

IBM

Related People

Chodorow, MS: AUTHOR [+3]

Abstract

A series of programs are disclosed that manipulate lexical information found in published thesauri to assist lexicographers in revising, updating or expanding these thesauri. The programs are potentially useful computational tools because they turn the lexical information found in thesauri into networks of links among word-senses.

This text was extracted from an ASCII text file.
This is the abbreviated version, containing approximately 52% of the total text.

Lexicographer's Tool Kit

       A series of programs are disclosed that manipulate
lexical information found in published thesauri to assist
lexicographers in revising, updating or expanding these thesauri.
The programs are potentially useful computational tools because they
turn the lexical information found in thesauri into networks of links
among word-senses.

      Printed thesauri that are arranged alphabetically consist of
discrete entries.  Each entry contains a headword (W), separated into
senses (W1, W2, ...), and followed by synonym lists for each sense
(W1:  X,Y,...; W2: A,B,...).  The structure of the book prevents easy
access to other semantic links, found elsewhere in the thesaurus,
which may be of interest.  Specifically, it is difficult to find out
if synonym links are symmetric; that is, if the synonyms given for a
certain headword, W, are themselves headwords, and if they list W as
a synonym.  As a result of this difficulty, inconsistency and
incompleteness may remain undetected in the published thesaurus.

      When the thesaurus is available on-line, it can be treated as a
network of links among word-senses.  The programs proposed here
manipulate the network automatically to determine whether links are
symmetric or not, and to sort the different kinds of asymmetric
links.  The output generated can assist lexicographers in detecting
and correcting inconsistencies and oversights.
1. A program to identify and list terminal nodes - that is synonyms
that do not appear as headwords in the thesaurus.  The output of this
program has the following format:
     ...