Browse Prior Art Database

Ambiguity Memory

IP.com Disclosure Number: IPCOM000105516D
Original Publication Date: 1993-Aug-01
Included in the Prior Art Database: 2005-Mar-20
Document File: 2 page(s) / 71K

Publishing Venue

IBM

Related People

Lehmann, H: AUTHOR

Abstract

Proposed is a method which improves the interpretation and translation of ambiguous words in systems understanding and translating texts.

This text was extracted from an ASCII text file.
This is the abbreviated version, containing approximately 53% of the total text.

Ambiguity Memory

      Proposed is a method which improves the interpretation and
translation of ambiguous words in systems understanding and
translating texts.

      The basic idea is to store ambiguous words of a text along with
suitable additional information in an ambiguity memory so that they
may be referred to whenever they reappear in the text.  This also
allows translated texts to be automatically edited.

Assume there is an English text containing the sentences

     The ruler declared that taxes would be doubled.
     ...
     He seized the ruler.

In German:

     Der Herrscher erklaerte, dass die Steuern verdoppelt wuerden.
     ...
     Er ergriff den Herrscher.

Another text might contain the sentences

     He seized the ruler.
     ...
     He drew a line with the ruler.

In German:

     Er ergriff das Lineal.
     ...
     Er zog eine Linie mit dem Lineal.

      The problem is to translate the ambiguous word ruler correctly
in a specific context, which is done with the aid of the proposed
ambiguity memory.

      This ambiguity memory is a data structure that looks as
follows:

     ambiguitymemory(
      ambiguouswordinsourcelanguage,
      pos(sourcesentence,positioninsourcesentence),
      disambiguatingcontext,
      translation,
      pos(targetsentence,positionintargetsentence)).

An ambiguous word is a word for which there is more than one trans
lation.  A "disambiguating context" for a word is a (sentence)
context in which there is no doubt as to how the word is translated
correctly.  Such contexts may result from limiting the syntactic or
semantic com patibility (in the above examples: ruler as a subjective
of declared or as an instrument of drew).

      For each ambiguous word in a text, a term is stored in the
above- mentioned format.  When the word appears in the text for the
first time, in a context other than a disambiguating one, there are
four ways in which to proceed:

    a.  No translation is selected initially.
    b.  A translation is selected arbitrarily.
    c.  A selection is made by default.
    d.  All possible translations are stored and erased...