Many natural language processing (NLP) platforms rely on dictionaries that list lexical variants of clinical concepts. For most languages other than English, the coverage of these dictionaries is low, which limits how well NLP can support products in international markets. This paper describes methods for mining translations of clinical terms from parallel (sentence-aligned) and comparable (not sentence-aligned) bilingual corpora. Combined with standard machine translation technology, these methods allow concept dictionaries to be extended automatically beyond English.

