EXIT: An Iterative System for Extracting Domain Terminology from Specialized Corpora

Mathieu Roche, Thomas Heitz, Oriane Matte-Tailliez, Yves Kodratoff


Abstract

This paper addresses the discovery of significant terminology in specialized texts. Our approach, based partly on statistical methods, extracts terms iteratively. At first, only binary terms are sought. The binary terms detected during this first phase are then incorporated into the corpus, and the process is repeated iteratively in order to detect very long terms, which often prove to be the most significant, as our experience in molecular biology has clearly shown.
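The iterative loop described above can be illustrated with a minimal sketch. It is not the EXIT implementation: raw adjacency counts stand in for the statistical measures, a binary term is assumed to be a pair of adjacent tokens, and merged terms are assumed to be written back into the corpus joined by a hyphen. The names `extract_terms`, `merge_pairs`, `min_count`, and `max_iterations` are hypothetical.

```python
from collections import Counter

JOINER = "-"  # assumed convention for marking a merged term in the corpus


def merge_pairs(tokens, pairs, merged_terms):
    """Rewrite one sentence, replacing each selected adjacent pair by a single token."""
    out, i = [], 0
    while i < len(tokens):
        if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) in pairs:
            term = tokens[i] + JOINER + tokens[i + 1]
            out.append(term)
            merged_terms.add(term)
            i += 2  # skip both members of the merged pair
        else:
            out.append(tokens[i])
            i += 1
    return out


def extract_terms(sentences, min_count=2, max_iterations=5):
    """Repeatedly detect frequent adjacent pairs and merge them into the corpus.

    Frequency thresholding is an assumption made for this sketch; any
    statistical association measure could be substituted here.
    """
    corpus = [list(s) for s in sentences]
    terms = set()
    for _ in range(max_iterations):
        counts = Counter(pair for sent in corpus for pair in zip(sent, sent[1:]))
        frequent = {pair for pair, n in counts.items() if n >= min_count}
        if not frequent:
            break  # no new binary term: the process has converged
        corpus = [merge_pairs(sent, frequent, terms) for sent in corpus]
    return corpus, terms
```

Because a term merged in one pass is treated as a single token in the next, a three-word term such as "tumor necrosis factor" can emerge in two passes: first `tumor-necrosis`, then `tumor-necrosis-factor`.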

Keywords

Terminology.