Natural Language Processing (NLP)

SYNTAX, SEMANTIC OF TEXT, LEXICAL SEMANTICS, ALGEBRAIC MODELS, VECTOR MODELS, TEXT MINING, MODELS OF DIALOGUE.

Team leader : This e-mail address is being protected from spambots. You need JavaScript enabled to view it

Deputy team leader : This e-mail address is being protected from spambots. You need JavaScript enabled to view it

Team goal :

  • development of models and tools for natural language processing subjected to evaluation by known applications, preferably at national and international competitions (such as campaigns TREC, MUC, DEFT, etc.)

Two theoretical areas of Natural Language Processing:

  • Syntax (models: Markov algorithms on trees, pre-groups of Lambek) ,
  • Semantics (semantic vector models, lexical networks), with some forays into pragmatics.

Application areas :

  • Generation of monolingual and multilingual lexical databases (thesis de D. Schwab, 2005),
  • Assistance to the terminology , thematic categorization of text (thesis A. Labadie, 2008),
  • Summary of texts by compression (thesis de M. Yousfi-Monod 2007),
  • Machine translation (thesis Johan Segura, since September 2009),
  • The automatic title of documents (thesis Cedric Lopez, since October 2009).

The team :

  • is composed of 12 members : 6 permanent staff, 2 associates, 2 phD Students, 1 post doctoral fellow, 1 doctoral associate.