TEXTE: Exploration et exploitation de donnees textuelles
Nous développons des modèles et des outils pour analyse automatique, syntaxique et sémantique, du langage naturel ainsi que pour la constitution des ressources nécessaires.
Membres
Permanents
- Alain Joubert, Maître de Conférences UM
- Mathieu Lafourcade, Maître de Conférences UM
- Richard Moot, Chargé de Recherche CNRS
- Violaine Prince, Professeur UM
- Jean-Philippe Prost, Maître de Conférences UM
- Christian Retore, Professeur UM
Non permanents
- Jimmy Benoits, Doctorant
- Davide Catta, CDD Ingénieur-Technicien
- Nadia Clairet, Doctorant
- Kévin Cousot, Doctorant
- Mehdi Mirzapour, Doctorant
- Noémie-Fleur Sandillon-Rezer, CDD Ingénieur-Technicien
Thématiques de Recherche
L’équipe TEXTE développe des méthodes, des outils et des ressources pour le traitement automatique du langage naturel, surtout écrit. Ces travaux portent plus particulièrement sur sa syntaxe et sur sa sémantique aussi bien logique que lexicale. Nous utilisons plutôt des méthodes symboliques, le plus souvent logiques, d'où notre rattachement au pôle Intelligence artificielle. Bien qu'elles soient toutes reliées entre elles, distinguons dans Texte les activités suivantes:
- Construction, acquisition de ressources pour le traitement automatique des langues (lexique, grammaire)
- Analyse automatique de la syntaxe et de la sémantique du langage naturel.
Ces travaux nécessitent des recherches fondamentales, souvent fédérées par la logique:
- Programmation logique par contraintes pour la syntaxe guidée par les modèles
- Analyse syntaxique et sémantique en théorie des types.
- Règles d'inférence dans un réseau lexical.
- Représentation des connaissances.
D'autres méthodes sont aussi utilisées: jeux sérieux collaboratifs, algorithmique distribuée sur des graphes (fourmis), algèbre linéaire (vecteurs de mots), statistiques (suppression du bruit, étiquetage grammatical).
Publications depuis 2013 - Evaluation 2019
Articles de revues internationales
2017
- Combining logical and distributional methods in type-logical grammarsJournal of Language Modelling, Institute of Computer Science, Polish Academy of Sciences, Poland, In press.
2016
- Quantification in Ordinary Language and Proof TheoryMichele Abrusci, Fabio Pasquali, Christian RetoréPhilosophia Scientiae, Paris; Editions Kime; [2014], 2016, pp.185-205.
- Conditions d’assertion de "chaque" et de "tout" et règles de déduction du quantificateur universelAlda Mari, Christian RetoréTravaux de Linguistique : Revue Internationale de Linguistique Française, De Boeck Université, 2016, 72, pp.89-106.
2015
- Recognition of logical units in log filesHassan Saneifar, Stéphane Bonniol, Pascal Poncelet, Mathieu RocheIntelligent Data Analysis, IOS Press, 2015, 19 (2), pp.431-448.
- Software understanding: Automatic classification of software identifiersPattaraporn Warintarawej, Anne Laurent, Marianne Huchard, Mathieu Lafourcade, Pierre PompidorIntelligent Data Analysis, IOS Press, 2015, 19 (4), pp.761-778.
2014
- From Logical to Distributional ModelsAnne PrellerElectronic Proceedings in Theoretical Computer Science, EPTCS, 2014, 171, pp.113-131.
- Deverbal semantics and the Montagovian generative lexicon ΛTynLivy-Maria Real-Coelho, Christian RetoréJournal of Logic, Language and Information, Springer Verlag, 2014, 23 (3), pp.347-366.
- Partially Commutative Linear Logic and Lambek Caculus with Product: Natural Deduction, Normalisation, Subformula PropertyMaxime Amblard, Christian RetoréIfColog Journal of Logics and their Applications (FLAP), College Publications, 2014, 1 (1), pp.53-94.
- A natural framework for natural language semantics: many sorted logic and Hilbert operators in type theoryChristian RetoréBulletin of Symbolic Logic, Association for Symbolic Logic, 2014, 20 (2), pp.241-241.
- Category theory, logic and formal linguistics: some connections, old and newJean Gillibert, Christian RetoréJournal of Applied Logic, Elsevier, 2014, 12 (1), pp.1-13.
- Natural Language Semantics in Biproduct Dagger CategoriesAnne PrellerJournal of Applied Logic, Elsevier, 2014, 12, pp.88-108.
- How can catchy titles be generated without loss of informativeness?Cédric Lopez, Violaine Prince, Mathieu RocheExpert Systems with Applications, Elsevier, 2014, 41 (4), pp.1051-1062.
- Are opinions expressed in land-use planning documents?Eric Kergosien, Bernard Laval, Mathieu Roche, Maguelonne TeisseireInternational Journal of Geographical Information Science, Taylor & Francis, 2014, 28 (4), pp.739-762.
- How to Combine Text-Mining Methods to Validate Induced Verb-Object Relations?Nicolas Béchet, Jacques Chauché, Violaine Prince, Mathieu RocheComputer Science and Information Systems, ComSIS Consortium, 2014, 11 (1), pp.133-155.
2013
- Can Mammographic Assessments Lead to Consider Density as a Risk Factor for Breast Cancer?Catherine Colin, Violaine Prince, Pierre-Jean ValetteEuropean Journal of Radiology, Elsevier, 2013, 82, pp.404-411.
- Sud4science, de l'acquisition d'un grand corpus de SMS en français à l'analyse de l'écriture SMSRachel Panckhurst, Catherine Détrie, Cédric Lopez, Claudine Moïse, Mathieu Roche, Bertrand VerineEpisteme, Cambridge University Press (CUP), 2013, Communication électronique et écritures numériques, pp.107-138.
Communications internationales
2017
- An Empirical Study for a Machine Aided Translation of French Prepositions '` a', 'de' and 'en' into EnglishViolaine Prince8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Nov 2017, Poznan, Poland.
- Ontolex JeuxDeMots and Its Alignment to the Linguistic Linked Open Data CloudAndon Tchechmedjiev, Théophile Mandon, Mathieu Lafourcade, Anne Laurent, Konstantin TodorovISWC: International Semantic Web Conference, Oct 2017, Vienne, Austria. 16th International Semantic Web Conference, LNCS (10587), pp.678-693, 2017.
- Ambiguss, a game for building a Sense Annotated Corpus for FrenchMathieu Lafourcade, Nathalie Le BrunIWCS: International Conference on Computational Semantics, Sep 2017, Montpellier, France. 12th International Conference on Computational Semantics, 2017.
- Explicative Path Finding in a Semantic NetworkKévin Cousot, Mathieu LafourcadeIWCS: International Conference on Computational Semantics, Sep 2017, Montpellier, France. 12th International Conference on Computational Semantics, 2017.
- Identifying Polysemous Words and Inferring Sense Glosses in a Semantic NetworkMaxime Chapuis, Mathieu LafourcadeIWCS: International Conference on Computational Semantics, Sep 2017, Montpellier, France. 12th International Conference on Computational Semantics, 2017.
- If mice were reptiles, then reptiles could be mammals or How to detect errors in the JeuxDeMots lexical network?Mathieu Lafourcade, Alain Joubert, Nathalie Le BrunRANLP: Recent Advances in Natural Language Processing, Sep 2017, Varna, Bulgaria. International Conference on Recent Advances in Natural Language Processing, 2017.
- Towards the Automatic Detection of Nutritional Incompatibilities Based on Recipe TitlesNadia Clairet, Mathieu LafourcadeAndreas Holzinger; Peter Kieseberg; A Min Tjoa; Edgar Weippl. 1st International Cross-Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE), Aug 2017, Reggio, Italy. Springer International Publishing, Lecture Notes in Computer Science, LNCS-10410, pp.346-366, 2017, Machine Learning and Knowledge Extraction.
- Parcourir, reconnaître et réfléchir. Combinaison de méthodes légères pour l'extraction de relations sémantiques.Mathieu Lafourcade, Nathalie Le BrunTALN: Traitement Automatique des Langues, Jun 2017, Orléans, France. 24rd French Conference on Natural Language Processing, 2017.
2016
- Compilation de grammaire de propriétés pour l'analyse syntaxique par optimisation de contraintesJean-Philippe Prost, Remi Coletta, Christophe LecoutreTALN: Traitement Automatique des Langues Naturelles, Jul 2016, Paris, France. 23ème Conférence sur le Traitement Automatique des Langues Naturelles, 2016.
- Patrons sémantiques pour l'extraction de relations entre termes - Application aux comptes rendus radiologiquesLionel Ramadier, Mathieu LafourcadeTALN 2016, Jul 2016, Paris, France. Actes de la conférence conjointe JEP-TALN-RECITAL 2016, jep-taln2016.
- Construire un lexique de sentiments par crowdsourcing et propagationMathieu Lafourcade, Nathalie Le Brun, Alain JoubertTALN: Traitement Automatique des Langues Naturelles, Jul 2016, Paris, France. 5ème édition conjointe de la conférence JEP-TALN-RECITAL 23e conférence sur le Traitement Automatique des Langues Naturelles (TALN), 2016.
- Mixing Crowdsourcing and Graph Propagation to Build a Sentiment LexiconMathieu Lafourcade, Nathalie Le Brun, Alain JoubertFeelings are contagious. NLDB: Natural Language to Information Systems, Jun 2016, Manchester, United Kingdom. 21st International Conference on Applications of Natural Language to Information Systems, LNCS (9612), pp.258-266, 2016.
- Découverte des patrons de connaissance grâce à la modélisation sémantique des phrases d'instructionsNadia Clairet, Sylvie Despres, Mathieu LafourcadeTOTh, Jun 2016, Chambéry, France. 10ème édition de la Conférence TOTh, 2016, « Tournant linguistique et renouveau conceptuel ».
- Using Constraints on a general Knowledge lexical networK for domain-specific semantic relation extraction and modelingNadia Bebeshina-Clairet, Lionel Ramadier, Mathieu LafourcadeDialogue 2016, Jun 2016, Moscou, Russia. 22nd International Conference on Computational Linguistics and Intellectual Technologies, 15 (22), 2016, Computational Linguistics and Intellectual Technologies.
- Semantic RelationExtraction with Semantic Patterns: Experiment on Radiology ReportMathieu Lafourcade, Lionel RamadierLREC 2016 Conference on Language Resources and Evaluation, May 2016, Portorož, Slovenia. 10th, LREC 2016 Proceedings.
2015
- “Chaque vin a sa lie." versus “Toute nuit a un jour." --- does the difference in the human processing of " chaque" and " tout" match the difference between the proof rules for conjunction and quantification?Alda Mari, Christian Retoré(In)coherence of Discourse, Dec 2015, Nancy, France. (In)coherence of Discourse 3, 2015.
- A Case Study of Copredication over a Deverbal that Reconciles Empirical Data with Computational SemanticsLivy Real, Christian RetoréEric McReady. LENLS12: Logic and Engineering of Natural Language Semantics 12, Nov 2015, Tokyo, Japan. ISBN: 978-4-915905-68-1, 2015.
- Are Books Events? Ontological Inclusions as Coercive Sub-Typing, Lexical Transfers as EntailmentBruno Mery, Christian RetoréEric McReady. LENLS12: Logic and Engineering of Natural Language Semantics 12, Nov 2015, Tokyo, Japan. ISBN 978-4-915905-68-1, pp.74-87.
- Medical Imaging Report Indexing: Enrichment of Index through an Algorithm of Spreading over a Lexico-semantic NetworkMathieu Lafourcade, Lionel RamadierRANLP: Recent Advances in Natural Language Processing, Sep 2015, Hissar, Bulgaria. 2015.
- Typed Hilbert Operators for the Lexical Semantics of Singular and Plural Determiner PhrasesBruno Mery, Richard Moot, Christian RetoréEpsilon: Hilbert’s Epsilon and Tau in Logic, Informatics and Linguistics, Aug 2015, Montpellier, France. 2015.
- Type Theories and Lexical Networks: Using Serious Games as the Basis for Multi-Sorted Typed SystemsStergios Chatzikyriakidis, Mathieu Lafourcade, Lionel Ramadier, Manel ZarroukESSLLI: European Summer School in Logic, Language and Information, Aug 2015, Barcelona, Spain. 2015.
- Vous aimez ?...ou pas ? LikeIt, un jeu pour construire une ressource lexicale de polarité.Mathieu Lafourcade, Nathalie Le Brun, Alain JoubertTALN: Traitement Automatique des Langues Naturelles, Jun 2015, Caen, France. 22e conférence sur le Traitement Automatique des Langues Naturelles, 2015.
- Quantifier scope: a formal and experimental studyArthur Capelier-Mourguy, Philippe Blache, Christian Retoré, Laurent PrevotCJC-SC: Colloque des Jeunes Chercheurs en Sciences Cognitives, Jun 2015, Compiègne, France. 2015.
2014
- Computing the Semantics of Plurals and Massive Entities Using Many-Sorted TypesBruno Mery, Richard Moot, Christian RetoréKoji Mineshima. LENLS: Logic and Engineering of Natural Language Semantics, Nov 2014, Kanagawa, Japan. Keio University Press, JSAI-isAI 2014 Workshops, LENLS, JURISIN, and GABA, Kanagawa, Japan, October 27-28, 2014, Revised Selected Papers The Eleventh International Workshop of Logic and Engineering of Natural Language Semantics 11 (LENLS11), LNCS (9067), pp.144-159, 2015, New Frontiers in Artificial Intelligence.
- From NL Preference Expressions to Comparative Preference Statements: A Preliminary Study in Eliciting Preferences for Customised Decision Support.Souhila Kaci, Namrata Patel, Violaine PrinceICTAI: International Conference on Tools with Artificial Intelligence, Nov 2014, Limassol, Cyprus. 26th International Conference on Tools with Artificial Intelligence, pp.591-598, 2014.
- Mining Tweet Data - Statistic and semantic information for political tweet classificationGuillaume Tisserant, Violaine Prince, Mathieu RocheKDIR: Knowledge Discovery and Information Retrieval, Oct 2014, Rome, Italy. KDIR'14: International Conference on Knowledge Discovery and Information Retrieval, pp.523-529, 2014, Text-Mining Session.
- Typed Hilbert Epsilon Operators and the Semantics of Determiner Phrases (Invited Lecture)Christian RetoréGlyn Morrill; Frank Richter; Rainer Osswald; Reinhard Muskens. FG: Formal Grammar, Aug 2014, Tübingen, Germany. Springer, The 19th Conference on Formal Grammar will be held from August 16th to August 17th, 2014, in conjunction with the 26th European Summer School in Logic, Language and Information (ESSLLI 2014) in Tübingen, Germany., 8612, pp.15-33, 2014, LNCS.
- Les couleurs des gensMathieu Lafourcade, Nathalie Le Brun, Virginie ZampaTALN: Traitement Automatique des Langues Naturelles, Jul 2014, Marseille, France. 21ème conférence sur le Traitement Automatique des Langues Naturelles, 2014.
- Jugement exact de grammaticalité d'arbre syntaxique probableJean-Philippe ProstTALN: Traitement Automatique des Langues Naturelles, Jul 2014, Marseille, France. Actes de la 21ème conférence sur le Traitement Automatique des Langues Naturelles (TALN'2014), 2014.
- Crowdsourcing Word-Color AssociationsMathieu Lafourcade, Nathalie Le Brun, Virginie ZampaMétais E.; Roche M.; Teisseire M. NLDB: Natural Language in the Database and Information Systems, Jun 2014, Montpellier, France. Springer, Cham, 19th International Conference on Applications of Natural Language to Information Systems, LNCS (8455), pp.39-44, 2014.
- Propa-L: a Semantic Filtering Service from a Lexical Network Created using Games With A PurposeMathieu Lafourcade, Karën FortInternational Conference on Language Resources and Evaluation (LREC), May 2014, Reykjavik, Iceland. 2014.
- From Natural Language to RDF Graphs with PregroupsAntonin Delpeuch, Anne PrellerEACL'2014: 14th Conference of the European Chapter of the Association for Computational Linguistics, Apr 2014, Gothenburg, Sweden. EACL, pp.55-62, 2014.
- Spreading Relation Annotations in a Lexical Semantic Network Applied to RadiologyLionel Ramadier, Manel Zarrouk, Mathieu Lafourcade, Antoine MicheauCICLing: Computational Linguistics and Intelligent Text Processing, Apr 2014, Kathmandu, Nepal. 15th International Conference, CICLing 2014, Kathmandu, Nepal, April 6-12, 2014, Proceedings, Part I, LNCS (8403), pp.40-51, 2014, Computational Linguistics and Intelligent Text Processing.
- Vectorisation paramétrée des données textuellesCélia Da Costa Pereira, Mathieu Lafourcade, Patrick Lloret, Cédric Lopez, Mathieu RocheEGC: Extraction et Gestion des Connaissances, Jan 2014, Rennes, France. 14èmes Journées Internationales Francophones sur l’Extraction et la Gestion des Connaissances, RNTI-E-26, pp.593-596, 2014.
2013
- How to extract unit of measure in scientific documents?Soumia Lilia Berrahou, Patrice Buche, Juliette Dibie, Mathieu RocheKDIR: Knowledge Discovery and Information Retrieval, Sep 2013, Vilamoura, Portugal. Springer, 5th International Conference on Knowledge Discovery and Information Retrieval, pp.454-459, 2013.
- From Functional to Distributional ModelsAnne PrellerQuantum Physics and Logic 2013, Jul 2013, Barcelona, Spain. pp.17, 2013.
- GenDesc: A Partial Generalization of Linguistic Features For Text ClassificationGuillaume Tisserant, Violaine Prince, Mathieu RocheNLDB'2013: International Conference on Applications of Natural Language to Information Systems, Jun 2013, United Kingdom. pp.6, 2013.
- Text2Geo: from textual data to geospatial informationSabiha Tahrat, Eric Kergosien, Sandra Bringay, Mathieu Roche, Maguelonne TeisseireWIMS: Web Intelligence, Mining and Semantics, Jun 2013, Madrid, Spain. 13th International Conference on Web Intelligence, Mining and Semantics, 2013.
- Inference and Reconciliation in a Crowdsourced Lexical-Semantic NetworkManel Zarrouk, Mathieu Lafourcade, Alain JoubertCICLING: International Conference on Intelligent Text Processing and Computational Linguistics, Mar 2013, Samos, Greece. 14th International Conference on Intelligent Text Processing and Computational Linguistics March 24–30, 2013. University of the Aegean, Samos, Greece, 2013.
- Approaches of anonymisation of an SMS corpusNamrata Patel, Pierre Accorsi, Diana Inkpen, Cédric Lopez, Mathieu RocheCICLing: Conference on Intelligent Text Processing and Computational Linguistics, Mar 2013, Samos, Greece. Springer-Verlag, 14th International Conference on Intelligent Text Processing and Computational Linguistics, LNCS (7816), pp.77-88, 2013.
Dernière mise à jour le 29/03/2018
Fiche-équipe TEXTE
Télécharger la fiche-équipe TEXTE du rapport d'activité 2008-2013 :