_Publications



2020

  • Tanguy, Ludovic, Fabre, Cécile, Bard, Yoann (2020). Impact de la structure logique des documents sur les modèles distributionnels : expérimentations sur le corpus TALN. In Actes, 27ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN), Nancy, France. pp.122-135. [pdf]
  • Martin Laville, Amir Hazem, Emmanuel Morin, and Langlais Philippe (2020). Data Selection for Bilingual Lexicon Inductionfrom Specialized Comparable Corpora. In Proceedings of the 28th International Conference on Computational Linguistics (COLING). Barcelona, Spain, 2020
  • Yizhe Wang, Béatrice Daille, Nabil Hathout, (2020). A study of semantic projection from single word terms to multi-word terms in the environment domain. In Proceedings of the 6th International Workshop on Computational Terminology, 50--54, Marseille, France. [pdf]
  • Hicham El Boukkouri, Olivier Ferret, Thomas Lavergne, Hiroshi Noji, Pierre Zweigenbaum, Junichi Tsujii (2020). CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters. arXiv preprint arXiv:2010.10392.[pdf] [code]
  • Hicham El Boukkouri (2020). Ré-entraîner ou entraîner soi-même ? Stratégies de pré-entraînement de BERT en domaine médical . In Actes, 22ème Rencontres des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL), 29–42, Nancy.[pdf] [code]
  • Pauline Brunet, Olivier Ferret and Ludovic Tanguy (2020). Which Dependency Parser to Use for Distributional Semantics in a Specialized Domain? In Proceedings, 6th International Workshop on Computational Terminology (COMPUTERM), 26–36, Marseille, France.[pdf]
  • Ludovic Tanguy, Pauline Brunet and Olivier Ferret (2020). Extrinsic Evaluation of French Dependency Parsers on a Specialized Corpus: Comparison of Distributional Thesauri. In Proceedings, 12th Language Resources and Evaluation Conference (LREC), 5822–5830, Marseille, France. [pdf]
  • Martin Laville, Amir Hazem and Emmanuel Morin(2020). TALN/LS2N Participation at the BUCC Shared Task: Bilingual Dictionary Induction from Comparable Corpora. In Proceedings, 13th Workshop on Building and Using Comparable Corpora (BUCC), Marseille, France.
  • Martin Laville, Mériéme Bouhandi, Emmanuel Morin and Philippe Langlais (2020). Seed Lexicons, Word Representations, Mapping Procedure, and Evaluation Lists: What Matters in Bilingual Lexicon Induction from Comparable Corpora? In Proceedings, 33rd Canadian Conference on Artificial Intelligence (CAIAC), Quebec, Canada.

2019

  • Hicham El Boukkouri, Olivier Ferret, Thomas Lavergne and Pierre Zweigenbaum (2019). Embedding Strategies for Specialized Domains: Application to Clinical Entity Recognition. In Proceedings, 57th Conference of the Association for Computational Linguistics (ACL) student research workshop, 295–301, Florence, Italy. [pdf] [code]
  • Mérième Bouhandi (2019). Apport des termes complexes pour enrichir l’analyse distributionnelle en domaine spécialisé. In Actes, 21ème Rencontres des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL), 473–486 Toulouse, France. [pdf]
  • Ludovic Tanguy, Pauline Brunet et Olivier Ferret (2019). Comparaison qualitative et extrinsèque d'analyseurs syntaxiques du français : confrontation de modèles distributionnels sur un corpus spécialisé. In Actes, 26ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN), 39–53, Toulouse, France. [pdf]
  • Mohamadou Ba, Robert Bossy, Pauline Brunet, Louise Deléger, Hicham El Boukkouri, Olivier Ferret, Arnaud Ferré, Thomas Lavergne, Claire Nédellec, & Pierre Zweigenbaum (2019). Combining string-based and embeddings-based methods for medical concept normalization: LIMSI-CEA-INRA@n2c2 2019. In Özlem Uzuner, Yanshan Wang, Feichen Shen, and Anna Rumshisky, editors, 2019 n2c2/OHNLP Shared Task on Challenges in Natural Language Processing for Clinical Data, 2019.

2018

  • Olivier Ferret (2018). Using pseudo-senses for improving the extraction of synonyms from word embeddings. In Proceedings, 56th Annual Meeting of the Association for Computational Linguistics : short paper session (ACL), 351–357, Melbourne, Australia.[pdf]
  • Olivier Ferret (2018). Des pseudo-sens pour améliorer l'extraction de synonymes à partir de plongements lexicaux. In Actes, 25e Conférence sur le Traitement Automatique des Langues Naturelles (CORIA-TALN-RJC), session articles courts, 365–373, Rennes, France. [pdf]
  • Amir Hazem and Emmanuel Morin (2018). Leveraging Meta-Embeddings for Bilingual Lexicon Extraction from Specialized Comparable Corpora. In Proceedings, 27th International Conference on Computational Linguistics (COLING), 937–949, Santa Fe, New Mexico, USA. [pdf]