A Topic Segmentation of Texts Based on Semantic Domains Olivier Ferret, Brigitte Grau
A Topic Segmentation of Texts based on Semantic Domains Olivier Ferret, Brigitte Grau To cite this version: Olivier Ferret, Brigitte Grau. A Topic Segmentation of Texts based on Semantic Domains. ECAI 2000, Proceedings of the 14th European Conference on Artificial Intelligence, 2000, Berlin, Germany. pp.426–430. hal-02458243 HAL Id: hal-02458243 https://hal.archives-ouvertes.fr/hal-02458243 Submitted on 28 Jan 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. A Topic Segmentation of Texts based on Semantic Domains Olivier Ferret1 and Brigitte Grau1 Abstract. Thematic analysis is essential for many Natural Language Proc- 2 OVERVIEW OF THE SEGAPSITH SYSTEM essing (NLP) applications, such as text summarization or information extraction. It is a two-dimensional process that has both to delimit the SEGAPSITH has two main components: a module that learns thematic segments of a text and to identify the topic of each of them. The semantic domains from texts and a topic segmentation module that system we present possesses these two characteristics. Based on the use of relies on these domains. Semantic domains are topic representa- semantic domains, it is able to structure narrative texts into adjacent the- tions made of weighted words.
[Show full text]