Setting bounds in a homogeneous corpus: a methodological study applied to medieval literature Jean-Baptiste Camps, Florian Cafiero To cite this version: Jean-Baptiste Camps, Florian Cafiero. Setting bounds in a homogeneous corpus: a methodological study applied to medieval literature. Revue des Nouvelles Technologies de l’Information, Editions RNTI, 2013, SHS-1 (MASHS 2011/2012. Modèles et Apprentissages en Sciences Humaines et Sociales Rédacteurs invités : Mar), pp.55-84. halshs-00765651 HAL Id: halshs-00765651 https://halshs.archives-ouvertes.fr/halshs-00765651 Submitted on 15 Dec 2012 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Setting bounds in a homogeneous corpus: a methodological study applied to medieval literature∗ Jean–Baptiste Camps**, Florian Cafiero*** ** Laboratoire Etudes´ et ´editionde textes m´edi´evaux (EA 4349) Universit´eParis – Sorbonne 1, rue Victor Cousin 75005 Paris
[email protected] *** Ecole´ centrale de Paris Grande voie des Vignes 92295 Chˆatenay-Malabry CEDEX florian.cafi
[email protected] Abstract The authors present here an exploratory and unspecific method that does not necessitate any a priori on the data – or any heavy transformation such as lemma- tisation– that would have to be understood as a first step in the apprehension of a corpus.