Towards a Digital Ecosystem: NLP. Corpus infrastructure. Methods for Retrieving Texts and Computing Text Similarities (8 articles )


  • Methods for the detection of intertexts and text reuse, manual (e.g. crowd-sourcing) or automatic (e.g. algorithms);

  • Infrastructure for the preservation of digital texts and quotations between different text passages;

  • Linguistic preprocessing and data normalisation, such as lemmatisation of historical languages, root stemming, normalisation of variants, etc.


Managing different types of text re-uses (3 articles )


This part focuses on the conceptual definitions, the modelling of the unstable idea of “quotation” and the XML-TEI encoding to implement for its characterization.


Visualisation of intertextuality and text reuse (3 articles )

Project presentations (8 articles )

Digital libraries and virtual exhibitions (2 articles )

Data deluge: which skills for wich data? (3 articles )

Project (2 articles )

A position paper describes goals of a specific project. Sponsorship is required. A fine description of all packages is useful to understand complementariy of each contribution in the framework of the project.

HistoInformatics (7 articles )

Digital humanities in languages (6 articles )

Sciences of Antiquity and digital humanities (1 article )

Editors: Julien Cavero ; Marie-Laure Massot