Towards a Digital Ecosystem: NLP. Corpus infrastructure. Methods for Retrieving Texts and Computing Text Similarities (8 articles)


  • Methods for the detection of intertexts and text reuse, manual (e.g. crowd-sourcing) or automatic (e.g. algorithms);

  • Infrastructure for the preservation of digital texts and quotations between different text passages;

  • Linguistic preprocessing and data normalisation, such as lemmatisation of historical languages, root stemming, normalisation of variants, etc.


Managing different types of text re-uses (3 articles)


This part focuses on the conceptual definitions, the modelling of the unstable idea of “quotation” and the XML-TEI encoding to implement for its characterization.


Visualisation of intertextuality and text reuse (3 articles)

Project presentations (8 articles)

Digital libraries and virtual exhibitions (2 articles)

Data deluge: which skills for wich data? (3 articles)

Project (2 articles)

A position paper describes goals of a specific project. Sponsorship is required. A fine description of all packages is useful to understand complementariy of each contribution in the framework of the project.

HistoInformatics (0 articles)

Digital humanities in languages (7 articles)

Sciences of Antiquity and digital humanities (1 article)

Editors: Julien Cavero ; Marie-Laure Massot