Timo Korkiakangas ; Matti Lassila - Visualizing linguistic variation in a network of Latin documents and scribes

jdmdh:4472 - Journal of Data Mining & Digital Humanities, 30 avril 2018, Numéro spécial sur le traitement assisté par ordinateur de l‘intertextualité dans les langues anciennes - https://doi.org/10.46298/jdmdh.4472
Visualizing linguistic variation in a network of Latin documents and scribesArticle

Auteurs : Timo Korkiakangas 1; Matti Lassila 2

This article explores whether and how network visualization can benefit philological and historical-linguistic study. This is illustrated with a corpus-based investigation of scribes' language use in a lemmatized and morphologically annotated corpus of documentary Latin (Late Latin Charter Treebank, LLCT2). We extract four continuous linguistic variables from LLCT2 and utilize a gradient colour palette in Gephi to visualize the variable values as node attributes in a trimodal network which consists of the documents, writers, and writing locations underlying the same corpus. We call this network the "LLCT2 network". The geographical coordinates of the location nodes form an approximate map, which allows for drawing geographical conclusions. The linguistic variables are examined both separately and as a sum variable, and the visualizations presented as static images and as interactive Sigma.js visualizations. The variables represent different domains of language competence of scribes who learnt written Latin practically as a second-language. The results show that the network visualization of linguistic features helps in observing patterns which support linguistic-philological argumentation and which risk passing unnoticed with traditional methods. However, the approach is subject to the same limitations as all visualization techniques: the human eye can only perceive a certain, relatively small amount of information at a time.


Volume : Numéro spécial sur le traitement assisté par ordinateur de l‘intertextualité dans les langues anciennes
Rubrique : Visualisation de l'intertextualité et de la réutilisation des textes
Publié le : 30 avril 2018
Accepté le : 29 avril 2018
Soumis le : 27 avril 2018
Mots-clés : network visualization, Latin linguistics, Early Middle Ages, philology, [ SHS.LANGUE ] Humanities and Social Sciences/Linguistics, [ SHS.STAT ] Humanities and Social Sciences/Methods and statistics, [ SHS.HIST ] Humanities and Social Sciences/History, [ SHS.CLASS ] Humanities and Social Sciences/Classical studies

Datasets

Est lié à
Korkiakangas, T., & Lassila, M. (2017). Visualizing Linguistic Variation In A Network Of Latin Documents And Scribes (1–) [Dataset]. Zenodo. 10.5281/ZENODO.1064692 1
Korkiakangas, T., & Lassila, M. (2017). Visualizing Linguistic Variation In A Network Of Latin Documents And Scribes (1–) [Dataset]. Zenodo. 10.5281/ZENODO.1064693 1
Korkiakangas, T. (2018). Late Latin Charter Treebank, Version 1 (Llct1) (1–) [Dataset]. Zenodo. 10.5281/ZENODO.1197357 1
  • 1 ScholeXplorer

1 Document citant cet article

Statistiques de consultation

Cette page a été consultée 2646 fois.
Le PDF de cet article a été téléchargé 24840 fois.