Niko Partanen ; Jack Rueter - Dialect Cartography of Erzya and Moksha Languages: Digitized Historical Sources and Evaluation of the Contemporary Data

jdmdh:16439 - Journal of Data Mining & Digital Humanities, 13 novembre 2025, NLP4DH - https://doi.org/10.46298/jdmdh.16439
Dialect Cartography of Erzya and Moksha Languages: Digitized Historical Sources and Evaluation of the Contemporary DataArticle

Auteurs : Partanen, Niko ORCID1; Rueter, Jack ORCID1

  • 1 University of Helsinki

This study investigates the correspondences between a recent map of Uralic languages that also covers the Erzya and Moksha languages in detail. We discuss our point of view in linguistic cartography more generally, but especially within the context of Uralic languages, and address various difficulties that can be recognized in defining the speaker area boundaries and choosing settlements that should be included in the traditional or contemporary speech communities. We use the historical data of Heikki Paasonen, which, we believe, is a highly reliable indicator of at least some areas that should be included in the traditional distributions of these languages as points of comparison. This data is contrasted with the contemporary language maps.


Volume : NLP4DH
Publié le : 13 novembre 2025
Accepté le : 16 septembre 2025
Soumis le : 31 août 2025
Mots-clés : Uralic languages, Erzya, Moksha, language maps, dialectology

Fichiers

Nom Taille
Dialect_Cartography_of_Erzya_and_Moksha_Languages.pdf
md5 : 185ce984181fd5797b054e7c51d5b9ea
1.92 MB

Publications

Est dérivé de
, & Partanen, N. (2020). rueter/Mordvin-Varieties: comparative-mordvin-database (Version v0.6). Zenodo. 10.5281/ZENODO.3627624 1
Continue
Rueter, J., & Partanen, N. (2025). Restructuring and visualising dialect dictionary data: Report on Erzya and Moksha materials. In Proceedings of the 5th International Conference on Natural Language Processing for Digital Humanities (pp. 41-47). Proceedings of the 5th International Conference on Natural Language Processing for Digital Humanities. Association for Computational Linguistics. 10.18653/v1/2025.nlp4dh-1.5 1
  • 1 Zenodo

Datasets

Est basé sur
Jack Rueter, & Niko Partanen. (2025). rueter/Mordvin-Varieties: Comparative Mordvin Database (Version v0.7). Zenodo. 10.5281/ZENODO.17464539