Niko Partanen ; Jack Rueter - Dialect Cartography of Erzya and Moksha Languages: Digitized Historical Sources and Evaluation of the Contemporary Data

jdmdh:16439 - Journal of Data Mining & Digital Humanities, November 13, 2025, NLP4DH - https://doi.org/10.46298/jdmdh.16439
Dialect Cartography of Erzya and Moksha Languages: Digitized Historical Sources and Evaluation of the Contemporary Data

Authors: Partanen, Niko 1; Rueter, Jack 1

  • 1 University of Helsinki

This study investigates how a recent map of Uralic languages, which also covers the Erzya and Moksha languages in detail, corresponds to digitized historical sources. We discuss our view of linguistic cartography more generally, and especially in the context of Uralic languages, and address various difficulties in defining the boundaries of speaker areas and in choosing which settlements should be included in the traditional or contemporary speech communities. As points of comparison we use the historical data of Heikki Paasonen, which we believe is a highly reliable indicator of at least some areas that should be included in the traditional distributions of these languages. This data is contrasted with the contemporary language maps.


Volume: NLP4DH
Published on: November 13, 2025
Accepted on: September 16, 2025
Submitted on: August 31, 2025
Keywords: Uralic languages, Erzya, Moksha, language maps, dialectology

Files

Name: Dialect_Cartography_of_Erzya_and_Moksha_Languages.pdf
Size: 1.92 MB
md5: 185ce984181fd5797b054e7c51d5b9ea

Publications

Is derived from
, & Partanen, N. (2020). rueter/Mordvin-Varieties: comparative-mordvin-database (Version v0.6). Zenodo. 10.5281/ZENODO.3627624 1
Continues
Rueter, J., & Partanen, N. (2025). Restructuring and visualising dialect dictionary data: Report on Erzya and Moksha materials. In Proceedings of the 5th International Conference on Natural Language Processing for Digital Humanities (pp. 41-47). Association for Computational Linguistics. 10.18653/v1/2025.nlp4dh-1.5 1
  • 1 Zenodo

Datasets

Is based on
Jack Rueter, & Niko Partanen. (2025). rueter/Mordvin-Varieties: Comparative Mordvin Database (Version v0.7). Zenodo. 10.5281/ZENODO.17464539
