Jani Marjanen ; Jussi Kurunmäki ; Lidia Pivovarova ; Elaine Zosa
-
The expansion of isms, 1820-1917: Data-driven analysis of political language in digitized newspaper collections
Words with the suffix-ism are reductionist terms that help us navigate complex social issues by using a simple one-word label for them. On the one hand they are often associated with political ideologies, but on the other they are present in many other domains of language, especially culture, science, and religion. This has not always been the case. This paper studies isms in a historical record of digitized newspapers from 1820 to 1917 published in Finland to find out how the language of isms developed historically. We use diachronic word embeddings and affinity propagation clustering to trace how new isms entered the lexicon and how they relate to one another over time. We are able to show how they became more common and entered more and more domains. Still, the uses of isms as traditions for political action and thinking stand out in our analysis.
Mots-clés : isms,ideology,political language,diachronic word embeddings,affinity propagation clustering,[SHS.HIST]Humanities and Social Sciences/History,[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
Financement :
Source : OpenAIRE Graph
NewsEye: A Digital Investigator for Historical Newspapers; Financeur: European Commission; Code: 770299
Cross-Lingual Embeddings for Less-Represented Languages in European News Media; Financeur: European Commission; Code: 825153
Datasets
Est lié à
Hengchen, S., Ros, R., & Marjanen, J. (2019). A data-driven approach to the changing vocabulary of the ‘nation’ in English, Dutch, Swedish and Finnish newspapers, 1750-1950 (1–) [Dataset]. DataverseNL. 10.34894/AVBD7A1
Hengchen, S., Ros, R., & Marjanen, J. (2019). Models for "A data-driven approach to the changing vocabulary of the ’nation’ in English, Dutch, Swedish and Finnish newspapers, 1750-1950" (Version 1.0.0, 1–) [Dataset]. Zenodo. 10.5281/ZENODO.32706481
1 ScholeXplorer
Références bibliographiques
2 Documents citant cet article
Estelle Bunout, 2021, Grasping the Anti-Modern Discourse on Europe in the Swiss Digitised Press, or can Text Mining Generate a Research Corpus from an Article Collection?, Journal of Open Humanities Data, 7, 0, pp. 21, 10.5334/johd.37, https://doi.org/10.5334/johd.37.
Simon Hengchen;Ruben Ros;Jani Marjanen;Mikko Tolonen, 2021, A data-driven approach to studying changing vocabularies in historical newspaper collections, Digital Scholarship in the Humanities, 36, Supplement_2, pp. ii109-ii126, 10.1093/llc/fqab032, https://doi.org/10.1093/llc/fqab032.