The expansion of isms, 1820–1917: Data-driven analysis of political language in digitized newspaper collectionsArticleAuteurs : Jani Marjanen
1; Jussi Kurunmäki
2; Lidia Pivovarova
1; Elaine Zosa
1
0000-0002-3085-4862##NULL##0000-0002-0026-9902##0000-0003-2482-0663
Jani Marjanen;Jussi Kurunmäki;Lidia Pivovarova;Elaine Zosa
Words with the suffix -ism are reductionist terms that help us navigate complex social issues by using a simple one-word label for them. On the one hand they are often associated with political ideologies, but on the other they are present in many other domains of language, especially culture, science, and religion. This has not always been the case. This paper studies isms in a historical record of digitized newspapers from 1820 to 1917 published in Finland to find out how the language of isms developed historically. We use diachronic word embeddings and affinity propagation clustering to trace how new isms entered the lexicon and how they relate to one another over time. We are able to show how they became more common and entered more and more domains. Still, the uses of isms as traditions for political action and thinking stand out in our analysis.
Volume : HistoInformatique
Publié le : 18 décembre 2020
Accepté le : 11 septembre 2020
Soumis le : 26 février 2020
Mots-clés : [SHS.HIST]Humanities and Social Sciences/History, [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL], [en] isms, ideology, political language, diachronic word embeddings, affinity propagation clustering
Financement :
Source : OpenAIRE Graph- NewsEye: A Digital Investigator for Historical Newspapers; Financeur: European Commission; Code: 770299
- Cross-Lingual Embeddings for Less-Represented Languages in European News Media; Financeur: European Commission; Code: 825153