Jani Marjanen ; Jussi Kurunmäki ; Lidia Pivovarova ; Elaine Zosa - The expansion of isms, 1820-1917: Data-driven analysis of political language in digitized newspaper collections

jdmdh:6159 - Journal of Data Mining & Digital Humanities, 18 décembre 2020, HistoInformatique - https://doi.org/10.46298/jdmdh.6159
The expansion of isms, 1820-1917: Data-driven analysis of political language in digitized newspaper collectionsArticle

Auteurs : Jani Marjanen ORCID1; Jussi Kurunmäki 2; Lidia Pivovarova ORCID1; Elaine Zosa ORCID1

Words with the suffix-ism are reductionist terms that help us navigate complex social issues by using a simple one-word label for them. On the one hand they are often associated with political ideologies, but on the other they are present in many other domains of language, especially culture, science, and religion. This has not always been the case. This paper studies isms in a historical record of digitized newspapers from 1820 to 1917 published in Finland to find out how the language of isms developed historically. We use diachronic word embeddings and affinity propagation clustering to trace how new isms entered the lexicon and how they relate to one another over time. We are able to show how they became more common and entered more and more domains. Still, the uses of isms as traditions for political action and thinking stand out in our analysis.


Volume : HistoInformatique
Publié le : 18 décembre 2020
Accepté le : 11 septembre 2020
Soumis le : 26 février 2020
Mots-clés : isms,ideology,political language,diachronic word embeddings,affinity propagation clustering,[SHS.HIST]Humanities and Social Sciences/History,[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
Financement :
    Source : OpenAIRE Graph
  • NewsEye: A Digital Investigator for Historical Newspapers; Financeur: European Commission; Code: 770299
  • Cross-Lingual Embeddings for Less-Represented Languages in European News Media; Financeur: European Commission; Code: 825153

2 Documents citant cet article

Statistiques de consultation

Cette page a été consultée 2659 fois.
Le PDF de cet article a été téléchargé 1069 fois.