Style Classification of Rabbinic Literature for Detection of Lost Midrash Tanhuma Material

Shlomo Tannor; Nachum Dershowitz; Moshe Lavee

doi:10.46298/jdmdh.11375

Shlomo Tannor ; Nachum Dershowitz ; Moshe Lavee - Style Classification of Rabbinic Literature for Detection of Lost Midrash Tanhuma Material

jdmdh:11375 - Journal of Data Mining & Digital Humanities, 13 août 2023, NLP4DH - https://doi.org/10.46298/jdmdh.11375

Style Classification of Rabbinic Literature for Detection of Lost Midrash Tanhuma MaterialArticle

Auteurs : Shlomo Tannor ¹; Nachum Dershowitz ¹; Moshe Lavee ²

Midrash collections are complex rabbinic works that consist of text in multiple languages, which evolved through long processes of unstable oral and written transmission. Determining the origin of a given passage in such a compilation is not always straightforward and is often a matter of dispute among scholars, yet it is essential for scholars' understanding of the passage and its relationship to other texts in the rabbinic corpus. To help solve this problem, we propose a system for classification of rabbinic literature based on its style, leveraging recent advances in natural language processing for Hebrew texts. Additionally, we demonstrate how this method can be applied to uncover lost material from a specific midrash genre, Tan\d{h}uma-Yelammedenu, that has been preserved in later anthologies.

https://doi.org/10.46298/jdmdh.11375

Source : arXiv.org:2211.09710

Volume : NLP4DH

Publié le : 13 août 2023

Accepté le : 6 juillet 2023

Soumis le : 25 mai 2023

Mots-clés : Computer Science - Computation and Language, Computer Science - Machine Learning

Licence : arXiv.org - Non-exclusive license to distribute

Références bibliographiques

Partager et exporter

Statistiques de consultation

Cette page a été consultée 1507 fois.

Le PDF de cet article a été téléchargé 657 fois.