Using ChatGPT and Other AI Engines to Vocalize Medieval Hebrew

Gordon, Nehemia; Freeman, Natan

doi:10.46298/jdmdh.11182

Nehemia Gordon ; Natan Freeman - Using ChatGPT and Other AI Engines to Vocalize Medieval Hebrew

jdmdh:11182 - Journal of Data Mining & Digital Humanities, 11 décembre 2024 - https://doi.org/10.46298/jdmdh.11182

Using ChatGPT and Other AI Engines to Vocalize Medieval HebrewArticle

Auteurs : Gordon, Nehemia ¹; Freeman, Natan ²

1 Institute for Hebrew Bible Manuscript Research
2 Independent researcher

Hebrew is usually written without vowel points, making it challenging for some readers to decipher. This is especially true of medieval Hebrew, which can have nonstandard grammar and orthography. This paper tested four artificial intelligence (AI) tools by asking them to add vowel points to an unpublished medieval Hebrew translation of the Lord’s Prayer. The vocalization tools tested were OpenAI’s ChatGPT-3.5 and ChatGPT-4, Pellaworks’ DoItInHebrew, and Dicta’s Nakdan. ChatGPT-3.5 freely changed the text, even rewriting some phrases and adding an entire sentence. ChatGPT-3.5 also provided erroneous vowels in its rewritten Hebrew text. ChatGPT-4 did a moderately good job with only a few errors, but also modified the orthography. One of ChatGPT-4’s errors was not trivial, resulting in the invention of a word. When challenged, ChatGPT-4 corrected this confabulation by inventing another word, which it claimed was a “rare form” for which it provided a fictitious derivation. When challenged on this second made-up word, ChatGPT-4 replaced the word from the input text with a word based on an entirely different root. DoItInHebrew inserted vowels that produced a gibberish text. In contrast, Dicta’s Nakdan provided near perfect vocalization, with only one genuine error, but like ChatGPT-4 it modified the orthography. ChatGPT-3.5, ChatGPT-4, and DoItInHebrew exhibited serious “hallucinations,” of both the “factual” and the “untruthful” varieties, typical of other AIs, making them counterproductive for vocalizing historic Hebrew texts. Nakdan can be a powerful tool but still requires someone with expertise in Hebrew grammar to verify and correct the vocalization. Nakdan’s interface simplified correcting the vocalization, although it required its user to have advanced knowledge of Hebrew.

https://doi.org/10.46298/jdmdh.11182

Source : zenodo.org:12561418

Publié le : 11 décembre 2024

Accepté le : 26 octobre 2023

Soumis le : 12 avril 2023

Mots-clés : ChatGPT,AI,Hebrew,manuscripts,vocalization,vowels,Lord's Prayer,New Testament,philology,Middle Ages,hallucinations

Licence : Creative Commons Attribution 4.0 International (CC BY 4.0)

Fichiers

Nom	Taille
Using ChatGPT and Other AI Engines to Vocalize Medieval Hebrew 5.03.pdf md5 : 7ab1e70b2bc28d25bf3dde548246bfc4	472.9 KB

Nehemia Gordon ; Natan Freeman - Using ChatGPT and Other AI Engines to Vocalize Medieval Hebrew

Fichiers

Références bibliographiques

Partager et exporter

Statistiques de consultation