Study on the Domain Adaption of Korean Speech Act using Daily Conversation Dataset and Petition Corpus

Song, Youngsook; Cho, Won Ik

doi:10.46298/jdmdh.13145

Youngsook Song ; Won Ik Cho - Study on the Domain Adaption of Korean Speech Act using Daily Conversation Dataset and Petition Corpus

jdmdh:13145 - Journal of Data Mining & Digital Humanities, 4 juin 2024, NLP4DH - https://doi.org/10.46298/jdmdh.13145

Study on the Domain Adaption of Korean Speech Act using Daily Conversation Dataset and Petition CorpusArticle

Auteurs : Song, Youngsook ; Cho, Won Ik

In Korean, quantitative speech act studies have usually been conducted on single utterances with unspecified sources. In this study, we annotate sentences from the National Institute of Korean Language's Messenger Corpus and the National Petition Corpus, as well as example sentences from an academic paper on contemporary Korean vlogging, and check the discrepancy between human annotation and model prediction. In particular, for sentences with differences in locutionary and illocutionary forces, we analyze the causes of errors to see if stylistic features used in a particular domain affect the correct inference of speech act. Through this, we see the necessity to build and analyze a balanced corpus in various text domains, taking into account cases with different usage roles, e.g., messenger conversations belonging to private conversations and petition corpus/vlogging script that have an unspecified audience.

https://doi.org/10.46298/jdmdh.13145

Source : zenodo.org:10722019

Volume : NLP4DH

Rubrique : Jeu de données

Publié le : 4 juin 2024

Accepté le : 9 avril 2024

Soumis le : 28 février 2024

Licence : Attribution 4.0 International (CC BY 4.0)

Fichiers

Nom	Taille
2024_JDMDH_Speech_Act_0521.pdf md5 : 77c769b9d7b83212c8e71d675d2ba908	1.89 MB

Youngsook Song ; Won Ik Cho - Study on the Domain Adaption of Korean Speech Act using Daily Conversation Dataset and Petition Corpus

Fichiers

Références bibliographiques

Partager et exporter

Statistiques de consultation