Natallia Kokash ; Matteo Romanello ; Ernest Suyver ; Giovanni Colavizza - From Books to Knowledge Graphs

jdmdh:9380 - Journal of Data Mining & Digital Humanities, 13 mars 2023, 2023 - https://doi.org/10.46298/jdmdh.9380
From Books to Knowledge GraphsArticle

Auteurs : Natallia Kokash ORCID1; Matteo Romanello ORCID2; Ernest Suyver 3; Giovanni Colavizza ORCID1

The digital transformation of the scientific publishing industry has led to dramatic improvements in content discoverability and information analytics. Unfortunately, these improvements have not been uniform across research areas. The scientific literature in the arts, humanities and social sciences (AHSS) still lags behind, in part due to the scale of analog backlogs, the persisting importance of national languages, and a publisher ecosystem made of many, small or medium enterprises. We propose a bottom-up approach to support publishers in creating and maintaining their own publication knowledge graphs in the open domain. We do so by releasing a pipeline able to extract structured information from the bibliographies and indexes of AHSS publications, disambiguate, normalize and export it as linked data. We test the proposed pipeline on Brill's Classics collection, and release an implementation in open source for further use and improvement.


Volume : 2023
Publié le : 13 mars 2023
Accepté le : 5 décembre 2022
Soumis le : 25 avril 2022
Mots-clés : Computer Science - Digital Libraries

Statistiques de consultation

Cette page a été consultée 1356 fois.
Le PDF de cet article a été téléchargé 377 fois.