Andrea Bertino ; Luca Foppiano ; Laurent Romary ; Pierre Mounier - Leveraging Concepts in Open Access Publications

jdmdh:5081 - Journal of Data Mining & Digital Humanities, June 15, 2020, Volume 2019
Authors: Andrea Bertino ; Luca Foppiano ; Laurent Romary ; Pierre Mounier

    This paper addresses the integration of a Named Entity Recognition and Disambiguation (NERD) service within a group of open access (OA) digital publishing platforms and considers its potential impact on both research and scholarly publishing. The software powering this service, called entity-fishing, was initially developed by Inria in the context of the EU FP7 project CENDARI and provides automatic entity recognition and disambiguation using the Wikipedia and Wikidata data sets. The application is distributed under an open-source licence and has been deployed as a web service in DARIAH's infrastructure, hosted by the French HumaNum. In the paper, we focus on the specific issues related to its integration into five OA platforms specialized in the publication of scholarly monographs in the social sciences and humanities (SSH), as part of the work carried out within the EU H2020 project HIRMEOS (High Integration of Research Monographs in the European Open Science infrastructure). In the first section, we give a brief overview of the current status and evolution of OA publications, considering specifically the challenges that OA monographs are encountering. In the second section, we show how the HIRMEOS project aims to meet these challenges by optimizing five OA digital platforms for the publication of SSH monographs and ensuring their interoperability. In sections three and four, we give a comprehensive description of the entity-fishing service, focusing on its concrete applications in real use cases, together with further ideas on how to exploit the generated annotations. We show that entity-fishing annotations can improve both the research and the publishing process. In the last section, we briefly present further possible application scenarios that could be made available through infrastructural projects.

    Volume: 2019
    Published on: June 15, 2020
    Accepted on: August 22, 2019
    Submitted on: January 15, 2019
    Keywords: Open Access, Named Entity Recognition and Disambiguation (NERD), Entity-Fishing, Monographs, Digital Publishing Platforms, [INFO.INFO-TT] Computer Science [cs]/Document and Text Processing
    Funding:
      Source: OpenAIRE Research Graph
    • High Integration of Research Monographs in the European Open Science infrastructure; Funder: European Commission; Code: 731102

