Academy & Industry Research Collaboration Center (AIRCC)

Volume 13, Number 03, February 2023

An Entity Registry: A Model for a Repository of Entities Found in a Document Set

  Authors

Valerio Bellandi1 and Stefano Siccardi2, 1Università degli Studi di Milano, Italy, 2Consorzio Interuniversitario Nazionale per l’Informatica, Italy

  Abstract

This paper proposes a conceptual structure for a repository of entities that can be found by usual procedures of Natural Language Processing, that is the search for entities mentioned in text, their identification, possibly through the link to entries in Background Knowledge Basis (BKG) and theconstruction of a Knowledge Basis or Graph to host the information found in this process. We address applications where a BKG is of little help, because the involved entities are not so relevant to be included in any, being for instance ordinary people or small companies. Therefore, we rely on the entities’ attributes and relationships for unique identification, disambiguation, knowledge checking and any other relevant operation. One of the final goals achieved by the proposed method is the ability to merge knowledge collected in separate bases, once they are referred to the same Entity Registry.

  Keywords

Named Entity Recognition, Named Entity Linking, Knowledge Basis, Knowledge Graphs.