Analysis of the DigitArq archive record entities and their properties in Wikidata and in DBpedia

This dataset is the product of a master's dissertation (https://hdl.handle.net/10216/143034). The study analyzed a sample of 25 records from the Portuguese National Archives (https://digitarq.arquivos.pt/), chosen by archival specialists as representative of different fonds and description levels, to identify entities and properties and to explore relationships with non-archival resources. Once the entities in the records had been selected, Wikidata and DBpedia were chosen as the databases for data enrichment and linking. Analysis of the entities yielded hundreds of properties directly related to the corresponding classes in those databases. A more significant subset was then obtained by keeping the properties that had the most data entered in the databases for entities of the corresponding classes. These entities and properties were then tested using the Wikidata Query Service and the DBpedia SPARQL Explorer. The goal of generating this data was to provide additional information for improving the search interfaces of archival information systems, helping to optimize information retrieval for the benefit of archivists and users.
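As an illustration of the property-selection step described above, the following is a minimal sketch in Python of how property usage for entities of a given class might be counted and ranked against the Wikidata Query Service. It is not the procedure used in the dissertation: the function names, the query shape, and the User-Agent string are assumptions made for this example, and only the public endpoint URL is a known fact. On very large classes, a query of this kind may exceed the service's time limit.

```python
import json
import urllib.parse
import urllib.request

# Public SPARQL endpoint of the Wikidata Query Service.
WIKIDATA_ENDPOINT = "https://query.wikidata.org/sparql"


def build_property_usage_query(class_qid: str, limit: int = 20) -> str:
    """Build a SPARQL query that counts how often each property is used
    on instances (wdt:P31) of the given Wikidata class.

    Hypothetical helper: the query shape is an assumption, not the
    dissertation's actual query.
    """
    if not (class_qid.startswith("Q") and class_qid[1:].isdigit()):
        raise ValueError(f"not a Wikidata item ID: {class_qid!r}")
    return (
        "SELECT ?p (COUNT(?s) AS ?uses) WHERE {\n"
        f"  ?s wdt:P31 wd:{class_qid} ;\n"
        "     ?p ?o .\n"
        "}\n"
        "GROUP BY ?p\n"
        "ORDER BY DESC(?uses)\n"
        f"LIMIT {int(limit)}"
    )


def rank_properties(results: dict) -> list:
    """Turn a SPARQL JSON result into (property IRI, use count) pairs,
    sorted from most used to least used."""
    rows = results["results"]["bindings"]
    ranked = [(row["p"]["value"], int(row["uses"]["value"])) for row in rows]
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)


def run_query(query: str, endpoint: str = WIKIDATA_ENDPOINT) -> dict:
    """Send the query over HTTP GET and return the parsed JSON result.

    Requires network access; the User-Agent value is a placeholder.
    """
    url = endpoint + "?" + urllib.parse.urlencode({"query": query})
    request = urllib.request.Request(
        url,
        headers={
            "Accept": "application/sparql-results+json",
            "User-Agent": "property-usage-sketch/0.1 (example)",
        },
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)
```

Calling rank_properties(run_query(build_property_usage_query("Q5"))) would, under these assumptions, list the properties most often filled in for instances of the class Q5 (human). The same ranking function could be reused for DBpedia by writing an equivalent query for its endpoint, since both services can return results in the standard SPARQL JSON format.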

Additional Info

Field              Value
Source             Arquivo Nacional Torre do Tombo
Author             Camilla Oliveira da Silveira
Last Updated       May 14, 2024, 10:44 (UTC)
Created            January 13, 2023, 15:36 (UTC)
Citation           Oliveira da Silveira, C. (2023). Analysis of the DigitArq archive record entities and their properties in Wikidata and in DBpedia [Data set]. INESC TEC. https://doi.org/10.25747/GV77-5K64
Contributor        EPISA Project team
Creation Date      2022-10-01
DOI                https://doi.org/10.25747/gv77-5k64
Format             .xlsx
Language           PT, EN
Relation           Oliveira da Silveira, C. (2022). Entidades em Documentos De Arquivo E Sua Expansão Com Fontes De Dados Ligados No Projeto EPISA. https://repositorio-aberto.up.pt/handle/10216/143034
Temporal Coverage  1523-2006