-
Semantic representation of the Registos de Baptismos da Paróquia de Aldoar...
This dataset comprises mappings of archival records from the National Archives of Portugal to the RiC-O (Records in Contexts Ontology) framework, namely the baptism registries... -
Wikipedia and Simple Wikipedia Lead Section Pairs for Nine Categories
The dataset (categorized_dataset folder) contains 9 files in .csv format, each a collection of 10,000 lead section pairs sourced from Wikipedia (https://www.wikipedia.org/) and... -
Raw data collected onboard the Sagres ship during the SAIL project campaign
Project SAIL aimed to improve the scientific understanding of the marine boundary layer by means of a unique monitoring campaign on board the iconic Portuguese tall ship NRP... -
Pre-processed atmospheric data from the SAIL campaign onboard the Sagres ship
Project SAIL aimed to improve the scientific understanding of the marine boundary layer by means of a unique monitoring campaign on board the iconic Portuguese tall ship NRP... -
Metadata and Analysis of Clinical Information Extraction Publications Using...
This dataset contains all the data collected on all the papers analyzed in our publication, entitled "Harnessing Large Language Models for Clinical Information Extraction: A... -
Defect Detection Dataset: Porosities in Machined Aluminum Holes
This dataset comprises 302 JPEG images captured with an endoscopic camera, focusing on detecting porosities in the inner walls of machined holes in cast aluminum parts. Each image... -
Heart and Clavicle Segmentation References in Chest Radiography - Montgomery Dataset
The analysis of chest radiography imaging is of paramount importance for healthcare institutions since it is one of the most used imaging modalities for patient diagnosis,... -
ROAM@CRAS - A haRbor multidOmAin Mapping dataset
The ROAM@CRAS dataset was acquired using an Autonomous Surface Vehicle (ASV), the SENSE, equipped with a set of incorporated sensors for perceiving the surface and underwater... -
Daily profiles (2020) of load of a rural synthetic electricity distribution...
This dataset was prepared under the framework of the ATTEST project, financed by the European Commission with grant number 864298. The dataset contains information about a... -
An urban synthetic electricity distribution network from Spain, first...
This dataset was prepared under the framework of the ATTEST project, financed by the European Commission with grant number 864298. The dataset contains information about the... -
Content Analysis of Publications in Experimental domains
This dataset supports the proposal of manual content analysis as an approach to streamline the data curator workflow. We have performed manual context analysis over publications... -
Research Data Management behaviors in the image lifecycle
Research data management (RDM) practices are critical to ensuring research success. Data can take different formats, and data in image format has been little studied in RDM. To... -
Gamma radiation monitoring at the Azores ENA-ARM station (Graciosa Island)
Gamma-ray total counts in counts/minute (cpm), every 15 minutes, from the Gamma Radiation Monitoring campaign at the ENA site (Graciosa, Azores). Data were collected to study... -
CIDOC-CRM Ontology Representation of the Portuguese Archival Description...
This dataset includes an excerpt of the CIDOC-CRM Ontology Representation of the DigitArq records from Bragança District Archive, two SPARQL query examples - "What are the... -
Ambient radioactivity data from S. Jorge island (Azores), since March 2022
The island of S. Jorge, in the Azores archipelago, has been one of the quietest islands in terms of volcanic and seismic events. However, seismicity levels rose abruptly on... -
NEREON (uNderwater dataset for monoculaR dEpth estimatiON)
In the context of the ATLANTIS project, the NEREON dataset (uNderwater datasEt for monoculaR dEpth estimatiON) was created to provide information for training Deep Learning... -
Typewritten Digital Representations of Portuguese Cultural Heritage...
The dataset contains typewritten Portuguese documents extracted from the Arquivo Nacional da Torre do Tombo (https://digitarq.arquivos.pt/). It includes records from two fonds of the... -
ISAD(G) Descriptions of Archival Records With Entity Annotation
This dataset contains long text ISAD(G) fields from records from the Arquivo Nacional da Torre do Tombo annotated with entities. It was built to evaluate the effectiveness of... -
Manual Transcriptions of Typewritten Digital Representations of Portuguese...
The dataset includes manual transcriptions of typewritten digital representations of Portuguese cultural heritage documents from the 20th century, extracted from the Arquivo... -
Data of real-time prediction of Wikipedia articles' quality
This dataset contains data produced for the dissertation "Real-time prediction of Wikipedia articles' quality". The project was conducted by student Pedro Miguel Moás...