-
ANODE, an underwater dataset for sacrificial anode detection
The ANODE dataset was created in the context of the ATLANTIS project and contains 18230 images collected at the ATLANTIS Test Center in Viana do Castelo, more specifically at... -
Wikipedia and Simple Wikipedia Lead Section Pairs for Nine Categories
The dataset (categorized_dataset folder) contains 9 files in .csv format, each a collection of 10,000 lead section pairs sourced from Wikipedia (https://www.wikipedia.org/) and... -
Metadata and Analysis of Clinical Information Extraction Publications Using...
This dataset contains all the data collected on all the papers analyzed in our publication, entitled "Harnessing Large Language Models for Clinical Information Extraction: A... -
ROAM@CRAS - A haRbor multidOmAin Mapping dataset
The ROAM@CRAS dataset was acquired using an Autonomous Surface Vehicle (ASV), the SENSE, equipped with a set of incorporated sensors for perceiving the surface and underwater... -
Content Analysis of Publications in Experimental domains
This dataset support the proposal of manual content analysis as an approach to streamline the data curator workflow. We have performed manual context analysis over publications... -
Gamma radiation monitoring at the Azores ENA-ARM station (Graciosa Island)
Gamma-ray total counts in counts/minute (cpm), every 15-minutes, from the Gamma Radiation Monitoring campaign at the ENA site (Graciosa, Azores). Data were collected to study... -
Ambient radioactivity data from S. Jorge island (Azores), since March 2022
The island of S. Jorge, in the Azores archipelago, has been one of the quietest islands in terms of volcanic and seismic events. However, seismicity levels raised abruptly on... -
NEREON (uNderwater dataset for monoculaR dEpth estimatiON)
In the context of the ATLANTIS project, the NEREON dataset (uNderwater datasEt for monoculaR dEpth estimatiON) was created to provide information for training Deep Learning... -
ISAD(G) Descriptions of Archival Records With Entity Annotation
This dataset contains long text ISAD(G) fields from records from the Arquivo Nacional da Torre do Tombo annotated with entities. It was built to evaluate the effectiveness of... -
Manual Transcriptions of Typewritten Digital Representations of Portuguese...
The dataset includes manual transcriptions of typewritten digital representations of Portuguese cultural heritage documents from the 20th century, extracted from the Arquivo... -
Data of real-time prediction of Wikipedia articles' quality
This dataset contains data produced for the dissertation. "Real-time prediction of Wikipedia articles' quality". The project was conducted by student Pedro Miguel Moás... -
Urban@CRAS
The dataset was acquired at Porto, one of the most iconic city of Portugal. It were obtained diverse trajectories in different type of scenarios - around of the costal zone of... -
Matrix profile analysis of Dansgaard-Oeschger events in palaeoclimate time series
This dataset includes all the datafiles and computational notebooks required to reproduce the work reported in the paper “Characterisation of Dansgaard-Oeschger events in... -
Radon and environmental parameters measured at Krasnohorska cave (Slovakia)
Measurements of radon activity concentration and environmental parameters performed in the atmosphere of the Krasnohorska cave (Slovakia) during a field campaign in the... -
Geographically distributed solar power time series
This dataset contains electrical energy hourly time series from 44 small-PV (households) units located in the same region, with installed capacity ranging between 1.1 and 3.7... -
Analysis of baptism, marriage and death registers belonging to the District...
The content of the datasets has description units related to baptisms, marriages and deaths from the District Archive of Guarda; baptisms, marriages and deaths from the District... -
Automatic Quality Assessment of Wikipedia Articles - A Systematic Literature...
This is the result dataset related to the article entitled "Automatic Quality Assessment of Wikipedia Articles - A Systematic Literature Review", which is a systematic... -
Radon data from ENVRIplus TNA campaign RELECT at SMEAR II – HYYTIÄLÄ...
Radon concentration measurements (in Bq/m3) every 2-hours. -
Labadain-30k+: A Monolingual Tetun Document-Level Audited Dataset
Labadain-30k+ is a monolingual Tetun dataset containing 33,550 documents spanning from June 2001 to September 2023, excluding the years 2004 and 2005, for which no documents are... -
IMP Whole-Slide Images of Colorectal Samples 2024
The IMP-CRS 2024 dataset contains 5333 colorectal biopsy and polypectomy slides, retrieved from the data archive of IMP Diagnostics laboratory, Portugal, digitised at 40X by 2...