-
Labadain-30k+: A Monolingual Tetun Document-Level Audited Dataset
Labadain-30k+ is a monolingual Tetun dataset containing 33,550 documents spanning from June 2001 to September 2023, excluding the years 2004 and 2005, for which no documents are... -
ArchOnto - January, 2022
A linked data model for archives composed of five different ontologies - CIDOC CRM (base ontology developed in the museums context), DataObject, N-ary (based in CIDOC... -
Sagres ship corrected meteorological data 2020
Meteorological data over the Atlantic Ocean collected onboard the NRP Sagres ship. The file includes as columns: day (dd) + month abbreviated name time (HHMM) Latitude (DD° M.M)... -
Sagres ship meteorological data 2020
Meteorological data over the Atlantic Ocean collected onboard the NRP Sagres ship. The file includes as columns: 1) day (dd) + month abbreviated name 2) time (HHMM) 3) Latitude... -
Automated Image Label Extraction from Radiology Reports
This data set contains the raw data from Figures 3 and 4 of the manuscript "Automated Image Label Extraction from Radiology Reports - A Review" The data used to create Figure 3... -
Text2Story Lusa
The Text2Story Lusa dataset contains 357 news articles published in European Portuguese by the Lusa news agency mostly between October 2020 and December 2020. The articles are... -
Text2Story Lusa Annotated Corpus
The Text2Story Lusa Annotated Corpus dataset contains multi-layered manual annotations for 117 articles from the Text2Story Lusa dataset. This resource includes both the text... -
ANODE, an underwater dataset for sacrificial anode detection
The ANODE dataset was created in the context of the ATLANTIS project and contains 18230 images collected at the ATLANTIS Test Center in Viana do Castelo, more specifically at... -
IMP Whole-Slide Images of Colorectal Samples 2024
The IMP-CRS 2024 dataset contains 5333 colorectal biopsy and polypectomy slides, retrieved from the data archive of IMP Diagnostics laboratory, Portugal, digitised at 40X by 2... -
Images annotated according to their content: a study on the description of...
Data description is a fundamental step in Research Data Management (RDM). When it comes to images, the challenge is increased, as they have characteristics that differentiate... -
IMP Whole-Slide Images of Cervical Samples 2024
The IMP-cervix dataset contains 600 cervical LEEP samples and surgical specimens, retrieved from the data archive of the IMP Diagnostics laboratory, Portugal, and were digitised... -
Data of real-time prediction of Wikipedia articles' quality
This dataset contains data produced for the dissertation. "Real-time prediction of Wikipedia articles' quality". The project was conducted by student Pedro Miguel Moás... -
Typewritten Digital Representations of Portuguese Cultural Heritage...
The dataset has typewritten Portuguese documents extracted from the Arquivo Nacional da Torre do Tombo (https://digitarq.arquivos.pt/). It includes records from two fonds of the... -
Evolution of Web search engine interfaces through SERP screenshots and HTML...
This dataset was extracted for a study on the evolution of Web search engine interfaces since their appearance. The well-known list of “10 blue links” has evolved into richer... -
Classification of online health messages
Classification of online health messages The dataset has 487 annotated messages taken from Medhelp, an online health forum with several health communities... -
Heart and Clavicle Segmentation References in Chest Radiography - Montgomery Dataset
The analysis of chest radiography imaging is of paramount importance for healthcare institutions since it is one of the most used imaging modalities for patient diagnosis,... -
NEREON (uNderwater dataset for monoculaR dEpth estimatiON)
In the context of the ATLANTIS project, the NEREON dataset (uNderwater datasEt for monoculaR dEpth estimatiON) was created to provide information for training Deep Learning... -
Manual Transcriptions of Typewritten Digital Representations of Portuguese...
The dataset includes manual transcriptions of typewritten digital representations of Portuguese cultural heritage documents from the 20th century, extracted from the Arquivo... -
Assessment of metrics for the development of an institutional DMP support system
The dataset includes two .xls files that combine the information related to the systematization of the DMP-building method and the development of the DMP support system that is... -
Bibliography and analysis on studies of institutional DMP support services
The dataset is an .xls file that combines the bibliographic information selected to Identify global studies on DMP support in different institutions and understand how...