-
ArchOnto - January, 2022
A linked data model for archives composed of five different ontologies - CIDOC CRM (base ontology developed in the museums context), DataObject, N-ary (based in CIDOC... -
Text2Story Lusa
The Text2Story Lusa dataset contains 357 news articles published in European Portuguese by the Lusa news agency mostly between October 2020 and December 2020. The articles are... -
Images annotated according to their content: a study on the description of...
Data description is a fundamental step in Research Data Management (RDM). When it comes to images, the challenge is increased, as they have characteristics that differentiate... -
Typewritten Digital Representations of Portuguese Cultural Heritage...
The dataset has typewritten Portuguese documents extracted from the Arquivo Nacional da Torre do Tombo (https://digitarq.arquivos.pt/). It includes records from two fonds of the... -
Evolution of Web search engine interfaces through SERP screenshots and HTML...
This dataset was extracted for a study on the evolution of Web search engine interfaces since their appearance. The well-known list of “10 blue links” has evolved into richer... -
Classification of online health messages
Classification of online health messages The dataset has 487 annotated messages taken from Medhelp, an online health forum with several health communities... -
Manual Transcriptions of Typewritten Digital Representations of Portuguese...
The dataset includes manual transcriptions of typewritten digital representations of Portuguese cultural heritage documents from the 20th century, extracted from the Arquivo... -
Analysis of the DigitArq archive record entities and their properties in...
This dataset, which is the product of a master's dissertation (https://hdl.handle.net/10216/143034), analyzed a sample of 25 records from the Portuguese National Archives... -
Immersive Learning Thematic Network Data
Information for a database where practices, strategies and uses of Immersive Learning are connected with works in the field. These Practices, Strategies and Uses were found... -
ISAD(G) Descriptions of Archival Records With Entity Annotation
This dataset contains long text ISAD(G) fields from records from the Arquivo Nacional da Torre do Tombo annotated with entities. It was built to evaluate the effectiveness of... -
CIDOC-CRM Ontology Representation of the Portuguese Archival Description...
This dataset includes an excerpt of the CIDOC-CRM Ontology Representation of the DigitArq records from Bragança District Archive, two SPARQL query examples - "What are the... -
Radon concentration (Bq.m-3) from INESC TEC station (Porto). Updated monthly.
The dataset consists on measurements every 6-hours of radon concentration on the roof of INESC TEC main building. This dataset has Jupyter notebook for visualization (for better... -
Methods and Tools for Causal Discovery and Causal Inference
Nowadays ML models are used in decision-making processes in real-world problems, by learning a function that maps the observed features with the decision outcomes. However these... -
Research Data Management behaviors in the image lifecycle
Research data management (RDM) practices are critical to ensuring research success. Data can take different formats and data in image format has been little studied in RDM. To... -
Wikipedia information quality comparison between idioms
Source code and dataset from the first part of my Master Dissertation - "Avaliação da qualidade da Wikipédia enquanto fonte de informação em saúde" (Wikipedia quality assessment... -
Content Analysis of Publications in Experimental domains
This dataset support the proposal of manual content analysis as an approach to streamline the data curator workflow. We have performed manual context analysis over publications... -
Research data management in image format - Survey
These datasets were the result of research on research data management in image format. Based on the data collected, it was possible to study the practices and habits in the... -
Meteorological data from LFC/FEUP station
The dataset consists on measurements every 5-minutes from the LFC/FEUP meteorological station, including air temperature (in degrees C), relative humidity (%), atmospheric... -
Atmospheric electric field from INESC TEC station (Porto)
The dataset consists on 1-min measurements of the atmospheric electric field by a CS110 field mill installed on the roof of INESC TEC main building. This dataset has Jupyter... -
Gamma radiation from INESC TEC station (Porto)
The dataset consists on measurements of the total number of gamma rays counted by a NaI(Tl) scintillator on the roof of INESC TEC main building. This dataset has Jupyter...