-
Discourse Ontology for Incorporation/Transference Events
The Discourse Ontology for Incorporation/Transference Events is an OWL ontology, a semantic interpretation ontology (SIO), that contains the concepts and relations used in... -
Consensual ArchOnto representation of 13 Portuguese Historical Archival...
The dataset contains archival descriptions represented in the ArchOnto model (https://rdm.inesctec.pt/dataset/cs-2022-004) of 13 records from the 20th century with typewritten... -
Evaluation of the adoption of the RDA recommendations
This work describes the work carried out wit several participants, from the University of Porto community, to promote of the adoption of the recommendations of the Research Data... -
Analysis of baptism, marriage and death registers belonging to the District...
The content of the datasets has description units related to baptisms, marriages and deaths from the District Archive of Guarda; baptisms, marriages and deaths from the District... -
Representation of 25 records from the Portuguese National Archives in Archonto
This dataset includes the manual representation of a sample of 25 archival records from the Portuguese National Archive in ArchOnto. The representation is based on the archive... -
Twitter profiles with related topics and websites
This dataset contains two files created for the dissertation "A Social Media Tool for Domain-Specific Information Retrieval - A Case Study in Human Trafficking" by Tito Griné... -
Wikipedia information quality assessment
Dataset from the second part of the Master Dissertation - "Avaliação da qualidade da Wikipédia enquanto fonte de informação em saúde" (Wikipedia quality assessment as health... -
Multi-domain data description sessions follow-up questionnaires
This dataset consists in data from 13 multi-domain data description sessions follow-up questionnaires. Researchers from the University of Porto participated in a data... -
Describing data in image format: Proposal of a metadata model and controlled...
Research data management (RDM) includes people with different needs, specific scientific contexts, and diverse requirements. The description of data is a big RDM challenge.... -
User evaluation of the DigitArq and DigitArq+ interfaces
This dataset is part of a dissertation entitled "Evaluation of the migration of archival records to linked data in the EPISA project" (https://hdl.handle.net/10216/143042 ).... -
Automatic Quality Assessment of Wikipedia Articles - A Systematic Literature...
This is the result dataset related to the article entitled "Automatic Quality Assessment of Wikipedia Articles - A Systematic Literature Review", which is a systematic... -
Radon data from ENVRIplus TNA campaign RELECT at SMEAR II – HYYTIÄLÄ...
Radon concentration measurements (in Bq/m3) every 2-hours. -
Labadain-30k+: A Monolingual Tetun Document-Level Audited Dataset
Labadain-30k+ is a monolingual Tetun dataset containing 33,550 documents spanning from June 2001 to September 2023, excluding the years 2004 and 2005, for which no documents are... -
ArchOnto - January, 2022
A linked data model for archives composed of five different ontologies - CIDOC CRM (base ontology developed in the museums context), DataObject, N-ary (based in CIDOC... -
Text2Story Lusa
The Text2Story Lusa dataset contains 357 news articles published in European Portuguese by the Lusa news agency mostly between October 2020 and December 2020. The articles are... -
Images annotated according to their content: a study on the description of...
Data description is a fundamental step in Research Data Management (RDM). When it comes to images, the challenge is increased, as they have characteristics that differentiate... -
Evolution of Web search engine interfaces through SERP screenshots and HTML...
This dataset was extracted for a study on the evolution of Web search engine interfaces since their appearance. The well-known list of “10 blue links” has evolved into richer... -
Classification of online health messages
Classification of online health messages The dataset has 487 annotated messages taken from Medhelp, an online health forum with several health communities... -
Radon concentration (Bq.m-3) from INESC TEC station (Porto). Updated monthly.
The dataset consists on measurements every 6-hours of radon concentration on the roof of INESC TEC main building. This dataset has Jupyter notebook for visualization (for better... -
Methods and Tools for Causal Discovery and Causal Inference
Nowadays ML models are used in decision-making processes in real-world problems, by learning a function that maps the observed features with the decision outcomes. However these...