Welcome to the INESC TEC research data repository.
This data repository showcases datasets produced or used by INESC TEC researchers and their partners. It is an embodiment of our institutional commitment to Open Data in research.

CS: Computer Science
The Computer Science Cluster mission is to contribute to the understanding of computing, to the rigorous development...
-
Interface Element Frequencies in Search Engine Results Pages (SERPs) Across...
This dataset contains the data produced for the dissertation ""User Interface Variations in Search Engine Results Pages Across Types of Search Queries and Search Engines"". The... -
Dataset of synthetic clinical notes in European Portuguese generated using...
This dataset was generated using an open-source large language model and carefully curated prompts, simulating realistic clinical narratives while ensuring no real patient data...

INESC TEC
The Institute for Systems and Computer Engineering, Technology and Science – INESC TEC is an Associate Laboratory...
-
LabadainLog-17k+: Search Logs from Tetun-Speaking Users Across Chat, Web,...
1. Overview LabadainLog-17k+ is a dataset of interaction logs in Tetun, collected from three different platforms: Labadain Chat (16,952 prompts): An LLM-powered conversational... -
Labadain-ZSRunS: Sparse and Zero-Shot Dense Retrieval Runs with...
1. Overview Labadain-ZSRunS is a dataset consisting of run files produced by classical sparse and zero-shot dense retrieval models, resulted from the experiments on Tetun ad-hoc...