Text2Story Lusa Annotated Corpus

The Text2Story Lusa Annotated Corpus dataset contains multi-layered manual annotations for 117 articles from the Text2Story Lusa dataset. This resource includes both the text files (.txt) and the annotations in BRAT format (.ann).

This dataset was initially developed in the context of the project "Text2Story: Extracting journalistic narratives from text and representing them in a narrative modeling language" / NORTE-01-0145-FEDER-03185.

To request access to this dataset please fill out this form (Text2Story Lusa Annotated Corpus - Request Form) and send it to: joao.a.castro@inesctec.pt

The Text2Story Annotation Manual is also available with the resource.

If you use this resource, please use the following citations (paper and dataset):

Nunes, S., Jorge, A., Amorim, A., Sousa, H., Leal, A., Silvano, P., Cantante, I.& Campos, R. (2024). Text2Story Lusa: A Dataset for Narrative Analysis in European Portuguese News Articles. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024).

Silvano, P., Jorge, A., Leal, A., Amorim, E., Sousa, H., Cantante, I., Campos, R., & Nunes, S. (2023). Text2Story Lusa Annotated Corpus [Data set]. INESC TEC. https://doi.org/10.25747/ESFS-1P16

Data and Resources

Additional Info

Field Value
Author Purificação Silvano, Alípio Jorge, António Leal, Evelin Amorim, Hugo Sousa, Inês Cantante, Ricardo Campos, Sérgio Nunes
Last Updated May 20, 2024, 09:13 (UTC)
Created June 15, 2023, 11:29 (UTC)
Citation Silvano, P., Jorge, A., Leal, A., Amorim, E., Sousa, H., Cantante, I., Campos, R., & Nunes, S. (2023). Text2Story Lusa Annotated Corpus [Data set]. INESC TEC. https://doi.org/10.25747/ESFS-1P16
Contributor CLUP - Centre of Linguistics of the University of Porto
Creation Date 2023-06-15
DOI https://doi.org/10.25747/esfs-1p16
Format BRAT
Language PT
Relation Nunes, S., Jorge, A., Leal, A., Amorim, E., Sousa, H., Cantante, I., Silvano, P., & Campos, R. (2023). Text2Story Lusa [Data set]. INESC TEC. https://doi.org/10.25747/ET95-BX90
Temporal Coverage 2023
Type Annotations in BRAT standoff format