Automated Image Label Extraction from Radiology Reports

This data set contains the raw data behind Figures 3 and 4 of the manuscript "Automated Image Label Extraction from Radiology Reports - A Review".

The data used to create Figure 3 (co.autorships) describes the co-authorship links among authors of the studies included in the synthesis. The "nodes" tab contains information on each author, such as their modularity class and their degree (the number of collaborations with other authors). The "edges" tab contains information on the links between authors, such as the weight of each link (the number of collaborations between the two authors). A modularity-based algorithm (https://dx.doi.org/10.1088/1742-5468/2008/10/P10008) was used for community detection.
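The relationship between the "edges" tab, the degree column, and the modularity classes can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the author names, weights, and class labels below are made up, the edge list is assumed to hold each undirected pair once with no self-loops, and degree is taken to mean the number of distinct co-authors. It does not rerun the community detection; it only scores a given assignment of modularity classes.

```python
from collections import defaultdict

# Hypothetical rows of the "edges" tab: (source, target, weight).
# Names and weights are invented for illustration only.
edges = [("A", "B", 2), ("B", "C", 1), ("A", "C", 1), ("D", "E", 3)]

# Hypothetical modularity classes, as they might appear in the "nodes" tab.
community = {"A": 0, "B": 0, "C": 0, "D": 1, "E": 1}


def degrees(edges):
    """Number of distinct co-authors per author (unweighted degree)."""
    neighbours = defaultdict(set)
    for source, target, _weight in edges:
        neighbours[source].add(target)
        neighbours[target].add(source)
    return {author: len(others) for author, others in neighbours.items()}


def modularity(edges, community):
    """Weighted modularity Q of a given partition of an undirected graph."""
    total = sum(w for _, _, w in edges)  # total edge weight m
    internal = defaultdict(float)  # edge weight inside each community
    strength = defaultdict(float)  # summed weighted degree per community
    for source, target, w in edges:
        strength[community[source]] += w
        strength[community[target]] += w
        if community[source] == community[target]:
            internal[community[source]] += w
    return sum(
        internal[c] / total - (strength[c] / (2 * total)) ** 2
        for c in strength
    )


author_degrees = degrees(edges)
q = modularity(edges, community)
print(author_degrees)
print(round(q, 4))
```

A higher Q means that more of the collaboration weight falls inside the classes than would be expected from the authors' overall collaboration counts alone.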

The data used to create Figure 4 (wordcloud_counts) provides the number of occurrences of named entities across the abstracts of the included studies. The entities were extracted using the spaCy model en_core_sci_lg, which is suited to biomedical, scientific and clinical text (https://spacy.io/).
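Counts of this kind could be produced roughly as follows. This is a minimal sketch, not the authors' pipeline: the function names are hypothetical, lowercasing and whitespace trimming are assumed normalization steps that the description does not mention, and running extract_entities requires spaCy and the en_core_sci_lg model to be installed separately.

```python
from collections import Counter


def extract_entities(abstracts, model_name="en_core_sci_lg"):
    """Return one list of entity strings per abstract.

    spaCy is imported lazily so the counting step below can be used
    without the library or the model being installed.
    """
    import spacy

    nlp = spacy.load(model_name)
    return [[ent.text for ent in doc.ents] for doc in nlp.pipe(abstracts)]


def count_entities(entities_per_abstract):
    """Count how often each entity string occurs across all abstracts.

    Lowercasing and trimming are assumptions made for this sketch.
    """
    counts = Counter()
    for entities in entities_per_abstract:
        counts.update(entity.strip().lower() for entity in entities)
    return counts


# Hand-written entity lists standing in for the output of extract_entities.
example = [["Chest X-ray", "radiology report"], ["chest x-ray", "labels"]]
entity_counts = count_entities(example)
print(entity_counts.most_common(3))
```

With real data, the call would be count_entities(extract_entities(abstracts)), where abstracts is a list of abstract strings.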

Data and Resources

Additional Information

Field Value
Author Sofia Cardoso Pereira, Ana Maria Mendonça, Aurélio Campilho, Pedro Sousa, Carla Teixeira Lopes
Last Updated April 29, 2024, 13:23 (UTC)
Created December 4, 2023, 14:49 (UTC)
Citation C. Pereira, S., Mendonça, A. M., Campilho, A., Sousa, P., & Teixeira Lopes, C. (2023). Automated Image Label Extraction from Radiology Reports [Data set]. INESC TEC. https://doi.org/10.25747/XHBN-B855
Creation Date 2023-11-30
DOI https://doi.org/10.25747/xhbn-b855
File Size 16 KB / 53 KB
Format Excel
Language EN
Temporal Coverage 2013-2023
Type Raw data