Automated Image Label Extraction from Radiology Reports

This data set contains the raw data from Figures 3 and 4 of the manuscript "Automated Image Label Extraction from Radiology Reports - A Review"

The data used to create Figure 3 (co.autorships) provides Co-authorship links among authors of the studies included in the synthesis. The "nodes" tab contains information on each author, such as their modularity class and their degree (the number of colaborations with other authors). The "edges" tab contains information on the links between authors, such as the weight (the number of colaborations) of each link between two authors. A modularity algorithm (https://dx.doi.org/10.1088/1742-5468/2008/10/P10008) was used for community detection. This data was used to create Figure 3 of the manuscript.

The data used to create Figure 4 of the manuscript (wordcloud_counts) provides the number of occurences of named entities across the abstracts of the included studies. The entities were extracted using the spaCy model en_core_sci_lg, which is appropriate for processing biomedical, scientific and clinical text (https://spacy.io/). This data was used to create Figure 4 of the manuscript.

Data and Resources

Additional Info

Field Value
Author Sofia Cardoso Pereira, Ana Maria Mendonça, Aurélio Campilho, Pedro Sousa, Carla Teixeira Lopes
Last Updated April 29, 2024, 13:23 (UTC)
Created December 4, 2023, 14:49 (UTC)
Citation C. Pereira, S., Mendonça, A. M., Campilho, A., Sousa, P., & Teixeira Lopes, C. (2023). Automated Image Label Extraction from Radiology Reports [Data set]. INESC TEC. https://doi.org/10.25747/XHBN-B855
Creation Date 2023-11-30
DOI https://doi.org/10.25747/xhbn-b855
File Size 16KB/53/KB
Format excel
Language EN
Temporal Coverage 2013-2023
Type Raw data