Chest radiography (CXR) can be used as a complementary method for diagnosing and following up COVID-19 patients. However, the experience level and workload of technicians and radiologists may affect the decision process.
In order to evaluate the performance of radiologists in the detection of COVID-19 radiological features in CXR, manual annotation of a random subset of CXR images was performed by two radiologists using in-house software. The software presented CXRs from the randomly selected subset and allowed for window center/width adjustment, zooming and panning. Radiologists were asked to assign each CXR to one of four classes: Normal, Not indicative of COVID-19 (pathological), Indicative of COVID-19 and Undetermined. The Indicative of COVID-19 class was defined as CXRs where the patient presented findings indicative of COVID-19, namely bilateral pulmonary opacities of low/medium density. The Undetermined class was defined as CXRs where the patient presented findings that could be indicative of COVID-19 but could also be indicative of another condition, namely unilateral lung opacities, diffuse bilateral opacities with an ARDS pattern or diffuse reticular opacities. The Not indicative of COVID-19 (pathological) class was defined as CXRs where the patient presented findings indicative of any pathology other than COVID-19. CXRs where the patient presented medical devices were classified as Normal if the underlying pathology was not visible. Additionally, CXRs without sufficient quality for visual assessment by the radiologists, due to poor image quality, patient positioning or any other factor, could be labelled as Compromised for exclusion.
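The five possible annotation outcomes described above (four diagnostic classes plus the Compromised exclusion flag) can be encoded as a small enumeration. This is an illustrative sketch, not the actual annotation software's data model; the class name `CxrLabel` and the string values are assumptions:

```python
from enum import Enum

class CxrLabel(Enum):
    # Hypothetical encoding of the annotation classes described above
    NORMAL = "Normal"
    NOT_INDICATIVE = "Not indicative of COVID-19 (pathological)"
    INDICATIVE = "Indicative of COVID-19"
    UNDETERMINED = "Undetermined"
    COMPROMISED = "Compromised"  # marks a CXR for exclusion, not a diagnosis

# Example: mapping a raw annotation string back to its label
label = CxrLabel("Indicative of COVID-19")
```

Representing Compromised as a sibling value keeps a single annotation field per CXR, with exclusion handled downstream.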
A total of 1,845 CXRs were selected for annotation by two radiologists. Of these, 1,256 belong to the Mixed dataset (a combination of multiple public sources), 289 belong to BIMCV and 300 belong to COVIDGR. CXRs were selected randomly within each dataset: for the Mixed dataset, a class-balanced selection strategy was used, whereas for BIMCV and COVIDGR, the original class distribution of each dataset was preserved in the subset selected for annotation.
Manual labelling of CXRs was performed in two stages. First, both radiologists independently classified each CXR. CXRs where the two radiologists disagreed were then selected for the second stage where the two radiologists assessed the CXRs together to achieve consensus. At no point were radiologists given access to the ground truth label, RT-PCR results or any other information besides the CXR image.
To ensure that written information present in the CXR image (such as hospital system, health service, laterality markers, patient positioning, etc.) did not bias the annotation, all written labels were blacked out before annotation.
This dataset contains the information regarding each annotated CXR and the resulting radiologist labels for each image before and after consensus.
The public datasets used in this study are available in the following repositories:
COVID-19 Image Data Collection (IDC): https://github.com/ieee8023/covid-chestxray-dataset
COVID Data Save Lives: https://www.hmhospitales.com/coronavirus/covid-data-save-lives
BIMCV PADCHEST: https://bimcv.cipf.es/bimcv-projects/padchest/
BIMCV COVID-19: https://bimcv.cipf.es/bimcv-projects/bimcv-covid19/