BIMCV COVID-19+: a large annotated dataset of RX and CT images from COVID-19 patients with Extension Part III
- Submitted by: Jose Saborit-Torres
- Last updated: Mon, 03/06/2023 - 09:10
BIMCV-COVID19+ is a large dataset of chest X-ray (CXR) images (CR, DX) and computed tomography (CT) images of COVID-19 patients, together with their radiographic findings, pathologies, polymerase chain reaction (PCR) tests, immunoglobulin G (IgG) and immunoglobulin M (IgM) diagnostic antibody tests, and radiographic reports from the Valencian Region Medical Image Bank (BIMCV). The findings are mapped onto standard Unified Medical Language System (UMLS) terminology and cover a wide spectrum of thoracic entities, in contrast to the much smaller number of entities annotated in previous datasets. Images are stored in high resolution, and entities are localized with anatomical labels in Medical Imaging Data Structure (MIDS) format. In addition, 5 images were annotated by a team of expert radiologists to include semantic segmentation of radiographic findings. Extensive additional information is also provided, including the patient’s demographic information, the type of projection and the acquisition parameters for the imaging study, among others. These iterations of the database include 21342 CR, 34829 DX and 7918 CT studies.
Project/Equipment funded by the Consellería de Sanitat Universal i Salut Pública (Generalitat Valenciana, Spain) and the EU Operational Program of the European Regional Development Fund (ERDF) for the Valencian Community 2014-2020, within the framework of the REACT-EU programme, as the Union's response to the COVID-19 pandemic.
Once all the compressed files have been downloaded, run 00_extract_data.sh to decompress them correctly. For more information, see the links on this page.
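The extraction step can be wrapped in a small pre-flight check. The following is only a minimal Python sketch, not part of the dataset tooling: the helper names `find_archives` and `run_extraction` are hypothetical, the archive suffixes are guesses (this page does not state the compression format), and it assumes that 00_extract_data.sh takes no arguments and is run from the directory holding the downloaded files. Read the script itself before relying on any of this.

```python
import subprocess
from pathlib import Path

# Assumption: these suffixes are illustrative only; the page does not say
# which compression format the dataset parts use.
ARCHIVE_SUFFIXES = (".zip", ".tar.gz", ".tgz")


def find_archives(download_dir):
    """Return the sorted names of files in download_dir that look like archives."""
    return sorted(
        p.name
        for p in Path(download_dir).iterdir()
        if p.is_file() and p.name.lower().endswith(ARCHIVE_SUFFIXES)
    )


def run_extraction(download_dir, script_name="00_extract_data.sh"):
    """Run the extraction script from inside download_dir.

    Assumption: the script takes no arguments and expects to be started
    from the directory that contains the downloaded archives.
    """
    script = Path(download_dir) / script_name
    if not script.is_file():
        raise FileNotFoundError(f"{script_name} not found in {download_dir}")
    if not find_archives(download_dir):
        raise RuntimeError(f"no archives found in {download_dir}")
    # check=True raises CalledProcessError if the script exits with an error.
    subprocess.run(["bash", script_name], cwd=download_dir, check=True)
```

Calling `run_extraction("path/to/downloads")` (a placeholder path) would then fail early with a clear error if the script or the archives are missing, rather than partway through decompression.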
We are currently having difficulty uploading all the files. We will post a note in this section once the full dataset is available.