jonggwon-park committed
Commit 1e6e7d6 · 1 parent: ee7cec4

update dataset sources

Files changed (1)
  1. data/README.md +34 -13
data/README.md CHANGED
@@ -1,18 +1,39 @@
- # Dataset Preprocessing Sources
 
- ## MIMIC-CXR
- - Source: [MIMIC-CXR-JPG](https://physionet.org/content/mimic-cxr-jpg/2.1.0/)
 
- ## ChestXDet10, ChestXray14, CheXpert, OpenI, PadChest
- - Preprocessed files from [CARZero (CVPR 2024)](https://github.com/laihaoran/CARZero) with minor path modifications in CSV files
 
- ## SIIM
- - Preprocessing from [MGCA (NeurIPS 2022)](https://github.com/HKU-MedAI/MGCA/blob/main/mgca/preprocess/siim.py)
 
- ## RSNA
- - Preprocessed file from [MedKLIP (ICCV 2023)](https://github.com/MediaBrain-SJTU/MedKLIP/blob/main/Sample_Zero-Shot_Grounding_RSNA/data_sample/test.csv)
- - Only file paths in CSV files were modified
 
- ## MS-CXR
- - Dataset: [MS-CXR](https://physionet.org/content/ms-cxr/0.1/)
- - Test split: [MedRPG (MICCAI 2023)](https://github.com/eraserNut/MedRPG/tree/master/data/MS_CXR)
+ # Dataset Sources
 
+ This document describes the datasets and preprocessing sources used in this project.
 
+ ### MIMIC-CXR
+ - **Image & Reports**: [MIMIC-CXR-JPG v2.1.0](https://physionet.org/content/mimic-cxr-jpg/2.1.0/)
 
+ ### OpenI
+ - **Images**: [Chest X-rays Indiana University (Kaggle)](https://www.kaggle.com/datasets/raddar/chest-xrays-indiana-university)
+ - **Preprocessing**: [CARZero (CVPR 2024)](https://github.com/laihaoran/CARZero) — minor path modifications in CSV files
 
+ ### PadChest
+ - **Images**: [BIMCV PadChest](https://bimcv.cipf.es/bimcv-projects/padchest/)
+ - **Preprocessing**: [CARZero (CVPR 2024)](https://github.com/laihaoran/CARZero) — minor path modifications in CSV files
 
+ ### ChestXray14
+ - **Images**: [NIH ChestXray](https://nihcc.app.box.com/v/ChestXray-NIHCC/folder/37178474737)
+ - **Preprocessing**: [CARZero (CVPR 2024)](https://github.com/laihaoran/CARZero) — minor path modifications in CSV files
+
+ ### CheXpert
+ - **Images**: [Stanford CheXpert](https://stanfordmlgroup.github.io/competitions/chexpert/)
+ - **Preprocessing**: [CARZero (CVPR 2024)](https://github.com/laihaoran/CARZero) — minor path modifications in CSV files
+
+ ### ChestXDet10
+ - **Images**: [Deepwise AILab ChestX-Det10](https://github.com/Deepwise-AILab/ChestX-Det10-Dataset)
+ - **Preprocessing**: [CARZero (CVPR 2024)](https://github.com/laihaoran/CARZero) — minor path modifications in CSV files
+
+ ### SIIM
+ - **Images**: [SIIM-ACR Pneumothorax Segmentation (Kaggle)](https://www.kaggle.com/datasets/jesperdramsch/siim-acr-pneumothorax-segmentation-data)
+ - **Preprocessing**: [MGCA (NeurIPS 2022)](https://github.com/HKU-MedAI/MGCA/blob/main/mgca/preprocess/siim.py)
+
+ ### RSNA
+ - **Images**: [RSNA Pneumonia Detection Challenge 2018](https://www.rsna.org/artificial-intelligence/ai-image-challenge/rsna-pneumonia-detection-challenge-2018)
+ - **Preprocessing**: [MedKLIP (ICCV 2023)](https://github.com/MediaBrain-SJTU/MedKLIP/blob/main/Sample_Zero-Shot_Grounding_RSNA/data_sample/test.csv) — only file paths in CSV files were modified
+
+ ### MS-CXR
+ - **Images**: [MIMIC-CXR-JPG v2.1.0](https://physionet.org/content/mimic-cxr-jpg/2.1.0/)
+ - **Dataset**: [MS-CXR v0.1](https://physionet.org/content/ms-cxr/0.1/)
+ - **Test Split**: [MedRPG (MICCAI 2023)](https://github.com/eraserNut/MedRPG/tree/master/data/MS_CXR)
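
Several entries above note that only file paths in the upstream CSV files were modified. The README does not say how this was done, so the following is only a minimal sketch of one way such a path rewrite could look. The function name `rewrite_paths`, the column name `Path`, and both directory prefixes are hypothetical and are not taken from CARZero, MedKLIP, or this repository.

```python
import csv
import io


def rewrite_paths(src, dst, column, old_prefix, new_prefix):
    """Copy CSV rows from src to dst, swapping a path prefix in one column.

    src and dst are text file objects. Returns the number of rows changed.
    Rows whose path does not start with old_prefix are copied unchanged.
    """
    reader = csv.DictReader(src)
    if reader.fieldnames is None or column not in reader.fieldnames:
        raise ValueError(f"column {column!r} not found in CSV header")

    writer = csv.DictWriter(dst, fieldnames=reader.fieldnames, lineterminator="\n")
    writer.writeheader()

    changed = 0
    for row in reader:
        value = row[column]
        if value.startswith(old_prefix):
            # Replace only the leading prefix, keep the rest of the path intact.
            row[column] = new_prefix + value[len(old_prefix):]
            changed += 1
        writer.writerow(row)
    return changed


if __name__ == "__main__":
    # Hypothetical column name and prefixes, for illustration only.
    source = io.StringIO("Path,label\n/old/root/a.jpg,1\n/other/b.jpg,0\n")
    target = io.StringIO()
    count = rewrite_paths(source, target, "Path", "/old/root/", "/data/")
    print(count)
    print(target.getvalue(), end="")
```

For files on disk, the same function can be called with handles opened via `open(path, newline="")`, which is the mode the `csv` module documentation recommends.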