We curated and released MedCD, a real-world medical clinical dataset, to support building generative artificial intelligence (AI) applications in clinical settings. MedCD is one outcome of our longitudinal applied AI research and deployment at a tertiary care hospital in China. The dataset is real and comprehensive: it was sourced from real-world electronic health records (EHRs), clinical notes, laboratory examination reports, and other clinical sources.

[1] Ye Chen, "MedCD: A Medical Clinical Dataset", IEEE Dataport, 2025. [Online]. Available: http://dx.doi.org/10.21227/kh7n-8n28. Accessed: Apr. 23, 2025.