Dataset for Tongue Image Analysis in Traditional Thai Medicine: A Deep Learning Approach

Citation Author(s):
Kasikrit
Damkliang
Prince of Songkla University
Submitted by:
Kasikrit Damkliang
Last updated:
Fri, 01/03/2025 - 09:55
DOI:
10.21227/56cx-0f96
Data Format:
License:
5
1 rating - Please login to submit your rating.

Abstract 

Traditional Thai medicine (TTM) is an increasingly popular treatment option. Tongue diagnosis is a highly efficient method for determining overall health, practiced by TTM practitioners. However, the diagnosis naturally varies depending on the practitioner's expertise. In this work, we propose tongue image analysis using raw pixels and artificial intelligence (AI) to support TTM diagnoses. The target classification of Tri-Dhat consists of three classes: Vata, Pitta, and Kapha. We utilize our own organized genuine datasets collected from our university's TTM hospital. Class balance and data augmentation were conducted, and we present analysis approaches and experimental designs. Transfer learning techniques for various pretrained models of Deep Learning were exploited. We used two-tailed paired t-tests and single-factor ANOVA analyzes for performance comparisons. Our work demonstrates that DenseNet121 and Xception models provided the most significant results with cropped image datasets, including DSLR-taken and mobile-taken images. Notably, model ensemble evaluations yielded the highest average predictions, achieving a precision of 0.94, an F1 score of 0.96, accuracy of 0.96, sensitivity of 0.96, and specificity of 0.97, supported by a p-value of 0.0003 from ANOVA analysis. We suggest that our methods could be effectively deployed in real-world scenarios to aid TTM practitioners in their diagnoses.

Instructions: 

Cropped imags and tongue ROI imags with respective ground truth masks

Comments

Thank you very much for contributing this valuable dataset to the community. 

Submitted by Tom Ou on Wed, 05/29/2024 - 02:47