Tongue inspection with a long history is the most intuitive, simple and effective diagnostic methods in traditional Chinese medicine (TCM) [1, 2]. However, traditional tongue diagnosis is affected by factors such as the external environment and doctors’ subjective clinical experience. Digital tongue images can objectively reflect tongue characteristics. Computerized tongue diagnosis systems are gradually being accepted by more and more clinicians as a medical application for health assessment and diagnosis of diseases, such as type 2 diabetes mellitus [3–6], breast cancer [7], colorectal cancer [8], appendicitis [9], gastritis [10], etc.
With the popularization of clinical application of digital tongue picture, massive tongue images data will be produced. The quality of tongue image cannot be ignored, and it is closely related to the accuracy of tongue diagnosis and clinical treatment [1].
The quality of tongue image is an important prerequisite for clinical application of tongue diagnosis, see Fig. 1.
A normal quality tongue image should have the following characteristics: the tongue surface is centered, no fog, no light leakage, no overexposure or underexposure, no focus unevenness, and complete tongue surface, see Fig. 2.We found that in the process of using tongue diagnosis equipment, despite having received standardized tongue image acquisition training, abnormal tongue images are still common in the clinical tongue image acquisition process, mainly from two aspects: from the operators and participants.
On the one hand, due to the operator, there is a standard setting for the abnormal parameter settings of the tongue diagnosis instrument, such as the size of the aperture and the shutter speed. If the tongue is dark and cannot truly reflect the color of the tongue, there will be underexposure (Fig. 2-B-a); If the parameter settings of the tongue diagnosis instrument are changed, if the image of the tongue is too bright, this is an overexposure (Fig. 2-B-b); or the operator may not operate correctly, this may produce tongue images with blurry focus (Fig. 2-B-c), or light leakage (Fig. 2-B-d), etc.
On the other hand, from the participants. The breathe and fog when collecting tongue image (Fig. 2-B-e), does not have good tongue extension training, and has abnormal tongue extension posture (Fig. 2-B-g, Fig. 2-B-h), eat food before collecting tongue image, especially food with pigment, appear stained tongue coating, see (Fig. 2-B-a) and other tongue images of foreign objects on the tongue (Fig. 2-B-f).
The images of these situations will affect the process and results of data analysis, and will severely interfere with the results of subsequent tongue image analysis. These tongue images bring interference bias to the later analysis and processing of tongue image data, which will cause misclassification of pattern differentiation of syndrome in TCM and erroneous clinical decision on prevention, diagnosis and treatment of disease.
The evaluation of tongue image quality has also attracted more and more attention from researchers. Image Quality Assessment (IQA) mainly evaluates the quality of images. Both manual and automatic methods can be used to evaluate images quality. At present, the main approach to removing pictures in these situations is manual. The manual method is based on TCM diagnosis and clinical experts' perception assessment of the quality of tongue image, including the sharpness of the tongue image, tongue extension posture, noise, etc. This method is costly, labor-intensive, error-prone and inefficient, and cannot be automated in real time. Therefore, an efficient and accurate quality control model of tongue image is essential to clinical use of tongue diagnosis instruments.
It is easy for human viewers to perceive a significant difference in the quality of a set of tongue images, while it is not so easy for a computer to automatically recognize tongue images of abnormal quality. In recent years, with the improvement of deep neural network models and the development of deep learning algorithms, the classification accuracy and efficiency of image technology based on CNNs have been greatly improved. It has been widely used in image segmentation, image classification, face recognition, etc., and has become the current Mainstream algorithm [11, 12]. Convolutional Neural Networks (CNNs), a representative of deep learning methods, has gradually become a research hotspot in the field of tongue diagnosis objectification.
In this research, we focus on the model construction method of automatically rejecting unqualified tongue images based on the deep CNN model to evaluate the quality of tongue images.