*Corresponding Author:
J. X. Hu
Department of Endocrinology, HwaMei Hospital
University of Chinese Academy of Sciences
Ningbo, Zhejiang 315010, China
E-mail: m16531752663_1@163.com
This article was originally published in a special issue, “Biomedical Research in Healthcare Setting”
Indian J Pharm Sci 2020:82(3)Spl issue5;108-114

This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as the author is credited and the new creations are licensed under the identical terms

Abstract

To study the automatic detection method of hyperthyroidism based on the deep learning of the modified LeNet-5 network and to establish a detection method with higher precision and better performance, a total of 180 facial images of patients with hyperthyroidism in the ultrasound imaging department of the HwaMei hospital were collected as well as 180 facial images of healthy persons were collected to serve as the control group. A method was proposed for the detection of hyperthyroidism based on the modified LeNet-5. Three test groups were designed by randomized controlled experiment design for the detection of hyperthyroidism, in which the control group was tested manually by experienced doctors, the LeNet-5 network learning group adopted the classical LeNet-5 network learning algorithm and the experimental group used the modified LeNet-5 network learning algorithm. Evaluation indices included the accuracy, detection efficiency, sensitivity, specificity and F1 score of the detection. At the same time, differences between the two algorithms in the detection of thyroid nodules were compared. There was no significant difference between the 3 groups in the accuracy and specificity of detection of hyperthyroidism. In terms of the detection efficiency and sensitivity, the performance of the network learning algorithm group and learning group was better than that of the control group. Both the network learning algorithm group and experimental group could detect the thyroid nodules accurately, but there was no significant difference in the accuracy of detecting the type of thyroid nodules. The correct rate of malignant thyroid nodules was significantly higher than that of the benign thyroid nodules. The modified LeNet-5 network algorithm showed acceptable consistency with experienced doctors in the detection of hyperthyroidism, and this method was useful for the exclusion of thyroid malignant tumours and can be used as a simple method to exclude and identify malignant thyroid tumours.

Keywords

Hyperthyroidism, LeNet-5 network model, Face recognition, Automatic detection

The thyroid gland was one of the most important glands in the human body; the thyroid hormones produced by the thyroid gland promote growth and development and regulate metabolism[1]. Hyperthyroidism was a clinical syndrome characterized by the excessive secretion of thyroid hormones caused by a variety of causes, which was mainly manifested by increased excitability and hyper metabolism of nervous, circulatory, digestive systems with a typical disease face[2-5]. Among them, the Grave’s disease (diffuse toxic goiter) was the most common, accounting for about 85 % of all patients with hyperthyroidism[6]. Hyperthyroidism was a common endocrine disease; the epidemiological analysis shows that the incidence was 0.14-0.5 %[7].In clinical practice, hyperthyroidism was generally diagnosed by the patient’s appearance and ultrasonic examination[8-10]. However, the diagnosis depends on the operator’s clinical experience[6-8], which means that less experienced doctors were prone to misdiagnose resulting in an increase in the number of fine needle aspiration (FNA) biopsies. Therefore, it is necessary to develop an automatic detection method of hyperthyroidism to reduce the workload of the doctors, and the dependence on operators, and at the same time to reduce the misdiagnosis rate.

In recent years, the automatic detection methods of hyperthyroidism can be divided into two categories, the artificial feature method and data-driven method. The commonly used artificial feature methods include the local binary pattern (LBP)[11], gray level co-occurrence matrix (GLCM)[12]and discrete wavelet transform (DWT). However, these methods can only generate low-level information, such as image texture, geometry and statistical distribution and usually need classifiers for classification. Only by giving suitable characteristics for classifying, a proper classification can be arrived at. For data-driven methods, a convolution neural network (CNN) can improve the classification and detection performance of natural images greatly without manual feature description[13-15]. In recent years, deep learning has developed rapidly, especially the LeNet-5 network, which was widely used in a variety of computer vision tasks that greatly promoted development of CNN. However, when the LeNet-5 network was used to classify the thyroid and other multi-class targets, the efficiency was not high[16]. In order to solve these problems, a new modified LeNet-5 network model was proposed in this investigation based on the analysis of LeNet-5 and other classical CNN. The modified LeNet-5 network introduces the convolution kernel module group, changes the number and size of the convolution kernel appropriately, introduces the batch normalization (BN) layer to standardize the input data, replaces the Sigmoid function with Relu activation function and the full connection layer with global pooling layer. And after experimental verification, the expected outcome was achieved.

Materials and Methods

The modified LeNet-5 network

In view of the problems with the traditional LeNet-5 network in identifying thyroid nodules, the following improvements were made. Convolution module groups Inception_conv3 and Inception_conv5 replaced the Conv3 and Conv5 of LeNet-5 network, respectively. While extracting more features of the target, increase the depth of the network. The two convolution module groups contain 2-3 convolution layers, respectively. Selection of the convolution kernel was related to whether the effective features can be extracted. The original 5×5 convolution kernel was not effective in the facial feature extraction of hyperthyroidism. Thus the size of the convolution kernel was changed and replaced the original convolution kernel with 2 3×3 convolution kernels in the Inception_conv3 and Inception_conv5 modules. The traditional LeNet-5 network has a small number of convolution kernels per layer, which cannot extract the features of the target fully from the face data. Therefore, the number of convolution kernels should be changed reasonably. The introduction of the BN layer can standardize the input batch samples and improve the input of the neural network, which can improve the training speed of the neural networks to a certain extent and prevent the over-fitting and gradient disappearance. The Sigmod function adopted by the traditional LeNet-5 was not universal and the Relu function has the characteristic of preventing passing of the negative signal, which was more efficient and can improve the convergence speed. The parameters of the full connection layer account for about 80 % of the network, which greatly increases the network training time. Experiments showed that the full connection layer has no effect on the classification, so the global pooling layer Max_pool8 was used to replace the full connection layer.

The detection of thyroid nodule by the modified LeNet-5 network

The ultrasonic image of the thyroid nodule was the input layer with a size of 64×64. The first layer was Conv1. The size of convolution kernel was 5×5 and the number of convolution kernel was 32 and 32 feature maps with a size of 60×60 can be obtained by Conv1. In the second layer, the pool size of Max_pool2 was 2×2 and the step size was 2. And 32 feature maps with a size of 30×30 can be obtained by Max_pool2. The third layer was Inception_conv3, which has 4 branches, Branch_0, Branch_1, Branch_2 and Branch_3. Branch_0 uses 24 1×1 convolution kernels, Branch_1 uses two cascaded convolution kernels, which were 16 1×1 convolution kernels and 24 5×5 convolution kernels, respectively. Branch_2 replaces the 5×5 convolution kernels in Branch_1 with two 3×3 convolution kernels to reduce the number of parameters and extract more abundant features of the target. That is, Branch_2 consisted of 32 1×1, 64 3×3, and 64 3×3 three-stage convolution kernel cascades and Branch_3 consisted of a 3×3 mean pool and 16 1×1 convolution cascades and finally, 128 feature maps with a size of 30×30 can be obtained by Inception_conv3.

The fourth layer was a Max_pool4, with a maximum pool size of 2×2 and a step size of 2. And 128 feature maps with the size of 15×15 can be obtained by Max_ pool4. The fifth layer was Inception_conv5, which has the same structure as Inception_conv3. It also has 4 branches, namely Branch_0, Branch_1, Branch_2 and Branch_3. The difference was that the number of convolution kernels changed to 64, 64, 96, and 32, respectively and 256 feature maps with the size of 15×15 can be obtained by Inception_conv5. The sixth layer was Max_pool6, the pool size is 3×3 and the step size was 3. And 256 feature maps with the size of 5×5 can be obtained by Max_pool6. In the seventh layer, the size of Conv7 was 3×3, and the number of convolution kernel was 62. And 62 feature maps with the size of 3×3 can be obtained by Conv7.

The eighth layer was the global pooling layer, and the pooling size was 3×3 and 62 feature maps with the size of 3×3 were pooled to 1×1 and the feature vectors were input to the Softmax layer. The convolution kernel size of Conv8 was 3×3 and the number of convolution kernels was 62. An attempt was made to use the Conv8 in the dotted frame to replace the full connection layer of the traditional LeNet-5 network for network improvement. However, the experimental results showed that the effect of Conv8 was not as good as that of Max_pool8. The modified LeNet-5 network parameters were shown in Table 1. The surface of the modified network layer was 8 layers, but it was actually 12 layers (excluding input layer and Softmax layer) due to the Inception_conv3 and Inception_conv5 were all three layers, which increases the depth of the network.

Iterati0076e times Training results (%) p value
LN group RLN group
137 98.334 98.622 0.886
152 98.338 98.513 0.913
201 98.546 98.536 0.996
206 98.664 98.714 0.893
214 98.546 98.742 0.875

Table 1: Training Results of Different Iterations of Two Algorithms

Facial images of patients with hyperthyroidism were received from the ultrasound imaging department of HwaMei hospital. A total of 180 facial images of patients and 180 images of healthy individuals were collected and a data database of these patients was established.

Experimental design

In this investigation, all the image data collected were tested 3 times by experienced clinicians (control group), programs based on LeNet-5 network algorithm (LN group) and programs based on the modified LeNet-5 network algorithm (RLN group), respectively. The 3 tests were carried out randomly and doubleblindly (fig. 1).According to the elements of computer recognition, a standard database of disease-related data was established, including the medical history and facial images of patients. The types of diseases including but not limited to the hyperthyroidism, goiter, hypothyroidism, thyroid tumours, and other non-thyroid-related diseases. The system was training and deep learning, including the batch sample training to the public data resources and the single training to the database data and analyses of the results. On this basis, the empirical parameters set for facial feature recognition of patients with hyperthyroidism were added and the parameters were adjusted to improve the accuracy and sensitivity of the recognition.

IJPS-work-flow

Figure 1: The work flow of the thyroid nodule detection by the modified LeNet-5

Evaluation indicators and methods

In the program learning, according to the real categories and the categories predicted by the classification model, the records in the data set could be divided into 4 categories, true positive (TP), false positive (FP), true negative (TN) and false negative (FN). TP means the correct recognition, the positive sample was correctly identified as positive. FP means the wrong recognition, the negative sample was erroneously identified as positive. TN means correct recognition again, the negative sample being correctly identified as negative. FN means wrong recognition again the positive sample being mistakenly identified as a negative.Accuracy was used to measure the proportion of all correct judgments in the classification model, which was defined as (AC), AC=(TP+TN)/(TP+TN+FP+FN). Sensitivity was used to measure the correct proportion of model prediction, the true value of all results was positive, it was defined as (PC), PC=TP/(TP+FP). Specificity was used to measure the proportion of correct prediction in all results, the true value was negative, it was defined as (ST), ST=TN/ (TN+FP). F1-Score was a comprehensive index, which was the harmonic average of accuracy and sensitivity. It was defined as (FS), FS=2TP/(2TP+FP+FN). The range of accuracy, precision, sensitivity, and specificity was expressed by a decimal between 0 and 1. The higher the value, the better the classification performance of the model.A recognition control experiment was also conducted on hyperthyroidism patients with thyroid nodules. Two algorithm programs were used for nodule segmentation and nodule diagnosis. Each nodule was segmented once. Nodular segmentation was evaluated visually and was divided into 3 categories. Excellent, the segmented part completely matches the nodule. Satisfactory, but not perfect, the segmented part still represented the nodule. The visual estimate of the maximum contour mismatch between the cover layer and the nodule is no more than 30 % and Poor, some nodules are segmented, but the segmented contours do not represent nodules (estimated mismatch >30 %).

The experiment was based on the Tensorflow deep learning platform under Linux system. The processor was Intel core i7mur7700k, the main frequency was 4.2 GHz, the memory was 16 GB, the video card was NVdia GTX1080TI, and the memory of GPU was 11 GB. In order to give full play to the performance of GPU, the size of Mini-batch was set to 2n (256), the number of Train_num was set to 1000, and Learn_rate was set to 0.05. The learning rate was attenuated by the exponential decay method, and the Deacy was set to 0.96, weight attenuation of BN was set to 0.005.

Statistical analysis

The data were processed by SPSS 18.0, the measurement data were expressed as mean±SD. Independent sample t-test was used between groups, paired sample t-test was used within groups, and the counting data were expressed as a percentage and the χ2 test was used. p<0.05 means the difference was statistically significant.

Results and Discussion

When the LeNet-5 network loads data training, the data set was randomly divided into 80 % training set and 20 % verification set and the network weight was randomly initialized, which will cause small fluctuations in the training results. Therefore, when training the network, different iterative times were trained, and the average value was taken for three times in each iteration. The training results of different iterative times were shown in Table 1. The average training results of the two research groups were about 98.5 %. Although the training results of the two groups were different, there was no statistical difference (p>0.05), indicating that the convergence of sample training tends to be stable and will not fluctuate greatly.

After the training of the 2 algorithm programs, the 3 research groups independently used the data collected for face image recognition. The results showed that although the accuracy of the LN group and the RLN group was lower than that of the control group, there was no statistical difference among the 3 groups (fig. 2). The time spent in the LN group and the RLN group was significantly lower than that in the control group, and the time consumed in the RLN group was the least. It indicated that the screening efficiency of patients with hyperthyroidism can be improved based on the LeNet-5 network algorithm.

IJPS-Accuracy

Figure 2: Accuracy and time required for detection in the three study groups

As shown in fig. 3, compared to the control group, the sensitivity of the LN group and the RLN group was significantly higher than that of the control group (p<0.05), but there was no significant difference between the LN group and the RLN group. In terms of F1-Score, there was no significant difference among the 3 groups (p<0.05), indicating that LeNet-5 network algorithm could obtain the disease features of patients with hyperthyroidism effectively, especially our modified LeNet-5 network algorithm.

IJPS-Differences

Figure 3: Differences in sensitivity and F1-Score among the three research groups

The test results were shown in fig. 4, there was no significant difference between the 3 groups (p>0.05), indicating that the LeNet-5 network algorithm has good accuracy in the detection of hyperthyroidism. As shown in fig. 5, the success rate of the RLN group was 5 % higher than that of the LN group and the success rate of the 2 algorithms for thyroid nodules was 80 %. There was no significant difference between the same types of thyroid nodules (p>0.05). The success rate of the segmentation of malignant thyroid nodules was significantly higher than that of benign thyroid nodules (p<0.05). Excellent, satisfactory and poor segments of benign thyroid nodules were 15.3 (n=9), 66.1 (n=39) and 18.6 % (n=11), respectively. The excellent, satisfactory and poor segments of malignant thyroid nodules were 27.9 (n=12), 67.4 (n=29) and 4.7 % (n=2), respectively (p=0.04).

IJPS-three-research

Figure 4: Differences in the specificity of the three research groups

IJPS-thyroid-nodules

Figure 5: Detection of thyroid nodules by two algorithms

In this study, the modified LeNet-5 network algorithm for automatic recognition and diagnosis of hyperthyroidism was evaluated. The accuracy and specificity of the modified LeNet-5 network algorithm for the detection of hyperthyroidism were consistent as noted with experienced doctors and the efficiency and sensitivity were higher than that of the experienced doctors. At the same time, the success rate of the modified LeNet-5 network algorithm in the detection of malignant thyroid nodules was significantly better than that of benign thyroid nodules.

In recent years, the face recognition technology has developed rapidly and achieved many research results, which has become one of the important research directions in the field of computer vision[17]. However, because of its complexity, it was easily affected by many factors, such as light, expression, posture. Therefore, the designed algorithm needs to weaken the intra-class changes between the same individuals and strengthen the inter-class changes between different individuals, such as PCA[18], LDA[19], Bayesian Face[20] and other face recognition algorithms based on subspace learning. The research of face recognition algorithm based on CNN has made great progress in the past decade. After a lot of data training, these models changed from DeepID, DeepID2, LeNet-5, Face-Net to VGG-Face, these can recognize the features of human face accurately, thus achieving better results in face recognition verification. The LeNet-5 network deep learning method has high accuracy and sensitivity, which accelerates the popularization and application of neural network learning. However, the classical LeNet-5 network algorithm involves many manually modified parameters and complex data algorithm structure, so its efficiency was not high, and the parameter setting depends on experience. Therefore, this investigation made an improvement and introduced the convolution group, which was the core module of the network learning depth algorithm. Firstly, the input facial data of patients were standardized, and then the convolution kernel was modified according to the facial characteristics of patients with hyperthyroidism. Finally the global optimization is carried out by the Relu function to improve the efficiency of solution and recognition in the case of global optimization.

In order to evaluate the efficiency and success rate of LeNet-5 network algorithm in detecting hyperthyroidism, experienced doctors were introduced to diagnose hyperthyroidism. Results obtained showed that there was no significant difference in the detection rate between LeNet-5 network algorithm and the experienced doctors in terms of accuracy or specificity in the detection of hyperthyroidism, indicating that the LeNet-5 network algorithm was a reliable and faulttolerant method for the detection of hyperthyroidism. At the same time, the detection efficiency of LeNet-5 network algorithm was about 3 times higher than that of the control group, and its sensitivity to hyperthyroidism was also significantly higher than that of the control group, which showed that LeNet-5 network algorithm could detect hyperthyroidism efficiently and quickly. In order to further verify the superiority and reliability of this modified LeNet-5 network algorithm, thyroid nodules were detected. The detection results showed that the success rate of the improved LeNet-5 network algorithm was higher than that of LeNet-5 network algorithm. At the same time, it was found that the success rate of LeNet-5 network algorithm for the segmentation of malignant thyroid nodules was significantly higher than that of benign thyroid nodules, which indicated that the LeNet-5 network algorithm has the potential to be used to detect malignant thyroid nodules.

In summary, the use of our improved LeNet-5 network algorithm shows acceptable consistency with the experienced doctors in detecting the characterization of hyperthyroidism, and this method was useful for the exclusion of thyroid nausea tumours, and can be used as a simple method to exclude and determine the biopsy requirements of malignant thyroid tumours.

Acknowledgement

This study was supported by Zhejiang Medical Science and Technology Research Foundation, China (No. 2018KY697).

Conflict of Interest

All authors report no conflicts of interest in this work.

References