Automated Glaucoma Detection Using Deep Convolutional Neural Networks

doi:10.21203/rs.3.rs-2788554/v1

Research Article

This work is licensed under a CC BY 4.0 License

Version 1

Glaucoma is a degenerative eye disease that affects the optic nerve. If untreated, it can lead to irreversible vision loss and blindness, so early detection and treatment are essential. In this paper, we propose a deep learning-based method for the automated detection of glaucoma from fundus images. We design and implement two convolutional neural network models, modified VGG16 and modified ResNet-50, for automatic feature extraction and classification. On the ACRIMA dataset, the modified VGG16 achieved 94% accuracy, 80.95% specificity and 97.47% sensitivity, while the modified ResNet-50 achieved 93% accuracy, 85.71% specificity and 94.94% sensitivity. Both models outperformed the existing glaucoma detection methods in the literature and provided state-of-the-art results. The proposed deep learning models have the potential to significantly improve the accuracy, speed and convenience of glaucoma screening and diagnosis, especially in resource-limited settings. Our results suggest that deep learning models can serve as practical tools for automated glaucoma detection and can assist clinicians in early diagnosis, leading to timely treatment.

Glaucoma

Deep learning

VGG16

ResNet-50

CNN

Glaucoma is a chronic, progressive eye disease and one of the leading causes of blindness worldwide among people aged 40–60 years. It accounts for over 10 million physician visits annually[1]. In India, about 12 million people were estimated to be affected by glaucoma by 2022, a number expected to double by 2040[2]. Glaucoma comprises a group of eye conditions that damage the optic nerve, which relays visual information from the eye to the brain. Optic nerve damage is usually associated with high pressure in the eye, but glaucoma can also occur when eye pressure is normal. Glaucoma progresses slowly and initially affects peripheral vision[3]. If diagnosis or treatment is delayed, it can result in blindness. Since the condition has few symptoms in its initial stages, early detection is crucial for preventing vision loss[4]. The main symptoms are blurred vision and iridescent circles around bright lights, usually in both eyes, although one eye may be worse. Sometimes glaucoma develops suddenly and causes severe eye pain, nausea, vomiting, red eyes, headache, tenderness around the eyes and blurred vision[5].

The optic disc is a round, yellowish area of the retina where the retinal nerve fibers, which transmit visual signals from the eye to the brain, converge and exit the retina[6]. The optic cup[7] is the pale, horizontally oval depression at the center of the optic disc, covering almost 30% of the disc in healthy adults[8]. The fundus is located at the back of the eye and consists of the retina, optic nerve and retinal blood vessels. It can be imaged using a fundus camera, a specialized low-power microscope with an attached camera[9]. Fig. 1 shows the optic disc and cup in a fundus image taken with a fundus camera. Fundus photography has emerged as a non-invasive, effective tool for assessing eye health.

Traditionally, glaucoma is detected using methods such as tonometry, which measures intraocular pressure[10], and optical coherence tomography (OCT), which measures the thickness of the retina[11]. While effective, these methods have limitations such as dependence on operator skill and sensitivity to patient movement[12]. Further, manual mass screening of fundus images can be exhausting for the ophthalmologist and requires extreme precision to avoid errors in disease identification[13]. To overcome these limitations, the proposed work focuses on a deep learning-based glaucoma detection solution. Deep learning is an artificial intelligence technique that learns significant features from existing data and uses them to make predictions on new data[14]. It has been successfully applied in various fields, including image analysis, natural language processing and drug discovery. Deep learning has also recently been applied to medicine[15], offering promising results over traditional machine learning and image processing techniques[16].

Contribution

In this paper, we have proposed two convolutional neural network-based models for automatically detecting glaucoma from fundus images. The key contributions of the proposed work are as follows:

We have proposed a CNN-based model for the detection of glaucoma from fundus images. The original VGG16 network is modified by adding two fully connected layers, a ReLU activation layer and a SoftMax layer. These modifications help achieve higher accuracy and better performance than the original VGG16 network.

We have proposed a residual network-based model for classifying glaucoma from fundus images. The original ResNet-50 network is modified by adding two fully connected layers, a ReLU activation layer and a SoftMax layer. The residual network is easy to optimize and mitigates the degradation problem.

In comparison with existing work, the proposed work achieves state-of-the-art results for classifying glaucoma.

Paper organization

Section 2 reviews the existing literature on deep learning-based glaucoma detection and classification methods. Section 3 details our proposed approach, with a focus on datasets, data preparation, modified VGG16, modified ResNet-50 and performance parameters. Section 4 presents the experimental setup, hyperparameter tuning, results of the proposed models on different datasets, and a comparative analysis with existing research. Section 5 concludes the paper by summarizing the contributions, limitations and future work.

Convolutional neural networks (CNNs) were proposed by Yann LeCun[17] in 1989, who described them as a biologically inspired adaptation of multilayer perceptrons. In 2012, the ImageNet competition showed the vast potential of CNNs in various fields; their success was mainly due to better and faster computational resources. Recently, different deep learning models have been used in the analysis of medical images, especially in the detection of diseases such as cancer[18], diabetic retinopathy[19] and glaucoma[20]. Several studies have proposed deep learning-based classification models for the detection of glaucoma in fundus images.

In 2015, Chen et al.[21] proposed a six-layer deep learning model to detect glaucoma in fundus images. Input images were preprocessed to obtain regions of interest, and data augmentation was performed by extracting random patches of size 224 X 224 for CNN training. The model achieved area under the curve (AUC) values of 0.832 and 0.887 on the ORIGA and SCES datasets, respectively.

Alghamdi et al.[22] used two sequential deep learning architectures to detect optic disc abnormalities in fundus images. The authors used multiple classifiers and deep CNNs to extract optic disc regions, which were then given as input to a second deep CNN that identified whether an image was healthy or not. The model achieved an accuracy of 86.52% on the HAPIEE dataset and 97.76% on the PAMDI dataset.

Abbas[23] also proposed deep learning-based glaucoma detection. He used a convolutional neural network architecture to extract features from the fundus images and then used a deep belief network to select the most discriminating features. The model achieved 84.50% average sensitivity, 98.01% specificity and 99% accuracy on a dataset combining the PRV-Glaucoma, DRIONS-DB, HRF and sjchoi86-HRF datasets.

Orlando et al. [24] used two convolutional neural networks, OverFeat and VGG-S, for detecting glaucoma. The authors also used preprocessing techniques such as vessel repair, contrast-limited adaptive histogram equalization (CLAHE) and cropping around the optic nerve head for better classification. The models obtained AUC values of 0.7212 and 0.6655 for OverFeat and VGG-S, respectively, when tested on the Drishti-GS1 dataset.

Diaz-Pinto et al.[24] compared the performance of five ImageNet-trained models (Xception, Inception V3, VGG16, ResNet-50 and VGG19) for the automatic detection of glaucoma in fundus images. The models were evaluated on five datasets, namely Drishti-GS1, HRF, sjchoi86-HRF, RIM-ONE and ACRIMA. The authors showed that the Xception model outperformed the other models, with an accuracy of 80% on the HRF dataset.

Serte and Serener[25] also compared the glaucoma classification performance of the ResNet-50, ResNet-152 and GoogLeNet architectures using five datasets, namely HRF, Drishti-GS1, RIM-ONE, sjchoi86-HRF and ACRIMA. ResNet-152 obtained the highest accuracy of 77% on the RIM-ONE dataset compared to the other networks under study.

The recent literature discussed above shows the potential of using deep learning to automatically detect glaucoma using fundus images. Such deep learning models are more effective than traditional approaches and can also assist ophthalmologists in the early diagnosis and treatment of the disease.

This section provides an overview of the methodology used in the proposed work. A schematic diagram of the method is shown in Fig. 2. The input fundus images are first resized to 256 X 256. Data augmentation techniques are then applied to increase the size of the dataset, and the images are split into training, validation and test sets. The images are fed to a deep learning-based glaucoma detection model, in which automatic feature extraction and classification occur in the initial and final layers, respectively. The results are then evaluated using various performance metrics.

3.1 Dataset

In the proposed work, we have used three publicly available datasets, namely Drishti-GS [26], ACRIMA[24], and sjchoi86-HRF[27]. The details of these are given in Table 1.

Drishti-GS

This dataset comprises 70 glaucomatous and 31 healthy eye images from Aravind Eye Hospital, Madurai, India. The fundus images were collected from different patients and represent a diverse range of glaucoma cases.

ACRIMA

The dataset was collected in Spain under the Automated Central Retinal Image Analysis project, which aims to develop an automated central retinal image analysis method. The dataset comprises 705 images, with 396 glaucomatous and 309 healthy fundus images.

sjchoi86-HRF

The dataset has 401 images, 101 glaucomatous and 300 healthy fundus images. The images in the sjchoi86-HRF dataset were collected from various sources, including clinics and hospitals in the Netherlands.

Table 1

Dataset details for glaucoma detection
Dataset	Glaucoma	Healthy	Total
DRISHTI-GS	70	31	101
ACRIMA	396	309	705
sjchoi86-HRF	101	300	401

3.2 Data preparation

In the proposed work, images from three publicly available datasets, namely Drishti-GS, ACRIMA, and sjchoi86-HRF are preprocessed before training the glaucoma detection model. The fundus images are resized to 256 X 256 pixels. Then the images are split into 70:20:10 for model training, testing, and validation. Since deep learning models require a large amount of data to get the best results, seven data augmentation techniques are applied, namely zooming, width shift, channel shift, rotation, shear, height shift and horizontal flip. The details of these are provided in Table 2.

Table 2

Data Augmentation Technique
Data Augmentation Technique	Value
Zooming	0.1
Width shift	0.1
Channel shift	10
Rotation	10°
Shear	0.15
Height shift	0.1
Horizontal flips	Yes
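The data preparation step can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the dictionary keys mirror the keyword arguments of Keras' `ImageDataGenerator` (using that class is an assumption), the file names are hypothetical placeholders, and a plain shuffled split is assumed because the paper does not say whether the split was stratified.

```python
import random

# Augmentation settings from Table 2. The key names follow the keyword
# arguments of Keras' ImageDataGenerator; using that class is an assumption.
AUGMENTATION = {
    "zoom_range": 0.1,
    "width_shift_range": 0.1,
    "channel_shift_range": 10.0,
    "rotation_range": 10,  # degrees
    "shear_range": 0.15,
    "height_shift_range": 0.1,
    "horizontal_flip": True,
}


def split_dataset(items, ratios=(0.7, 0.2, 0.1), seed=0):
    """Shuffle items and split them into train, test and validation lists."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n_train = int(len(shuffled) * ratios[0])
    n_test = int(len(shuffled) * ratios[1])
    train = shuffled[:n_train]
    test = shuffled[n_train:n_train + n_test]
    val = shuffled[n_train + n_test:]  # remainder goes to validation
    return train, test, val


# Hypothetical file names standing in for the 705 ACRIMA images.
images = [f"img_{i:03d}.jpg" for i in range(705)]
train, test, val = split_dataset(images)
print(len(train), len(test), len(val))
```

Fixing the random seed keeps the three subsets reproducible between runs, which matters when results on small test sets are compared across models.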

3.3 Proposed model

In the proposed work, we have modified two CNN architectures, VGG16 and ResNet-50, for binary classification of glaucoma from fundus images. Each model automatically performs feature extraction and classification on the input images. Owing to its small filters and deep architecture, the VGG16 model learns robust and invariant features for image classification. The residual learning approach in ResNet-50 learns the residual mapping between input and output instead of the complete mapping, which helps mitigate the vanishing gradient problem.
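The residual mapping mentioned above can be written compactly in the standard formulation of residual learning. The symbols are notation introduced here for illustration: x is the input to a block, H(x) is the complete mapping the block should realize, and F(x) is the residual that the stacked layers actually learn.

```latex
\[
  \mathcal{F}(x) = \mathcal{H}(x) - x
  \quad\Longrightarrow\quad
  y = \mathcal{F}(x) + x
\]
```

Because the identity term x is passed through a shortcut connection, the gradient has a direct path back to earlier layers, which is the usual explanation for why such networks are easier to optimize.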

3.3.1 Modified VGG16 Architecture

The VGG16 architecture is a deep convolutional neural network developed by the Visual Geometry Group (VGG) at the University of Oxford in 2014 [28]. The network has 16 layers and performed well on the ImageNet dataset.

The VGG16 architecture uses small 3x3 convolutional filters and multiple convolutional and pooling layers. The pooling layers reduce the spatial dimensions of the feature maps, allowing the network to learn hierarchical features and reduce the number of parameters. The architecture also uses fully connected layers to make the final classification decision.

In the proposed work, we have used a modified VGG16 architecture for glaucoma detection, as shown in Fig. 3. The model consists of 3 X 3 convolution layers followed by flatten, dense and SoftMax layers. Compared with the original VGG16, the modified version adds a flatten layer followed by a dense layer of 512 units with ReLU activation, and ends with a SoftMax layer with two outputs for the healthy and glaucomatous classes. These added layers help to achieve higher accuracy and better performance than the original VGG16 architecture.

The modified VGG16 architecture takes an input image of size 256 X 256, followed by five blocks of convolutional layers with filter size and count (3 X 3, 64), (3 X 3, 128), (3 X 3, 256), (3 X 3, 512) and (3 X 3, 512), respectively. Each block is followed by a 2 X 2 max pooling layer. The output of the last convolutional block is then flattened and passed through three fully connected layers of size 7 X 7. The model also contains an additional flatten layer, which helps the model capture important spatial information from the output of the final dense layer. This is followed by a 7 X 7 dense layer and a SoftMax layer. The details of the modified VGG16 network are shown in Fig. 4.
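The effect of the five pooling stages on the feature-map size can be checked with a little arithmetic. The sketch below assumes 'same' padding in every 3 X 3 convolution and a 2 X 2 max pooling with stride 2 after each block, as in the standard VGG16. It also assumes, purely for illustration, that the added 512-unit dense layer is attached directly to the flattened convolutional output. The function names are hypothetical.

```python
def vgg16_feature_shape(input_size=256):
    """Return (height, width, channels) after the five VGG16 conv blocks.

    Assumes 3x3 convolutions with 'same' padding (spatial size unchanged)
    and a 2x2 max pooling with stride 2 after each block.
    """
    filters_per_block = [64, 128, 256, 512, 512]
    size = input_size
    for _ in filters_per_block:
        size //= 2  # each 2x2 max pooling halves height and width
    return size, size, filters_per_block[-1]


def dense_params(n_inputs, n_units):
    """Number of weights plus biases in a fully connected layer."""
    return n_inputs * n_units + n_units


h, w, c = vgg16_feature_shape(256)
flattened = h * w * c
print((h, w, c), flattened)          # (8, 8, 512) 32768
print(dense_params(flattened, 512))  # 16777728
print(dense_params(512, 2))          # 1026
```

With the 224 X 224 input of the original VGG16, the same arithmetic gives a 7 X 7 X 512 feature map; with the 256 X 256 input used here it gives 8 X 8 X 512.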

3.3.2 Proposed ResNet-50 Architecture

Residual networks are deep convolutional neural network architectures introduced by Microsoft Research[29] and designed for image classification tasks. Fig. 5 shows the modified 50-layer residual network used for glaucoma classification in the proposed work.

In comparison to the original network, the modified ResNet-50 architecture has three additional layers: a flattening layer, a dense layer with 512 neurons and ReLU activation, and a final SoftMax layer with two outputs for binary classification of glaucoma.

The details of the modified ResNet-50 for glaucoma detection are shown in Fig. 6. The network begins with a 7 X 7 convolutional layer with 64 filters and a stride of 2, followed by 3 X 3 max pooling with a stride of 2. Four convolutional stages follow, each built from residual blocks consisting of a 1 X 1 convolution, a 3 X 3 convolution and another 1 X 1 convolution. The first, second, third and fourth stages contain 3, 4, 6 and 3 residual blocks, respectively. The output of the last stage is passed through an average pooling layer, a flatten layer and three fully connected layers of size 7 X 7. This is followed by an additional flatten layer, which helps the model capture important spatial information from the final dense layer, then a 7 X 7 dense layer and, finally, a SoftMax layer.
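The block counts above account for the "50" in the network's name, which the following sketch verifies. It follows the usual counting convention (shortcut projection convolutions are not counted) and assumes the standard ResNet-50 strides; the function names are hypothetical and the code describes the original backbone, not the added classification head.

```python
# Residual blocks per stage, as listed in the text above.
BLOCKS_PER_STAGE = [3, 4, 6, 3]
CONVS_PER_BLOCK = 3  # 1x1, 3x3, 1x1


def resnet50_weight_layers():
    """Count the weight layers that give ResNet-50 its name."""
    stem = 1  # initial 7x7 convolution
    body = sum(BLOCKS_PER_STAGE) * CONVS_PER_BLOCK
    head = 1  # final fully connected layer of the original network
    return stem + body + head


def resnet50_feature_size(input_size=256):
    """Spatial size of the feature map before average pooling.

    Assumes stride 2 in the 7x7 convolution, stride 2 in the max pooling,
    and one stride-2 downsampling in each stage after the first.
    """
    size = input_size // 2  # 7x7 convolution, stride 2
    size //= 2              # 3x3 max pooling, stride 2
    for _ in BLOCKS_PER_STAGE[1:]:
        size //= 2          # each later stage downsamples once
    return size


print(resnet50_weight_layers())    # 50
print(resnet50_feature_size(256))  # 8
```

Under these assumptions a 256 X 256 input reaches the average pooling layer as an 8 X 8 feature map, compared with 7 X 7 for the 224 X 224 input of the original network.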

3.4 Performance Analysis

The confusion matrix is a performance measurement tool used to evaluate various parameters of a classification model. In binary classification, the confusion matrix is represented as a 2 X 2 matrix, with the rows indicating the actual class and columns indicating the predicted class. The entries in the matrix are the number of true positive (TP), false positive (FP), true negative (TN), and false negative (FN) predictions made by the model [30].

True positives (TP) are the number of images correctly predicted as glaucoma, whereas false positives (FP) are the number of images incorrectly predicted as glaucoma. True negatives (TN) are the number of images correctly predicted as healthy, whereas false negatives (FN) are the number of images incorrectly predicted as healthy.

Accuracy is the proportion of correctly identified cases (both positive and negative) out of the total number of cases, given by Eq. (1).

\[ \text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN} \tag{1} \]

Sensitivity is the proportion of actual positive cases that are correctly identified as positive, given by Eq. (2).

\[ \text{Sensitivity} = \frac{TP}{TP + FN} \tag{2} \]

Specificity is the proportion of actual negative cases that are correctly identified as negative, given by Eq. (3).

\[ \text{Specificity} = \frac{TN}{TN + FP} \tag{3} \]
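The three measures defined above can be computed directly from the confusion-matrix counts. In the sketch below the function name is hypothetical and the counts are illustrative: they were chosen so that the percentages match those listed for modified VGG16 on ACRIMA in Table 4, but the underlying counts are not given in the paper.

```python
def classification_metrics(tp, fp, tn, fn):
    """Accuracy, sensitivity and specificity from confusion-matrix counts."""
    total = tp + fp + tn + fn
    return {
        "accuracy": (tp + tn) / total,
        "sensitivity": tp / (tp + fn),  # true positive rate
        "specificity": tn / (tn + fp),  # true negative rate
    }


# Illustrative counts only; not reported in the paper.
metrics = classification_metrics(tp=17, fp=2, tn=77, fn=4)
for name, value in metrics.items():
    print(f"{name}: {100 * value:.2f}%")
# accuracy: 94.00%
# sensitivity: 80.95%
# specificity: 97.47%
```

On a test set of this size a single misclassified image shifts sensitivity by several percentage points, so the three measures are best read together rather than in isolation.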

This section details the experimental setup, hyperparameters, and experiments performed in the proposed work.

4.1 Experimental Setup

In the proposed work, we have used two open-source deep learning libraries, TensorFlow[31] and Keras[32], for training the classification models. TensorFlow is a prominent open-source platform for building and training machine learning models, and Keras is a high-level API built on top of TensorFlow that makes it easier to build and experiment with deep learning models. The models were trained on an Intel(R) Xeon(R) processor and a Tesla K80 GPU with 12 GB RAM.

4.2 Hyperparameter tuning

For glaucoma detection, the images were split 70:20:10 for training, testing and validating the deep learning models. The images were also augmented to increase the dataset size. Thereafter, the VGG16 and ResNet-50 models were tuned using various hyperparameters. Hyperparameter tuning is an essential aspect of training deep learning models and can significantly affect their performance. Our study considers the learning rate, optimizer, loss function, batch size and number of epochs, discussed as follows:

Learning rate: This is one of the most critical hyperparameters, as it determines the model's pace of learning and convergence[33]. In our study, the initial learning rate is set to 0.1, and an exponential decay function reduces it during training. The learning rate decreases as the number of iterations increases, allowing the model to converge to a better solution. This schedule improves model performance and helps avoid overfitting.

Optimizer: The optimizer is the algorithm that minimizes the loss function during training. In our study, we have used the Adam optimizer, as it helps the model converge faster without getting stuck in a suboptimal solution[34].

Loss function: The loss function evaluates the model's performance. In our study, we have used the categorical cross-entropy loss function, which measures the difference between the predicted and actual class probabilities[35].

Batch size: The batch size is the number of samples fed into the network at each training step, the dataset being divided into batches of this size. It is an essential hyperparameter in training deep learning models, as it affects the speed and quality of the learning process[36]. In our work, a batch size of 16 provided optimal results.

Epochs: An epoch is a single pass over the entire training dataset[37]. We obtained the best results at 50 epochs for both proposed models.
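To make the loss function concrete, the sketch below computes a two-class SoftMax output and its categorical cross-entropy against a one-hot label using only the standard library. The logits and the class order [healthy, glaucoma] are hypothetical values chosen for illustration; in practice the deep learning framework evaluates this loss over each batch.

```python
import math


def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    shift = max(logits)  # subtracting the maximum improves numerical stability
    exps = [math.exp(z - shift) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def categorical_cross_entropy(one_hot, probs, eps=1e-12):
    """Cross-entropy between a one-hot label and predicted probabilities."""
    return -sum(t * math.log(max(p, eps)) for t, p in zip(one_hot, probs))


# Hypothetical logits for the classes [healthy, glaucoma].
probs = softmax([0.5, 2.0])
loss_correct = categorical_cross_entropy([0, 1], probs)  # true class: glaucoma
loss_wrong = categorical_cross_entropy([1, 0], probs)    # true class: healthy
print(probs, loss_correct, loss_wrong)
```

The loss is small when the model assigns high probability to the true class and grows quickly when it is confidently wrong, which is what drives the weight updates during training.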

In conclusion, the choice of hyperparameters significantly impacts the model's performance. The hyperparameters used in our study were selected based on previous studies and experiments. The details of hyperparameters used in training modified VGG16 and modified ResNet-50 glaucoma classification models are provided in Table 3.

Table 3

Hyperparameters for the glaucoma classification model.
Parameters	Modified VGG16 model	Modified ResNet-50 model
Target labels	2	2
Image size (pixels)	256 X 256	256 X 256
Batch size	16	16
Initial learning rate	0.1	0.1
Decay Rate	0.96	0.96
Number of epochs	50	50
Optimizer	Adam	Adam
Loss function	Categorical Cross entropy	Categorical Cross entropy
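The learning-rate schedule implied by Table 3 can be sketched as follows. The initial rate (0.1) and decay rate (0.96) come from the table; `decay_steps` is a hypothetical value, since the paper does not report how many steps separate successive decays. The formula has the same form as the continuous (non-staircase) exponential decay schedule available in Keras.

```python
def exponential_decay(step, initial_lr=0.1, decay_rate=0.96, decay_steps=100):
    """Learning rate after `step` optimizer updates.

    initial_lr and decay_rate come from Table 3; decay_steps is a
    hypothetical value, since the paper does not report it.
    """
    return initial_lr * decay_rate ** (step / decay_steps)


for step in (0, 100, 1000):
    print(step, round(exponential_decay(step), 5))
```

With these settings the rate falls by 4% every `decay_steps` updates, so the choice of `decay_steps` determines how quickly the schedule leaves the initial value of 0.1.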

4.3 Proposed models

This section details the experiments performed and the corresponding results obtained using modified VGG16 and modified ResNet-50 for glaucoma detection. We performed three experiments with each of the two proposed models. In the first experiment, the sjchoi86-HRF dataset was used to train and test the modified VGG16 and ResNet-50 models. In the second experiment, the Drishti-GS1 dataset was used for training and testing. In the third experiment, the ACRIMA dataset was used. Performance was evaluated in terms of accuracy, sensitivity and specificity.

4.3.1 Results on modified VGG16

The results of various experiments conducted for the modified VGG16 glaucoma detection model are discussed in this section.

In the first experiment, images from the sjchoi86-HRF dataset were used to train the modified VGG16 model for 50 epochs. The model obtained an accuracy of 87.5%, a sensitivity of 100% and a specificity of 86.96% with an image size of 256 X 256 and a batch size of 16. In the second experiment, images from the Drishti-GS1 dataset were used to train the modified VGG16 model for 50 epochs with a training, validation and test split of 60:10:30. The model obtained an accuracy of 70%, a sensitivity of 100% and a specificity of 68.97%. In the third experiment, the ACRIMA dataset was used to train the modified VGG16 model for 50 epochs. The model obtained 94% accuracy, 80.95% sensitivity and 97.47% specificity when tested with an image size of 256 X 256 and a batch size of 16.

Table 4

Experiments on various datasets using VGG16
Model	Dataset	Accuracy (%)	Sensitivity (%)	Specificity (%)
VGG16	sjchoi86-HRF	87.5	100	86.96
	Drishti-GS1	70	100	68.97
	ACRIMA	94	80.95	97.47

The details of the various experiments are shown in Table 4. The modified VGG16 model provides its best accuracy and specificity on the ACRIMA dataset, whereas its highest sensitivity (100%) is obtained on the sjchoi86-HRF and Drishti-GS1 datasets.

4.3.2 Results on modified ResNet-50

This section discusses the results of various experiments conducted for the modified ResNet-50 glaucoma detection model. After extensive experiments, it has been observed that the optimal results are obtained on image size 256 X 256 with batch size 16 and 50 epochs.

In the first experiment, the modified ResNet-50 model achieved 91.6% accuracy, 75% sensitivity and 95% specificity on images from the sjchoi86-HRF dataset. In the second experiment, the model obtained 73.33% accuracy, 100% sensitivity and 71.43% specificity on the Drishti-GS1 dataset with a training, validation and test split of 60:10:30. In the third experiment, the model obtained 93% accuracy, 85.71% sensitivity and 94.94% specificity on the ACRIMA dataset.

The details of the various experiments are shown in Table 5. The modified ResNet-50 model provides its best accuracy of 93% on the ACRIMA dataset. However, its highest sensitivity (100%) is obtained on Drishti-GS1 and its highest specificity (95%) on sjchoi86-HRF.

Table 5

Experiments on various datasets using ResNet-50
Model	Dataset	Accuracy (%)	Sensitivity (%)	Specificity (%)
ResNet-50	sjchoi86-HRF	91.6	75	95
	Drishti-GS1	73.33	100	71.43
	ACRIMA	93	85.71	94.94

4.4 Comparative Analysis

Tables 6, 7 and 8 compare the proposed modified VGG16 and modified ResNet-50 glaucoma detection models with the existing literature on the sjchoi86-HRF, Drishti-GS1 and ACRIMA datasets, respectively. The comparison shows that the proposed models outperform the models of Diaz-Pinto et al.[24] and Serte and Serener[25] in terms of accuracy on every dataset for which those results are reported.

Table 6

Performance comparison based on the sjchoi86-HRF dataset
Author/Method	Model	Accuracy (%)	Sensitivity (%)	Specificity (%)
Diaz-Pinto et al.[24]	Xception	70	70.33	70
Sertan Serte [25]	GoogLeNet	72	-	86
	ResNet-50	70	-	86
	ResNet-152	74	-	95
Proposed	VGG-16	87.5	100	86.96
Proposed	ResNet-50	91.6	75	95

On the sjchoi86-HRF dataset, the proposed models achieved an accuracy of 91.6% using the modified ResNet-50 and 87.5% using the modified VGG16 architecture, as shown in Table 6. All differences below are absolute differences in percentage points. The modified VGG16 model obtained 17.5 points and the modified ResNet-50 21.6 points higher accuracy than the Xception model[24]. Also, the proposed models delivered these results with 50 epochs, compared to the 200 epochs used by Diaz-Pinto et al.[24]. Compared to GoogLeNet[25], the modified VGG16 and modified ResNet-50 achieved 15.5 and 19.6 points higher accuracy, respectively. The modified VGG16 model improved accuracy by 17.5 and 13.5 points over the ResNet-50[25] and ResNet-152[25] models. Compared with the 50-layer residual model[25], the modified ResNet-50 improved accuracy and specificity by 21.6 and 9 points, respectively. The modified ResNet-50 also improved accuracy by 17.6 points over the ResNet-152[25] model, while both models had the same specificity (95%). Overall, the proposed models showed a minimum accuracy gain of 13.5 points over the existing models in the literature.
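The accuracy gains quoted above are absolute differences between the accuracy values in Table 6, and they can be recomputed in a few lines. The sketch below copies the accuracy column of Table 6; the dictionary and function names are hypothetical.

```python
# Accuracy (%) on sjchoi86-HRF, copied from Table 6.
BASELINES = {
    "Xception [24]": 70.0,
    "GoogLeNet [25]": 72.0,
    "ResNet-50 [25]": 70.0,
    "ResNet-152 [25]": 74.0,
}
PROPOSED = {"Modified VGG16": 87.5, "Modified ResNet-50": 91.6}


def accuracy_gains(proposed, baselines):
    """Absolute gain, in percentage points, of each proposed model over each baseline."""
    return {
        (model, base): round(acc - base_acc, 2)
        for model, acc in proposed.items()
        for base, base_acc in baselines.items()
    }


gains = accuracy_gains(PROPOSED, BASELINES)
for (model, base), gain in gains.items():
    print(f"{model} vs {base}: +{gain} points")
print("smallest gain:", min(gains.values()))
```

The same function can be applied to the accuracy columns of Tables 7 and 8 to check the differences reported for the Drishti-GS1 and ACRIMA datasets.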

Table 7

Performance comparison based on the Drishti-GS1 dataset
Author/Method	Model	Accuracy (%)	Sensitivity (%)	Specificity (%)
Sertan Serte [25]	GoogLeNet	55	-	81
	ResNet-50	53	-	77
	ResNet-152	63	-	74
Proposed	VGG-16	70	100	68.97
Proposed	ResNet-50	73.33	100	71.43

Table 7 shows that the modified VGG16 model improved accuracy by 7, 17 and 15 percentage points over ResNet-152[25], ResNet-50[25] and GoogLeNet[25], respectively. The modified ResNet-50 model improved accuracy by 10.33, 20.33 and 18.33 percentage points compared to ResNet-152[25], ResNet-50[25] and GoogLeNet[25], respectively. However, the specificity of the proposed models was lower than that reported in the existing literature.

Table 8

Performance comparison based on the ACRIMA dataset
Author/Method	Model	Accuracy (%)	Sensitivity (%)	Specificity (%)
Diaz-Pinto et al.[24]	Xception	70	68.93	70
Sertan Serte [25]	GoogLeNet	65	-	87
	ResNet-50	62	-	84
	ResNet-152	48	-	83
Proposed	VGG-16	94	80.95	97.47
Proposed	ResNet-50	93	85.71	94.94

On the ACRIMA dataset, the proposed models performed exceptionally well. All differences below are absolute differences in percentage points. The modified VGG16 achieved 24 points higher accuracy, about 12 points higher sensitivity and 27.47 points higher specificity than the model of Diaz-Pinto et al.[24].

Further, the modified VGG16 improved accuracy by 29, 32 and 46 points compared to GoogLeNet[25], ResNet-50[25] and ResNet-152[25], respectively, and showed a minimum increase of 10.47 points in specificity compared to the work of Serte and Serener[25]. The modified ResNet-50 model exhibited similar results, with an increase of 23, 16.78 and 24.94 points in accuracy, sensitivity and specificity, respectively, compared to the work of Diaz-Pinto et al.[24]. Compared to GoogLeNet[25], ResNet-50[25] and ResNet-152[25], the modified ResNet-50 increased accuracy by 28, 31 and 45 points, respectively. Across the proposed models, the minimum increase in specificity is around 7.94 points.

In general, the proposed models outperform the existing models in the literature for accuracy. The modified VGG16 and modified ResNet-50 glaucoma detection models obtained optimum results for accuracy, sensitivity and specificity over the ACRIMA dataset.

Glaucoma is an eye disease that affects many people worldwide. Timely detection can help control its gradual progression, thereby preventing permanent blindness. Traditional methods require manual inputs from ophthalmologists, such as the dimensions of the disc and cup, which delays treatment. Therefore, automatic, faster and more accurate methods for glaucoma detection need to be developed. In this paper, we proposed two deep learning-based automatic glaucoma detection models, namely modified VGG16 and modified ResNet-50.

On the sjchoi86-HRF dataset, modified ResNet-50 obtained the best accuracy (91.6%) and specificity (95%), with a sensitivity of 75%. On Drishti-GS1, modified ResNet-50 obtained the best results, with 100% sensitivity, 73.33% accuracy and 71.43% specificity. On the ACRIMA dataset, modified VGG16 provided the best accuracy (94%) and specificity (97.47%). Overall, the proposed glaucoma detection models show comparable or better performance than existing models in the literature.

In the future, performance improvement can be achieved by exploring more efficient data augmentation approaches and training the model on larger datasets. Further, research can be focused towards the development of a multistage glaucoma classification model, as it would enable better risk assessment.

Competing Interests: The authors declare no competing interests.

Authors' Contributions:

Sukhpal Singh: Data pre-processing, design and implementation of the deep learning models.

Nitigya Sambyal: Conceptualization, methodology, paper writing.

Ashutosh Aggarwal: Advice on the conception of the study and interpretation and discussion of results.

Funding: Not applicable.

S. S. Senjam, "Glaucoma blindness–A rapidly emerging non-communicable ocular disease in India: Addressing the issue with advocacy," J Family Med Prim Care, vol. 9, no. 5, p. 2200, 2020, doi: 10.4103/JFMPC.JFMPC_111_20.
"Glaucoma eye disorder is expected to double in India by 2040, suggests ophthalmologist, Health News, ET HealthWorld." https://health.economictimes.indiatimes.com/news/industry/glaucoma-eye-disorder-is-expected-to-double-in-india-by-2040-suggests-ophthalmologist/89116871 (accessed Feb. 05, 2023).
"Glaucoma - Symptoms and causes - Mayo Clinic." https://www.mayoclinic.org/diseases-conditions/glaucoma/symptoms-causes/syc-20372839 (accessed Feb. 05, 2023).
"Glaucoma: Causes, Types, Symptoms, Diagnosis, and Treatment." https://www.webmd.com/eye-health/glaucoma-eyes (accessed Mar. 01, 2023).
"Glaucoma - NHS." https://www.nhs.uk/conditions/glaucoma/ (accessed Feb. 05, 2023).
R. E. Kirsch and D. R. Anderson, “Clinical Recognition of Glaucomatous Cupping,” Am J Ophthalmol, vol. 193, pp. xxviii–xxxviii, Sep. 2018, doi: 10.1016/J.AJO.2018.06.008.
H. Quigley and A. T. Broman, "The number of people with glaucoma worldwide in 2010 and 2020," Br J Ophthalmol, vol. 90, no. 3, pp. 262–267, Mar. 2006, doi: 10.1136/BJO.2005.081224.
E. Waisberg and J. A. Micieli, "Neuro-Ophthalmological Optic Nerve Cupping: An Overview," Eye Brain, vol. 13, p. 255, 2021, doi: 10.2147/EB.S272343.
M. D. Abramoff, M. K. Garvin, and M. Sonka, "Retinal Imaging and Image Analysis," IEEE Trans Med Imaging, vol. 3, pp. 169–208, Jan. 2010, doi: 10.1109/RBME.2010.2084567.
P. C. Alguire, "Tonometry," Clinical Methods: The History, Physical, and Laboratory Examinations, 1990, Accessed: Mar. 01, 2023. [Online]. Available: https://www.ncbi.nlm.nih.gov/books/NBK222/
"What Is Optical Coherence Tomography? - American Academy of Ophthalmology." https://www.aao.org/eye-health/treatments/what-is-optical-coherence-tomography (accessed Mar. 02, 2023).
S. Maetschke, B. Antony, H. Ishikawa, G. Wollstein, J. Schuman, and R. Garnavi, "A feature agnostic approach for glaucoma detection in OCT volumes," PLoS One, vol. 14, no. 7, Jun. 2019, doi: 10.1371/JOURNAL.PONE.0219126.
E. A. Muro-Fuentes and L. Stunkel, "Diagnostic Error in Neuro-ophthalmology: Avenues to Improve," Curr Neurol Neurosci Rep, vol. 22, no. 4, p. 243, Apr. 2022, doi: 10.1007/S11910-022-01189-4.
I. H. Sarker, "Deep Learning: A Comprehensive Overview on Techniques, Taxonomy, Applications and Research Directions," SN Comput Sci, vol. 2, no. 6, pp. 1–20, Nov. 2021, doi: 10.1007/S42979-021-00815-1/FIG.S/6.
M. Kim et al., "Deep Learning in Medical Imaging," Neurospine, vol. 16, no. 4, p. 657, Dec. 2019, doi: 10.14245/NS.1938396.198.
Y. LeCun, Y. Bengio, and G. Hinton, "Deep learning," Nature, vol. 521, no. 7553, pp. 436–444, May 2015, doi: 10.1038/NATURE14539.
Y. LeCun, "Generalization and network design strategies," Connectionism in perspective, Jun. 1989, Accessed: Feb. 05, 2023. [Online]. Available: https://www.academia.edu/2813343/Generalization_and_network_design_strategies
K. Munir, H. Elahi, A. Ayub, F. Frezza, and A. Rizzi, "Cancer Diagnosis Using Deep Learning: A Bibliographic Review," Cancers, vol. 11, no. 9, p. 1235, Aug. 2019, doi: 10.3390/CANCERS11091235.
N. Sambyal, P. Saini, R. Syal, and V. Gupta, "Modified residual networks for severity stage classification of diabetic retinopathy," Evolving Systems, vol. 14, no. 1, pp. 17–35, Feb. 2022, doi: 10.1007/S12530-022-09427-3.
D. Mirzania, A. C. Thompson, and K. W. Muir, "Applications of deep learning in detection of glaucoma: A systematic review," Eur J Ophthalmol, vol. 31, no. 4, pp. 1618–1642, Dec. 2020, doi: 10.1177/1120672120977346.
X. Chen, Y. Xu, D. W. Kee Wong, T. Y. Wong, and J. Liu, "Glaucoma detection based on deep convolutional neural network," Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS, vol. 2015-November, pp. 715–718, Nov. 2015, doi: 10.1109/EMBC.2015.7318462.
H. S. Alghamdi, H. L. Tang, S. A. Waheeb, and T. Peto, "Automatic Optic Disc Abnormality Detection in Fundus Images: A Deep Learning Approach," pp. 17–24, May 2017, doi: 10.17077/OMIA.1042.
Q. Abbas, "Glaucoma-Deep: Detection of Glaucoma Eye Disease on Retinal Fundus Images using Deep Learning," International Journal of Advanced Computer Science and Applications, vol. 8, no. 6, 2017, doi: 10.14569/IJACSA.2017.080606.
A. Diaz-Pinto, S. Morales, V. Naranjo, T. Köhler, J. M. Mossi, and A. Navea, "CNNs for automatic glaucoma assessment using fundus images: An extensive validation," Biomed Eng Online, vol. 18, no. 1, pp. 1–19, Mar. 2019, doi: 10.1186/S12938-019-0649-Y.
S. Serte and A. Serener, "A Generalized Deep Learning Model for Glaucoma Detection," 2019 3rd International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT), Oct. 2019, doi: 10.1109/ISMSIT.2019.8932753.
J. Sivaswamy, A. Chakravarty, G. Datt Joshi, and T. Abbas Syed, "A Comprehensive Retinal Image Dataset for the Assessment of Glaucoma from the Optic Nerve Head Analysis," 2015.
"GitHub - cvblab/retina_dataset: Retina dataset containing 1) normal 2) cataract 3) glaucoma 4) retina disease." https://github.com/cvblab/retina_dataset (accessed Feb. 05, 2023).
K. Simonyan and A. Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition," 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings, Sep. 2014, doi: 10.48550/arxiv.1409.1556.
K. He, X. Zhang, S. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2016-December, pp. 770–778, Dec. 2015, doi: 10.48550/arxiv.1512.03385.
"Taking the Confusion Out of Confusion Matrices | by Allison Ragan | Towards Data Science." https://towardsdatascience.com/taking-the-confusion-out-of-confusion-matrices-c1ce054b3d3e (accessed Mar. 01, 2023).
"Introduction to TensorFlow." https://www.tensorflow.org/learn (accessed Feb. 05, 2023).
"About Keras." https://keras.io/about/ (accessed Feb. 05, 2023).
"Understand the Impact of Learning Rate on Neural Network Performance - MachineLearningMastery.com." https://machinelearningmastery.com/understand-the-dynamics-of-learning-rate-on-deep-learning-neural-networks/ (accessed Feb. 05, 2023).
P. L. Lagari, L. H. Tsoukalas, and I. E. Lagaris, "Variance Counterbalancing for Stochastic Large-scale Learning," International Journal on Artificial Intelligence Tools, vol. 29, no. 5, Aug. 2020, doi: 10.1142/S0218213020500104.
"How to Choose Loss Functions When Training Deep Learning Neural Networks - MachineLearningMastery.com." https://machinelearningmastery.com/how-to-choose-loss-functions-when-training-deep-learning-neural-networks/ (accessed Feb. 05, 2023).
"How does Batch Size impact your model learning | by Devansh- Machine Learning Made Simple | Geek Culture | Medium." https://medium.com/geekculture/how-does-batch-size-impact-your-model-learning-2dd34d9fb1fa (accessed Mar. 01, 2023).
"Epoch vs Batch Size vs Iterations | by SAGAR SHARMA | Towards Data Science." https://towardsdatascience.com/epoch-vs-iterations-vs-batch-size-4dfb9c7ce9c9 (accessed Mar. 01, 2023).

No competing interests reported.
