Fig 1 shows the architecture of our modified co-learning technique based on ten Convolutional Neural Network (CNN) models. The purpose of each of the ten CNN models is to derive the image features that are most relevant to each specific image. Each CNN model takes a PET image and a CT image as input. The modified co-learning technique uses the modality-specific features produced by the ten CNN models to derive a spatially varying fusion map that weights the modality-specific features at different locations. Finally, the reconstruction component integrates the modality-specific fused features across multiple scales to produce the final prediction map.
A. Creation of modified co-learning technique
Let G = X * Y + c be the output feature map of a CNN model, where * is the convolution operation, Y is the input to the CNN model, X is the learned weight, and c is the learned bias. A batch normalization layer is used to normalize every output feature dimension of G to a distribution with zero mean and unit variance. The Leaky rectified linear unit (Leaky ReLU) activation function is applied after feature map normalization:

$$A_i(g) = \begin{cases} g, & g > 0 \\ i\,g, & g \leq 0 \end{cases}$$

where g is a normalized feature and i is a parameter controlling the 'leakiness' of the activation function, subject to the constraint 0 < i < 1. The Leaky ReLU activation avoids the dead neuron problem that can occur with the standard ReLU function, where some weights in X can be updated to a value at which their training gradients are forever stuck at 0, preventing those weights from ever being updated again. The parameter i introduces a small non-zero gradient when g < 0, thereby preventing the weights from becoming stuck at an unrecoverable value. For simplicity of notation, the output of a convolutional layer is denoted by G = A_i(X * Y + c), a feature map generated from Y after convolution, batch normalization, and activation.
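As a concrete illustration, the sketch below implements one such convolution-normalization-activation layer in PyTorch. The framework choice, the channel arguments, and the 3 × 3 kernel size are our assumptions for illustration, not details specified above.

```python
import torch.nn as nn

# A minimal sketch of one encoder layer G = A_i(X * Y + c):
# convolution, batch normalization, then Leaky ReLU.
class ConvBlock(nn.Module):
    def __init__(self, in_channels, out_channels, leak=0.01):
        super().__init__()
        self.conv = nn.Conv2d(in_channels, out_channels,
                              kernel_size=3, padding=1)  # X * Y + c
        self.bn = nn.BatchNorm2d(out_channels)           # zero mean, unit variance per channel
        self.act = nn.LeakyReLU(negative_slope=leak)     # A_i with i = leak, 0 < i < 1

    def forward(self, y):
        return self.act(self.bn(self.conv(y)))
```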
B. Modified co-learning technique based on ten CNN models
The modified co-learning technique contains two parts:
(i) a modified co-learning unit based on the CNN models, which learns to derive spatially varying fusion maps, and (ii) a fusion operation, which utilizes the fusion maps to prioritize different features. Fig 2 shows a conceptual example of the modified feature co-learning and fusion unit based on the ten CNN models. The inputs to the modified feature co-learning unit are two feature maps G_CT and G_PET (one from each modality-specific CNN model) of size w × h × c, with width w, height h, and c channels. These feature maps are stacked to form Y_multi of size w × h × m × c, where m = 2 is the number of modalities. The channels of Y_multi are then convolved with the channels of a learnable 3D kernel X_multi of size k × k × m, where k is the width and height of the kernel and m = 2 is again the number of modalities.
By performing this convolution without padding the modality dimension, we obtain, for a given channel c, a feature map with a singleton third dimension in which the value at location (a, b) is determined from the neighborhoods of both G_CT(a, b) and G_PET(a, b):

$$(X_{multi} * Y_{multi})(a, b) = \sum_{u=1}^{k} \sum_{v=1}^{k} \sum_{n=1}^{m} X_{multi}(u, v, n)\, Y_{multi}(a+u, b+v, n)$$
We then squeeze the singleton third dimension to obtain an output feature map X_multi * Y_multi of size w × h × 2c: the same width and height as the two modality-specific input feature maps G_CT and G_PET, with double the number of channels, which is necessary for weighting the modality-specific feature maps by the modified co-learned fusion maps.
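A minimal PyTorch sketch of this co-learning convolution follows, assuming NCHW tensors; the framework and the reading that the 2c output channels come from 2c learned kernels are our assumptions:

```python
import torch
import torch.nn as nn

# A sketch of the co-learning convolution. g_ct and g_pet each have shape
# (N, c, h, w); stacking along a new modality axis gives Y_multi of shape
# (N, c, m, h, w) with m = 2. A Conv3d kernel that spans the full modality
# depth, with no padding on that axis, leaves a singleton dimension that is
# then squeezed away.
class CoLearningConv(nn.Module):
    def __init__(self, c, k=3, m=2):
        super().__init__()
        self.conv3d = nn.Conv3d(
            in_channels=c, out_channels=2 * c,  # 2c kernels -> w x h x 2c output
            kernel_size=(m, k, k),              # spans both modalities at once
            padding=(0, k // 2, k // 2))        # pad space, never the modality axis

    def forward(self, g_ct, g_pet):
        y_multi = torch.stack([g_ct, g_pet], dim=2)  # (N, c, m=2, h, w)
        out = self.conv3d(y_multi)                   # (N, 2c, 1, h, w)
        return out.squeeze(2)                        # (N, 2c, h, w)
```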
The modified co-learned fusion map controls the level of importance given to information from each modality at each location, in contrast to the global fusion ratio used in PET-CT pixel intermixing [22]–[24]. The modified co-learned fusion maps therefore directly affect the input distribution of the learnable layers that immediately follow the modified co-learning unit. Hence, we do not normalize the output of the CNN model within the modified co-learning unit. As with the CNN models, we use a Leaky ReLU activation function to obtain the multi-modality modified co-learned fusion map:

$$G_{CNN} = A_i(X_{multi} * Y_{multi} + c_{multi})$$

where c_multi are the learned biases. The multi-modality fusion map G_CNN is thus produced by the modified co-learning unit based on the ten CNN models. The fusion operation then integrates the modality-specific feature maps according to the values (coefficients) in the multi-modality fusion map as follows:
$$G_{fused} = (G_{CT} \circledast G_{PET}) \odot G_{CNN}$$

where G_CNN is the modified co-learned fusion map, ⊛ is the stacking operation, and ⊙ is element-wise multiplication. This process merges the two modality-specific feature maps G_CT and G_PET and weights them by the modified co-learned fusion map, in a manner similar to pixel intermixing. Our modified co-learning technique based on ten CNN models generates fused feature maps, one for each PET-CT image pair.
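The sketch below combines these pieces into a fusion unit, reusing the hypothetical CoLearningConv module from the previous sketch; note that the fusion map receives only a Leaky ReLU, with no batch normalization:

```python
import torch
import torch.nn as nn

# A sketch of the fusion operation G_fused = (G_CT (*) G_PET) (.) G_CNN.
class FusionUnit(nn.Module):
    def __init__(self, c, k=3, leak=0.01):
        super().__init__()
        self.colearn = CoLearningConv(c, k)           # hypothetical module, see above
        self.act = nn.LeakyReLU(negative_slope=leak)  # no batch norm inside this unit

    def forward(self, g_ct, g_pet):
        g_cnn = self.act(self.colearn(g_ct, g_pet))  # fusion map, (N, 2c, h, w)
        stacked = torch.cat([g_ct, g_pet], dim=1)    # stacking: (N, 2c, h, w)
        return stacked * g_cnn                       # element-wise weighting
```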
C. Reconstruction
The reconstruction part of our CNN creates a prediction map of the ROIs within the PET-CT image. It does this by integrating the modified co-learned feature maps from each of the ten CNN models. The concept behind the reconstruction block is to generate higher-dimensional feature maps that better correspond to the features of different ROIs, by merging lower-dimensional information with features that were fused from multiple image modalities. As with the modality-specific encoders, we use batch normalization [20] and Leaky ReLU [21] activations. After the last reconstruction block, the output feature map has the same width and height as the input PET-CT image, with 128 channels in the third dimension. This is analogous to a final 128-dimensional feature vector for each pixel in the original image. The ten CNN models utilize a 1 × 1 convolution to map these feature vectors into R + 1 feature maps, where R is the number of ROIs. Finally, these activations are transformed, using the softmax function [25], into a probability (prediction) map corresponding to the likelihood of each pixel belonging to a particular class:

$$Q_j(p) = \frac{e^{p_j}}{\sum_{r=1}^{R+1} e^{p_r}}$$
where Q_j(p) is the probability that the pixel with observation vector p belongs to region j, and p_j is the j-th element of p, i.e., the activation corresponding to region j.
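A minimal PyTorch sketch of this prediction head follows; num_rois and the spatial size are illustrative values, not details from the text:

```python
import torch
import torch.nn as nn

num_rois = 4  # R, an illustrative value
head = nn.Conv2d(128, num_rois + 1, kernel_size=1)  # 1x1 conv to R + 1 maps

features = torch.randn(1, 128, 256, 256)  # output of the last reconstruction block
logits = head(features)                   # per-pixel activations p, (N, R + 1, h, w)
probs = torch.softmax(logits, dim=1)      # Q_j(p): softmax over the class dimension
```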