Deep Learning Predictive Model For Colon Cancer
Deep Learning Predictive Model For Colon Cancer
Asif Karim6
Member, IEEE
Abstract—In recent years, the area of Medicine and issue, countries must make significant investments in public
Healthcare has made significant advances with the assistance of health, establish a large number of labs and pathology centres
computational technology. During this time, new diagnostic with the requisite technology, and educate more people to
techniques were developed. Cancer is the world's second-largest perform diagnostic operations. Furthermore, keeping the costs
cause of mortality, claiming the lives of one out of every six of these examinations within reach of those who are poor is
individuals. The colon cancer variation is the most frequent and necessary. Finding new techniques for diagnosing cancer will
lethal of the numerous kinds of cancer. Identifying the illness at give a genuine chance of survival.
an early stage, on the other hand, substantially increases the odds
of survival. A cancer diagnosis may be automated by using the
power of Artificial Intelligence (AI), allowing us to evaluate more
cases in less time and at a lower cost. In this research, CNN
models are employed to analyse imaging data of colon cells. For
colon cell image classification, CNN with max pooling and
average pooling layers and MobileNetV2 models are utilized. To
determine the learning rate, the models are trained and
evaluated at various Epochs. It's found that the accuracy of the
max pooling and average pooling layers is 97.49% and 95.48%,
respectively. And MobileNetV2 outperforms the other two
models with the most remarkable accuracy of 99.67% with a
data loss rate of 1.24.
I. INTRODUCTION
Cancer refers to a category of illnesses in which abnormal
cells develop within the human body as a result of random
mutations. When these cells are formed, they divide
abnormally and spread throughout the organs. If left untreated,
most cancers will eventually kill their victims. Fig. 1A, which
shows the 4-tier Human Development Index (HDI) based on
the UN's 2019 Human Development Report, shows how much
cancer's position as a cause of early death corresponds with
nation levels of social and economic development.
In rare situations, a person inherits from their parents the
faulty gene that causes cancer. Regular checks are required for
those who are at risk of getting hereditary malignancies. Many
Fig. 1. (A) The Four-Tiered Human Development Index (HDI) and (B) the
individuals cannot afford these diagnostic procedures since 20 World Regions. The Legend Includes the Population Sizes for Each
they are expensive. Cancer is responsible for over 70% of Population. Source: United Nations Development Program/United Nations
fatalities in poor and middle-income nations [1]. To meet this Procurement Division. Source: World Health Organization (WHO).
687 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 8, 2021
688 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 8, 2021
In the proposed study, authors tested image data for colon B. Environment Setup
cells obtained from online data sources to detect colon cancer. Tensorflow and the Keras library were used to carry out
They are using the Transfer learning model MobileNetV2. The this analysis. Tensorflow is a free, open-source Python library
process contains two CNN layers, Max Pooling, and average for performing large-scale machine learning calculations.
pooling. The image data goes through a number of Tensorflow is used extensively in artificial neural networks and
preprocessing steps to give a better classification outcome. The is used in Keras' backend.
performance of the model is evaluated based on the confusion
matrix.
689 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 8, 2021
D. CNN Classifier
CNN is an example of a Deep Learning algorithm that
takes an input image and assigns priority to different aspects of
the image, allowing it to distinguish one image from another
based on its features. In this system, two convolutional layers
in the CNN model are used where each convolutional layer
used convolutional 2D. In both convolutional 2D layers, 'Relu
activation' is utilized. For complete connectivity, two Dense
Layers are used. 'Relu activation' for the first dense layer and
'Sigmoid activation' for the second dense layer is used. Aside
from these layers, there are several hidden layers, as well as an
input layer. In this study, two pooling layers: Max Pooling 2D
Fig. 6. Two Convolution Layer with Average Pooling Action.
and Average Pooling 2D, are implemented [18]. Finally, for
the classification of image data MobileNetV2 classifier is used.
690 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 8, 2021
691 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 8, 2021
692 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 8, 2021
Fig. 13. Test Accuracy and Training Accuracy in Average Pooling Layer for • False positives (FP): 466.0000
Different Epochs.
• False negatives (FN): 578.00
As shown in Fig. 13, the accuracy of the average pooling • Sensitivity or Recall = 0.9606 = 96.06%
model progressively improves as the number of epochs grows.
The highest accuracy for the test set is in the 46th epoch and the • Specificity = 0.9548 = 95.48%
training set is in the 45th epoch. • Precision = 0.9702 = 97.02%
The model's data loss in the training and testing datasets
• F1-Score = 0.9657 = 96.57%
reduces quickly with the number of epochs, as seen in Fig. 14
for the average pooling layer. F. Classification Outcome of MobileNetV2 Model
After loading the MobileNetV2 model, the top layer is
frozen and the weights from ImageNet are loaded. A custom
model is placed there, and the architecture is trained. The
AveragePooling2D operation is included in the model, and the
pool size is (7, 7). There is a 128-node hidden layer, and the
ReLU activation function is used to remove features correctly.
Because deep learning models are prone to overfitting, dropout
is used to select training images at random. All of
MobileNetV2's trainable layers are no longer used. The Adam
Fig. 14. Test and Training Data Loss in Average Pooling Layer For Different optimizer feature is used to better learn models from errors. By
Epochs. setting the trainable layer parameter to False, the base layers of
all transfer learning models were frozen. A customize trainable
D. MSE (Mean Square Error) and AUC layer consisting of one hidden layer with 128 neurons was
The following MSE and AUC were achieved by applying introduced at this stage. The Average Pooling operation was
the test data set on the Average Pooling Layer: applied where the pool size is (7,7). The process is shown in
Fig. 17.
• MSE (Mean Square Error) of 0.0588 (Fig. 15)
For the back-propagation process, the learning rate is set to
• AUC of 0.9753 (Fig. 16) 0.01. Binary cross-entropy is used to calculate the loss
function. SoftMax activation is included in the output layer and
is more accurate than other activation functions. Table IV
displays the training and test accuracy, as well as the data loss
rate.
693 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 8, 2021
Test
Training Training Test
Model Name Data
Accuracy Data Loss Accuracy
Loss
694 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 8, 2021
695 | P a g e
www.ijacsa.thesai.org
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 12, No. 8, 2021
[11] Yuan, Z.; IzadyYazdanabadi, M.; Mokkapati, D.; Panvalkar, R.; Shin, 2020 3rd Artificial Intelligence and Cloud Computing Conference.
J.Y.; Tajbakhsh, N.; Gurudu, S.; Liang, J. Automatic polyp detection in Association for Computing Machinery, New York, NY, USA, 38–45.
colonoscopy videos. Med. Imaging 2017 Image Process. 2017, 10133, DOI:https://doi.org/10.1145/3442536.3442543.
101332K. [18] S. Chakraborty, F. M. J. M. Shamrat, M. M. Billah, M. A. Jubair, M.
[12] Babu, T.; Gupta, D.; Singh, T.; Hameed, S. Colon Cancer Prediction on Alauddin and R. Ranjan, "Implementation of Deep Learning Methods to
Different Magnified Colon Biopsy Images. In Proceedings of the 10th Identify Rotten Fruits," 2021 5th International Conference on Trends in
International Conference on Advanced Computing (ICoAC), Chennai, Electronics and Informatics (ICOEI), 2021, pp. 1207-1212, doi:
India, 13–15 December 2018; pp. 277–280. 10.1109/ICOEI51242.2021.9453004.
[13] Mo, X.; Tao, K.; Wang, Q.; Wang, G. An Efficient Approach for Polyps [19] Akter, S. , Shekhar, H. and Akhteruzzaman, S. (2021) Application of
Detection in Endoscopic Videos Based on Faster R-CNN. In Biochemical Tests and Machine Learning Techniques to Diagnose and
Proceedings of the International Conference on Pattern Recognition Evaluate Liver Disease. Advances in Bioscience and Biotechnology, 12,
(ICPR), Beijing, China, 20–24 August 2018; pp. 3929–3934. 154-172. doi: 10.4236/abb.2021.126011.
[14] Urban, G.; Tripathi, P.; Alkayali, T.; Mittal, M.; Jalali, F.; Karnes,W.; [20] Lee, H., Park, J., & Hwang, J. Y. (2020). Channel attention module with
Baldi, P. Deep Learning Localizes and Identifies Polyps in Real Time multiscale grid average pooling for breast cancer segmentation in an
With 96% Accuracy in Screening Colonoscopy. Gastroenterology 2018, ultrasound image. IEEE transactions on ultrasonics, ferroelectrics, and
155, 1069–1078.e8. frequency control, 67(7), 1344-1353.
[15] Akbari, M.; Mohrekesh, M.; Rafiei, S.; Reza Soroushmehr, S.M.; [21] Toğaçar, M., Cömert, Z., & Ergen, B. (2021). Intelligent skin cancer
Karimi, N.; Samavi, S.; Najarian, K. Classification of Informative detection applying autoencoder, MobileNetV2 and spiking neural
Frames in Colonoscopy Videos Using Convolutional Neural Networks networks. Chaos, Solitons & Fractals, 144, 110714.
with BinarizedWeights. In Proceedings of the Annual International [22] Rathore, S., Hussain, M., & Khan, A. (2015). Automated colon cancer
Conference IEEE Engineering in Medicine and Biology Society detection using hybrid of novel geometric features and some traditional
(EMBS), Honolulu, Hawaii, 17–22 July 2018; pp. 65–68. features. Computers in biology and medicine, 65, 279-296.
[16] Masud, M.; Sikder, N.; Nahid, A.-A.; Bairagi, A.K.; AlZain, M.A. A [23] Hamida, A. B., Devanne, M., Weber, J., Truntzer, C., Derangère, V.,
Machine Learning Approach to Diagnosing Lung and Colon Cancer Ghiringhelli, F.,. & Wemmert, C. (2021). Deep learning for colon cancer
Using a Deep Learning-Based Classification Framework. Sensors 2021, histopathological images analysis. Computers in Biology and Medicine,
21, 748. https://doi.org/10.3390/s21030748. 104730.
[17] Satvik Garg and Somya Garg. 2020. Prediction of lung and colon cancer [24] Liang, M., Ren, Z., Yang, J., Feng, W., & Li, B. (2020). Identification of
through analysis of histopathological images by utilizing Pre-trained colon cancer using multi-scale feature fusion convolutional neural
CNN models with visualization of class activation and saliency maps. network based on shearlet transform. IEEE Access, 8, 208969-208977.
696 | P a g e
www.ijacsa.thesai.org