Authors :
Shaurya Gupta; Mohammed Khundmeer Siddiq; Gaurav Panwar; Deepti Gupta
Volume/Issue :
Volume 9 - 2024, Issue 4 - April
Google Scholar :
https://tinyurl.com/muxs6sfz
Scribd :
https://tinyurl.com/yc5s2aa7
DOI :
https://doi.org/10.38124/ijisrt/IJISRT24APR2631
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
This groundbreaking research introduces a
comprehensive strategy for advancing medical image
segmentation, merging two pivotal concepts to
significantly enhance both the accuracy and efficiency
of the segmentation process. The first component of our
approach involves the integration of polar
transformations as a preprocessing step applied to the
original dataset. This transformative technique is
designed to address the challenges associated with
segmenting single structures of elliptical shape in
medical images, such as organs (e.g., heart and kidneys),
skin lesions, polyps, and various abnormalities. By
centering the polar transformation on the object's focal
point, a reduction in dimensionality is achieved, coupled
with a distinct separation of segmentation and
localization tasks. Two distinct methodologies for
selecting an optimal polar origin are proposed: one
involving estimation through a segmentation neural
network trained on non-polar images, and the other
employing a dedicated neural network trained to pre
dict the optimal origin.
The second key element of our approach is around
the integration of the DoubleU-Net architecture, a
powerful encoder-decoder model specifically designed
for the task of semantic image segmentation. DoubleU-
Net is a group of two U-Net architectures, each with a
specific purpose. The initial U-Net is pre-trained on
VGG-19 as the encoder and uses features learned from
ImageNet to provide efficient information transfer. In
order to store more semantic information and content, a
second U-Net was added to the base to enhance the
capabilities of the network. Join Atrous Spatial
Pyramid Pooling (ASPP) to develop network data
extraction content. The combination of DoubleU-Net
architecture and joint transformation as a step forward
shows good segmentation performance in different
clinical tasks, including liver segmentation, polyp
detection vision, skin segmentation, and epicardial fat
tissue segmentation. It shows that various medical
projects, including various diagnostic methods such as
colonoscopy, dermoscopy, microscopy, have a positive
impact on the plan. More importantly, the method
performs well in difficult cases, such as the
segmentation of small and flat polyps in CVC-ClinicDB
and the 2015 subset of the MICCAI Automated Polyp
Detection dataset. The results demonstrate the accuracy
and generality of the combination, making it the best
way to evaluate medical images in context; Our study
has revealed a new method of skin cancer diagnosis that
combines the power of deep learning with innovation.
Advanced technology. The combination of dual U-Net
architecture and polar coordinate transformation not
only improves the accuracy of classification of lesions
but also improves the robustness of the model to
changes in image features. This study contributes to the
development of computer-aided diagnostic systems for
early diagnosis. Experimental results show that our
method provides good accuracy, sensitivity, and
specificity in detecting malignant and benign tumors.
Additionally, we are conducting ablation studies to
determine the contribution of each presentation and
treatment of skin cancer to ultimately benefit patients
and determine these benefits. We also apply the
transformation of the joint as the first step to improve
the discrimination ability of the model. This mechanical
change effectively reduces the impact caused by changes
in wound size, shape, and direction by displaying the
original image of the polar system. By standardizing the
representation of skin diseases, polar transformation
improves the model's ability to generalize across
different data sets and improves overall performance.
References :
- D. Jha, M. A. Riegler, D. Johansen, P. Halvorsen and . H. D. Johansen, “DoubleU-net: A deep convolutional neural network for medical image segmentation,” 2020 IEEE 33rd International Symposium on Computer-Based Medical Systems (CBMS), vol. abs/2006.04868v2, no. 8 Jun 2020, pp.1-7,2020. https://doi.org/10.1109/cbms49503. 2020.00111
- C. Kaul, S. Manandhar and N. Pears, “FocusNet: An attention-based Fully Convolutional Network for Medical Image Segmentation,” 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), vol. abs/1902.03091v1, no. 8 Feb 2019, pp. 1-5, 2019. https://doi.org/10.1109/isbi. 2019.8759477
- M. BENČEVIĆ, I. GALIĆ, M. HABIJAN and D. BABIN, “Training on Polar Image Transformations,” IEEE Access, vol. 9, no. September 29, 2021, p. 133365–133375, 2021. https://doi.org/10.1109/access.2021.3116265
- Chen, P., Huang, S., & Yue, Q. (2022). Skin lesion segmentation using recurrent attentional convolutional networks. IEEE Access, vol. 10, 94007–94018. https://doi.org/10.1109/access. 2022.3204280
This groundbreaking research introduces a
comprehensive strategy for advancing medical image
segmentation, merging two pivotal concepts to
significantly enhance both the accuracy and efficiency
of the segmentation process. The first component of our
approach involves the integration of polar
transformations as a preprocessing step applied to the
original dataset. This transformative technique is
designed to address the challenges associated with
segmenting single structures of elliptical shape in
medical images, such as organs (e.g., heart and kidneys),
skin lesions, polyps, and various abnormalities. By
centering the polar transformation on the object's focal
point, a reduction in dimensionality is achieved, coupled
with a distinct separation of segmentation and
localization tasks. Two distinct methodologies for
selecting an optimal polar origin are proposed: one
involving estimation through a segmentation neural
network trained on non-polar images, and the other
employing a dedicated neural network trained to pre
dict the optimal origin.
The second key element of our approach is around
the integration of the DoubleU-Net architecture, a
powerful encoder-decoder model specifically designed
for the task of semantic image segmentation. DoubleU-
Net is a group of two U-Net architectures, each with a
specific purpose. The initial U-Net is pre-trained on
VGG-19 as the encoder and uses features learned from
ImageNet to provide efficient information transfer. In
order to store more semantic information and content, a
second U-Net was added to the base to enhance the
capabilities of the network. Join Atrous Spatial
Pyramid Pooling (ASPP) to develop network data
extraction content. The combination of DoubleU-Net
architecture and joint transformation as a step forward
shows good segmentation performance in different
clinical tasks, including liver segmentation, polyp
detection vision, skin segmentation, and epicardial fat
tissue segmentation. It shows that various medical
projects, including various diagnostic methods such as
colonoscopy, dermoscopy, microscopy, have a positive
impact on the plan. More importantly, the method
performs well in difficult cases, such as the
segmentation of small and flat polyps in CVC-ClinicDB
and the 2015 subset of the MICCAI Automated Polyp
Detection dataset. The results demonstrate the accuracy
and generality of the combination, making it the best
way to evaluate medical images in context; Our study
has revealed a new method of skin cancer diagnosis that
combines the power of deep learning with innovation.
Advanced technology. The combination of dual U-Net
architecture and polar coordinate transformation not
only improves the accuracy of classification of lesions
but also improves the robustness of the model to
changes in image features. This study contributes to the
development of computer-aided diagnostic systems for
early diagnosis. Experimental results show that our
method provides good accuracy, sensitivity, and
specificity in detecting malignant and benign tumors.
Additionally, we are conducting ablation studies to
determine the contribution of each presentation and
treatment of skin cancer to ultimately benefit patients
and determine these benefits. We also apply the
transformation of the joint as the first step to improve
the discrimination ability of the model. This mechanical
change effectively reduces the impact caused by changes
in wound size, shape, and direction by displaying the
original image of the polar system. By standardizing the
representation of skin diseases, polar transformation
improves the model's ability to generalize across
different data sets and improves overall performance.