<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Automatic Polyp Segmentation via Parallel Reverse Atention Network</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ge-Peng Ji</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Deng-Ping Fan</string-name>
          <email>dengpfan@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tao Zhou</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Geng Chen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Huazhu Fu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ling Shao</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Inception Institute of Artificial Intelligence (IIAI)</institution>
          ,
          <addr-line>Abu Dhabi, UAE</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>School of Computer Science, Wuhan University</institution>
          ,
          <addr-line>Hubei</addr-line>
          ,
          <country country="CN">China.</country>
          <institution>https://github.com/GewelsJI/MediaEval2020-IIAI-Med</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2020</year>
      </pub-date>
      <fpage>14</fpage>
      <lpage>15</lpage>
      <abstract>
        <p>In this paper, we present a novel deep neural network, termed Parallel Reverse Attention Network (PraNet), for the task of automatic polyp segmentation at MediaEval 2020. Specifically, we first aggregate the features in high-level layers using a parallel partial decoder (PPD). Based on the combined feature, we then generate a global map as the initial guidance area for the following components. In addition, we mine the boundary cues using the reverse attention (RA) module, which is able to establish the relationship between areas and boundary cues. Thanks to the recurrent cooperation mechanism between areas and boundaries, our PraNet is capable of calibrating misaligned predictions, improving the segmentation accuracy and achieving real-time eficiency ( ∼30fps) on a single NVIDIA GeForce GTX 1080 GPU.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Conv1  1 Conv2  2
Low-level feature
Conv3  3 Conv4  4
High-level feature
lev  i
il-ehg
edH  i Sigmoid
l
sapup
m
Flow of feature
Flow of decoder
Flow of up-sampling
Deep supervision
Conv layer
Partial Decoder
3×3 Conv+BN+ReLU
1×1 Conv+BN+ReLU</p>
      <p>Reverse Attention
Multiplication</p>
      <p>Reverse
Prediction
 i</p>
      <p>RA
 3</p>
      <p>RA
 4
tiionddA Up-sample itinoddA
ddon
tii
Up-sample A
Sigmoid
 3
 4
 5</p>
    </sec>
    <sec id="sec-2">
      <title>INTRODUCTION</title>
      <p>
        Aiming at developing computer-aided diagnosis systems for
automatic polyp segmentation, and detecting all types of polyps (i.e.,
irregular polyps, smaller or flat polyps) with high eficiency and
accuracy, Medico Automatic Polyp Segmentation Challenge 20201 [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]
benchmarks semantic segmentation methods for segmenting polyp
regions in colonoscopy images on a publicly available dataset,
emphasizing robustness, speed, and generalization. Following the
protocols of this challenge, we participate two required sub-tasks
including (i) Polyp segmentation task and (ii) Algorithm eficiency
task, more task descriptions refer to the challenge guidelines.
      </p>
      <p>
        Recent years have witnessed promising progress in addressing
the task of automatic polyp segmentation using traditional [
        <xref ref-type="bibr" rid="ref12 ref16">12, 16</xref>
        ]
and deep learning based [
        <xref ref-type="bibr" rid="ref1 ref13 ref2 ref20 ref21 ref7">1, 2, 7, 13, 20, 21</xref>
        ] methods. However, there
are three core challenges in this field, including (a) the polyps often
vary in appearance, e.g., size, color and texture, even if they are of
the same type; (b) in colonoscopy images, the boundary between
a polyp and its surrounding mucosa is usually blurred and lacks
the intense contrast required for segmentation approaches; (c) the
practical applications of existing algorithms are hindered by their
low performance and eficiency.
      </p>
      <p>Based on these observations, we develop a real-time and
accurate framework, termed Parallel Reverse Attention Network
(PraNet2), for the automatic polyp segmentation task. As can be
Conv5</p>
      <p>5
RA
 5
Down-sample</p>
      <p>Partial Decoder
 3  4pu pu ×up5 ×u2p
×2 ×2 2
C</p>
      <p>C
2×up
seen from Fig. 1, PraNet utilizes a parallel partial decoder (see §
2.1) to generate a high-level semantic global map and a set of
reverse attention modules (see § 2.2) for accurate polyp segmentation
from the colonoscopy images. All components and implementation
details will be elaborated as follows.</p>
    </sec>
    <sec id="sec-3">
      <title>APPROACH</title>
    </sec>
    <sec id="sec-4">
      <title>Parallel Partial Decoder (PPD)</title>
      <p>
        Current popular medical image segmentation networks usually
rely on a U-Net [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] or a U-Net shaped network (e.g., U-Net++ [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ],
ResUNet [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ], etc). These models are essentially encoder-decoder
frameworks, which typically aggregate all multi-level features
extracted from convolutional neural networks (CNNs). As
demonstrated by Wu et al. [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], compared with high-level features,
lowlevel features demand more computational resources due to their
larger spatial resolutions, but contribute less to performance.
Motivated by this observation, we propose to aggregate high-level
features with a parallel partial decoder component. More
specifically, for an input polyp image  with size ℎ ×  , five levels of
features {f,  = 1, ..., 5} with resolution [ℎ/2−1,  /2−1] can be
extracted from a Res2Net-based [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] backbone network. Then, we
divide f features into low-level features {f,  = 1, 2} and high-level
features {f,  = 3, 4, 5}. We introduce the partial decoder  (·) [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ],
a new state-of-the-art (SOTA) decoder component, to aggregate the
high-level features with a paralleled connection. As shown in Fig.
1, the partial decoder feature is computed by PD =  ( 3, 4, 5),
and we can obtain a global map S .
2.2
      </p>
    </sec>
    <sec id="sec-5">
      <title>Reverse Attention (RA)</title>
      <p>
        In a clinical setting, doctors first roughly locate the polyp region,
and then carefully inspect local tissues to accurately label the polyp.
As discussed in § 2.1, our global map S is derived from the deepest
CNN layer, which can only capture a relatively rough location of the
polyp tissues, without structural details (see Fig. 1). To address this
issue, we propose a principle strategy to progressively mine
discriminative polyp regions by erasing foreground objects [
        <xref ref-type="bibr" rid="ref18 ref3">3, 18</xref>
        ]. Instead
of aggregating features from all levels like in [
        <xref ref-type="bibr" rid="ref22 ref23 ref9">9, 22, 23</xref>
        ], we propose
to adaptively learn the reverse atention
in three parallel
highlevel features. In other words, our architecture can sequentially
mine complementary regions and details by erasing the existing
estimated polyp regions from high-level side-output features, where
the existing estimation is up-sampled from the deeper layer.
      </p>
      <p>Specifically, we obtain the output reverse attention features 
{,  = 3, 4, 5} by a reverse attention weight  , as follows:
by multiplying (element-wise ⊙) the high-level side-output feature
 =  ⊙  .</p>
      <p>
        The reverse attention weight  is de-facto for salient object
detection in the computer vision community [
        <xref ref-type="bibr" rid="ref24 ref3">3, 24</xref>
        ], and can be
formulated as:
      </p>
      <p>= ⊖( (P (+1))),
where P (·) denotes an up-sampling operation,  (·) is the Sigmoid
function, and ⊖(·) is a reverse operation subtracting the input from
matrix E, in which all the elements are 1. Fig. 1 (RA) shows the
details of this process. It is worth noting that the erasing strategy
driven by the reverse attention can eventually refine the imprecise
and coarse estimation into an accurate and complete prediction
(1)
(2)
map.
3
3.1</p>
    </sec>
    <sec id="sec-6">
      <title>EXPERIMENTS</title>
    </sec>
    <sec id="sec-7">
      <title>Learning Strategies</title>
      <p>3

We use a hybrid loss function in the training process, which is
defined as</p>
      <p>
        L = L + L , where L

and L represent
the weighted Intersection over Union (IoU) loss and binary cross
entropy (BCE) loss for the global restriction and local (pixel-level)
restriction. The definitions of these losses are the same as in [
        <xref ref-type="bibr" rid="ref14 ref17">14, 17</xref>
        ]
and their efectiveness has been validated in the field of salient
object detection. Here, we adopt deep supervision for the three
side-outputs (i.e., 3, 4, and 4) and the global map  . Each map
is up-sampled (e.g.,
      </p>
      <p>) to the same size as the ground-truth map
as: L = L (,  ) + =3 L (,  )
Í=5
 .
 . Thus the total loss for the proposed PraNet can be formulated
3.2</p>
    </sec>
    <sec id="sec-8">
      <title>Evaluation Metrics</title>
      <p>We employ the metrics widely used in the medical segmentation
ifeld, including mean IoU (mIoU or Jaccard index), Dice coeficient,
recall, precision, acccuracy and frame per second (FPS) for a
comprehensive evaluation.
3.3</p>
    </sec>
    <sec id="sec-9">
      <title>Implementation Details and Datasets</title>
      <p>
        We randomly split the whole Kvaris-SEG3 [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] into a training set
(900 images) and validation set (100 images). Note that we do not
3https://datasets.simula.no/kvasir-seg/
tion (task 1) and algorithm eficiency (task 2) on a single
      </p>
      <sec id="sec-9-1">
        <title>NVIDIA GeForce GTX 1080 GPU of Medico Automatic Polyp</title>
      </sec>
      <sec id="sec-9-2">
        <title>Segmentation Challenge 2020.</title>
        <p>Team Name Jaccard DSC Recall Precision Accuracy
F2</p>
        <p>
          FPS
IIAI-Med
are uniformly resized to 352 × 352 and we augment all the
training images using multiple strategies, including random horizontal
lfipping, rotating, color enhancement and border cropping.
Parameters of the Res2Net-50 [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] backbone are initialized from the model
pre-trained on ImageNet [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. Other parameters are initialized
using the default PyTorch settings. The Adam algorithm is used to
optimize our model, and it is accelerated by an NVIDIA TITAN
RTX GPU. We set the initial learning rate is 1e-4 and divide it by 10
every 50 epochs. It takes about 40 minutes to train the model with
a mini-batch size of 26 over 100 epochs. Our final prediction map
 is generated by 3 after a Sigmoid function. The testing dataset
consists of 160 polyp images provided by organisers. Our code can
be found at https://github.com/GewelsJI/MediaEval2020-IIAI-Med.
3.4
        </p>
      </sec>
    </sec>
    <sec id="sec-10">
      <title>Results and Analysis</title>
      <p>
        Without any bells and whistles, such as extra training data or a
model ensemble, we introduce a new training scheme for addressing
the polyp segmentation challenge based on the previous work [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
For more hyper-parameters and data augmentation settings refer
to § 3.3. In the submission phase, we train our model on the
KvasirSEG dataset [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] and submit the inference results only once. Tab.
1 reports the quantitative results of our approach on sub-task 1,
which achieve a very high performance (Precision=0.901 and
Accuracy=0.960 on test set). Meantime, PraNet also runs at ∼30fps
on a single NVIDIA GeForce GTX 1080 GPU, demonstrating its
simplicity and efectiveness. As a robust, general, and real-time
framework, PraNet can help facilitate future academic research and
computer-aided diagnosis for automatic polyp segmentation.
4
      </p>
    </sec>
    <sec id="sec-11">
      <title>CONCLUSION</title>
      <p>
        Automatic polyp segmentation is a challenging problem because
of the diversity in appearance at polyps, complex similar
environments, and require high accuracy and inference speed. We have
presented a novel architecture, PraNet, for automatically
segmenting polyps from colonoscopy images. PraNet eficiently integrates
a cascaded mechanism and a reverse attention module with a
parallel connection, which can be trained in an end-to-end manner.
Another advantage is that PraNet is universal and flexible, meaning
that more efective modules can be added to further improve the
accuracy. We hope this study will ofer the community an
opportunity to explore more powerful models for related topics such as
lung infection segmentation [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], or even on upstream tasks such as
video-based understanding.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Mojtaba</given-names>
            <surname>Akbari</surname>
          </string-name>
          , Majid Mohrekesh, Ebrahim Nasr-Esfahani, SM Reza Soroushmehr, Nader Karimi, Shadrokh Samavi, and
          <string-name>
            <given-names>Kayvan</given-names>
            <surname>Najarian</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Polyp segmentation in colonoscopy images using fully convolutional network</article-title>
          .
          <source>In IEEE EMBC</source>
          .
          <volume>69</volume>
          -
          <fpage>72</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Patrick</given-names>
            <surname>Brandao</surname>
          </string-name>
          , Evangelos Mazomenos, Gastone Ciuti, Renato Caliò, Federico Bianchi, Arianna Menciassi, Paolo Dario, Anastasios Koulaouzidis, Alberto Arezzo, and
          <string-name>
            <given-names>Danail</given-names>
            <surname>Stoyanov</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Fully convolutional neural networks for polyp segmentation in colonoscopy</article-title>
          .
          <source>In Medical Imaging</source>
          <year>2017</year>
          :
          <string-name>
            <surname>Computer-Aided</surname>
            <given-names>Diagnosis</given-names>
          </string-name>
          , Vol.
          <volume>10134</volume>
          .
          <year>101340F</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Shuhan</given-names>
            <surname>Chen</surname>
          </string-name>
          , Xiuli Tan, Ben
          <string-name>
            <surname>Wang</surname>
            , and
            <given-names>Xuelong</given-names>
          </string-name>
          <string-name>
            <surname>Hu</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Reverse attention for salient object detection</article-title>
          .
          <source>In ECCV</source>
          .
          <volume>234</volume>
          -
          <fpage>250</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Jia</given-names>
            <surname>Deng</surname>
          </string-name>
          , Wei Dong, Richard Socher,
          <string-name>
            <surname>Li-Jia</surname>
            <given-names>Li</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Kai</given-names>
            <surname>Li</surname>
          </string-name>
          , and
          <string-name>
            <surname>Li</surname>
          </string-name>
          Fei-Fei.
          <year>2009</year>
          .
          <article-title>Imagenet: A large-scale hierarchical image database</article-title>
          .
          <source>In IEEE CVPR</source>
          .
          <volume>248</volume>
          -
          <fpage>255</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Deng-Ping</surname>
            <given-names>Fan</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ge-Peng</surname>
            <given-names>Ji</given-names>
          </string-name>
          , Tao Zhou, Geng Chen, Huazhu Fu,
          <string-name>
            <given-names>Jianbing</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and Ling</given-names>
            <surname>Shao</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>PraNet: Parallel Reverse Attention Network for Polyp Segmentation</article-title>
          .
          <source>MICCAI</source>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Deng-Ping</surname>
            <given-names>Fan</given-names>
          </string-name>
          , Tao Zhou,
          <string-name>
            <surname>Ge-Peng</surname>
            <given-names>Ji</given-names>
          </string-name>
          , Yi Zhou, Geng Chen, Huazhu Fu,
          <string-name>
            <given-names>Jianbing</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and Ling</given-names>
            <surname>Shao</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>Inf-Net: Automatic COVID-19 Lung Infection Segmentation from CT Images</article-title>
          .
          <source>IEEE TMI</source>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Yuqi</given-names>
            <surname>Fang</surname>
          </string-name>
          , Cheng Chen, Yixuan Yuan, and
          <string-name>
            <surname>Kai-yu Tong</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Selective feature aggregation network with area-boundary constraints for polyp segmentation</article-title>
          .
          <source>In MICCAI</source>
          . Springer,
          <fpage>302</fpage>
          -
          <lpage>310</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Shang-Hua</surname>
            <given-names>Gao</given-names>
          </string-name>
          , Ming-Ming Cheng, Kai Zhao,
          <string-name>
            <surname>Xin-Yu</surname>
            <given-names>Zhang</given-names>
          </string-name>
          , MingHsuan Yang, and
          <string-name>
            <given-names>Philip</given-names>
            <surname>Torr</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>Res2Net: A New Multi-scale Backbone Architecture</article-title>
          .
          <source>IEEE TPAMI</source>
          (
          <year>2020</year>
          ),
          <fpage>1</fpage>
          -
          <lpage>1</lpage>
          . https://doi.org/10.1109/ TPAMI.
          <year>2019</year>
          .2938758
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Zaiwang</given-names>
            <surname>Gu</surname>
          </string-name>
          , Jun Cheng, Huazhu Fu, Kang Zhou, Huaying Hao,
          <string-name>
            <given-names>Yitian</given-names>
            <surname>Zhao</surname>
          </string-name>
          , Tianyang Zhang, Shenghua Gao, and Jiang Liu.
          <year>2019</year>
          .
          <article-title>CE-Net: Context encoder network for 2d medical image segmentation</article-title>
          .
          <source>IEEE TMI 38</source>
          ,
          <issue>10</issue>
          (
          <year>2019</year>
          ),
          <fpage>2281</fpage>
          -
          <lpage>2292</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Debesh</surname>
            <given-names>Jha</given-names>
          </string-name>
          , Steven A.
          <string-name>
            <surname>Hicks</surname>
            , Krister Emanuelsen,
            <given-names>Håvard D.</given-names>
          </string-name>
          <string-name>
            <surname>Johansen</surname>
          </string-name>
          , Dag Johansen, Thomas de Lange,
          <article-title>Michael A</article-title>
          .
          <string-name>
            <surname>Riegler</surname>
            , and
            <given-names>Pål</given-names>
          </string-name>
          <string-name>
            <surname>Halvorsen</surname>
          </string-name>
          .
          <year>2020</year>
          . Medico Multimedia Task at MediaEval 2020:
          <article-title>Automatic Polyp Segmentation</article-title>
          .
          <source>In Proc. of MediaEval 2020 CEUR Workshop.</source>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Debesh</surname>
            <given-names>Jha</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pia H Smedsrud</surname>
          </string-name>
          ,
          <article-title>Michael A Riegler, Pål Halvorsen</article-title>
          , Thomas de Lange, Dag Johansen, and
          <string-name>
            <given-names>Håvard D</given-names>
            <surname>Johansen</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>KvasirSEG: A Segmented Polyp Dataset</article-title>
          . In MMM.
          <fpage>451</fpage>
          -
          <lpage>462</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Alexander</surname>
            <given-names>V Mamonov</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Isabel N Figueiredo</surname>
          </string-name>
          ,
          <string-name>
            <surname>Pedro N Figueiredo</surname>
          </string-name>
          , and Yen-Hsi Richard Tsai.
          <year>2014</year>
          .
          <article-title>Automated polyp detection in colon capsule endoscopy</article-title>
          .
          <source>IEEE TMI 33</source>
          ,
          <issue>7</issue>
          (
          <year>2014</year>
          ),
          <fpage>1488</fpage>
          -
          <lpage>1502</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Balamurali</surname>
            <given-names>Murugesan</given-names>
          </string-name>
          , Kaushik Sarveswaran, Sharath M Shankaranarayana,
          <string-name>
            <given-names>Keerthi</given-names>
            <surname>Ram</surname>
          </string-name>
          , Jayaraj Joseph, and
          <string-name>
            <given-names>Mohanasankar</given-names>
            <surname>Sivaprakasam</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Psi-Net: Shape and boundary aware joint multi-task deep network for medical image segmentation</article-title>
          .
          <source>In IEEE EMBC</source>
          .
          <volume>7223</volume>
          -
          <fpage>7226</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Xuebin</surname>
            <given-names>Qin</given-names>
          </string-name>
          , Zichen Zhang, Chenyang Huang,
          <string-name>
            <given-names>Chao</given-names>
            <surname>Gao</surname>
          </string-name>
          , Masood Dehghan, and
          <string-name>
            <given-names>Martin</given-names>
            <surname>Jagersand</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Basnet: Boundary-aware salient object detection</article-title>
          .
          <source>In IEEE CVPR</source>
          .
          <volume>7479</volume>
          -
          <fpage>7489</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Olaf</surname>
            <given-names>Ronneberger</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Philipp</given-names>
            <surname>Fischer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Thomas</given-names>
            <surname>Brox</surname>
          </string-name>
          .
          <year>2015</year>
          . U-Net:
          <article-title>Convolutional networks for biomedical image segmentation</article-title>
          .
          <source>In MICCAI</source>
          . Springer,
          <fpage>234</fpage>
          -
          <lpage>241</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Nima</surname>
            <given-names>Tajbakhsh</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suryakanth R Gurudu</surname>
            , and
            <given-names>Jianming</given-names>
          </string-name>
          <string-name>
            <surname>Liang</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Automated polyp detection in colonoscopy videos using shape and context information</article-title>
          .
          <source>IEEE TMI 35</source>
          ,
          <issue>2</issue>
          (
          <year>2015</year>
          ),
          <fpage>630</fpage>
          -
          <lpage>644</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Jun</surname>
            <given-names>Wei</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Shuhui</given-names>
            <surname>Wang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Qingming</given-names>
            <surname>Huang</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>F3Net: Fusion, Feedback and Focus for Salient Object Detection</article-title>
          .
          <source>In AAAI.</source>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Yunchao</surname>
            <given-names>Wei</given-names>
          </string-name>
          , Jiashi Feng, Xiaodan Liang, Ming-Ming Cheng, Yao Zhao,
          <string-name>
            <given-names>and Shuicheng</given-names>
            <surname>Yan</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Object region mining with adversarial erasing: A simple classification to semantic segmentation approach</article-title>
          .
          <source>In IEEE CVPR</source>
          .
          <volume>1568</volume>
          -
          <fpage>1576</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Zhe</surname>
            <given-names>Wu</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Li</given-names>
            <surname>Su</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Qingming</given-names>
            <surname>Huang</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Cascaded partial decoder for fast and accurate salient object detection</article-title>
          .
          <source>In IEEE CVPR</source>
          .
          <volume>3907</volume>
          -
          <fpage>3916</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Lequan</surname>
            <given-names>Yu</given-names>
          </string-name>
          , Hao Chen, Qi Dou, Jing Qin, and Pheng Ann Heng.
          <year>2016</year>
          .
          <article-title>Integrating online and ofline three-dimensional deep learning for automated polyp detection in colonoscopy videos</article-title>
          .
          <source>IEEE JBHI 21</source>
          ,
          <issue>1</issue>
          (
          <year>2016</year>
          ),
          <fpage>65</fpage>
          -
          <lpage>75</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Ruikai</surname>
            <given-names>Zhang</given-names>
          </string-name>
          , Yali Zheng,
          <source>Carmen CY Poon, Dinggang Shen, and James YW Lau</source>
          .
          <year>2018</year>
          .
          <article-title>Polyp detection during colonoscopy using a regression-based convolutional neural network with a tracker</article-title>
          .
          <source>Pattern Recognition</source>
          <volume>83</volume>
          (
          <year>2018</year>
          ),
          <fpage>209</fpage>
          -
          <lpage>219</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>Shihao</surname>
            <given-names>Zhang</given-names>
          </string-name>
          , Huazhu Fu, Yuguang Yan, Yubing Zhang, Qingyao Wu, Ming Yang,
          <string-name>
            <given-names>Mingkui</given-names>
            <surname>Tan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Yanwu</given-names>
            <surname>Xu</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Attention Guided Network for Retinal Image Segmentation</article-title>
          .
          <source>In MICCAI</source>
          .
          <volume>797</volume>
          -
          <fpage>805</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <surname>Zhijie</surname>
            <given-names>Zhang</given-names>
          </string-name>
          , Huazhu Fu, Hang Dai, Jianbing Shen,
          <string-name>
            <given-names>Yanwei</given-names>
            <surname>Pang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Ling</given-names>
            <surname>Shao</surname>
          </string-name>
          .
          <year>2019</year>
          . ET-Net:
          <article-title>A generic edge-attention guidance network for medical image segmentation</article-title>
          .
          <source>In MICCAI</source>
          . Springer,
          <fpage>442</fpage>
          -
          <lpage>450</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <surname>Zhao</surname>
            <given-names>Zhang</given-names>
          </string-name>
          , Zheng Lin, Jun
          <string-name>
            <surname>Xu</surname>
          </string-name>
          , Wenda Jin,
          <string-name>
            <surname>Shao-Ping Lu</surname>
            , and
            <given-names>DengPing</given-names>
          </string-name>
          <string-name>
            <surname>Fan</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>Bilateral attention network for rgb-d salient object detection</article-title>
          . arXiv preprint arXiv:
          <year>2004</year>
          .
          <volume>14582</volume>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Zhengxin</surname>
            <given-names>Zhang</given-names>
          </string-name>
          , Qingjie Liu, and
          <string-name>
            <given-names>Yunhong</given-names>
            <surname>Wang</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Road extraction by deep residual u-net</article-title>
          .
          <source>IEEE Geoscience and Remote Sensing Letters</source>
          <volume>15</volume>
          ,
          <issue>5</issue>
          (
          <year>2018</year>
          ),
          <fpage>749</fpage>
          -
          <lpage>753</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <surname>Zongwei</surname>
            <given-names>Zhou</given-names>
          </string-name>
          , Md Mahfuzur Rahman Siddiquee, Nima Tajbakhsh, and
          <string-name>
            <given-names>Jianming</given-names>
            <surname>Liang</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Unet++: A nested u-net architecture for medical image segmentation</article-title>
          .
          <source>IEEE TMI</source>
          (
          <year>2019</year>
          ),
          <fpage>3</fpage>
          -
          <lpage>11</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>