<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Deep Learning based TB Severity Prediction</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ujjwel Balwal</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Srinivasa Arun Yeragudipati</string-name>
          <email>srinivasaarun17166g@cse.ssn.edu.in</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bhuvana Jayaraman</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mirnalinee Thanga Nadar Thanga Th</string-name>
          <email>mirnalineettg@ssn.edu.in</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of CSE, SSN College of Engineering</institution>
          ,
          <addr-line>Chennai</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Computer Aided Diagnosis (CAD) of diseases has undergone large developments with the application of deep learning algorithms to detect the presence of diseases. This paper presents an approach for predicting the presence of tuberculosis, caverns and pleurisy in a set of 3D CT scans of the chests of patients, which is the key task of the ImageCLEF 2020 Tuberculosis challenge. We used the masks provided by the ImageCLEF organizers to segment the 3D CT images, made 2D projections of the segmented 3D images, and augmented them in order to balance the images in the dataset. An AlexNet based model is used to predict the probability of the presence of tuberculosis, caverns and pleurisy from these 2D projections. We achieved the eighth place out of all the teams who made a submission in this task, achieving a mean Area Under the Curve (AUC) score of 0.601 and a minimum AUC score of 0.432. An analysis of the results obtained by the authors following this approach presented, exploring the role of the model's complexity in reduction of the desired performance.</p>
      </abstract>
      <kwd-group>
        <kwd>Deep Learning</kwd>
        <kwd>Projections</kwd>
        <kwd>AlexNet</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Tuberculosis</p>
      <p>
        Computer Tomography
Tuberculosis (TB) is caused by bacteria (Mycobacterium tuberculosis) that most
often a ect the lungs of human beings [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], though can also a ect other parts of
the body. Conventionally, Lung TB diagnosis is done by analysing chest X-rays
(CXR) and/or microbiological con rmation (looking for bacterium
Mycobacterium tuberculosis, MTB) using various techniques [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In recent years,
developments in Computer-Aided Diagnosis (CAD) have started gaining traction and
has made huge contributions to the detection and diagnosis of Tuberculosis and
analysis of 3D Computed Tomography (CT) images is a vital step in diagnosing
TB. Techniques using heuristic knowledge extracted from the bacilli bacteria's
shape and colour have shown promising results [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], and proprietary
technology around automated screening is also being developed by major healthcare
companies [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. According to a report in 2013, around 3 million cases of TB
went undiagnosed, mainly because of undertrained sta , inaccurate tests, lack
of equipment [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Thus, the availability of Digital CXR, CTs with automated
computer-aided interpretation is much needed to curb the potentially lethal
disease, particularly in low resource high-burden settings.
      </p>
      <p>This approach of analyzing 3D CT images of lungs for Tuberculosis consists
of projecting the 3D image into 2D on the three planes - XY, YZ and XZ
respectively. Following this, an AlexNet based model is used to predict the probability
of a particular lung a ected by TB, the probability of the presence of Caverns,
and the probability of the presence of Pleurisy, respectively.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Task and Dataset</title>
      <p>
        The main task for ImageCLEF 2020 tuberculosis [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] is to determine the
probability of a patient su ering from tuberculosis by determining the probabilities for
the following criteria: LeftLungA ected, RightLungA ected, CavernsLeft,
CavernsRight, PleurisyLeft, PleurisyRight.
      </p>
      <p>The working notes of the participating teams are to be published in the
Proceedings of the 11th International Conference of the CLEF Association (CLEF
2020) [11]</p>
      <p>The provided dataset, ImageCLEFmed Tuberculosis 2020, comprises of chest
Computed Tomography (CT) scans of 403 TB patients. Out of these samples,
283 are designated for training and remaining 120 for testing. The CT images
are digitized as a set of 2D slices and the distance between each of these 2D slices
can vary between 0.5 - 5mm in any axis, depending upon the resolution of the
3D image. In our case, these slices are stored in the compressed NIfTI format.
3
3.1</p>
    </sec>
    <sec id="sec-3">
      <title>Methodology</title>
      <sec id="sec-3-1">
        <title>Data Preprocessing and Dataset Creation</title>
        <p>
          The data preprocessing task is performed similar to the method used by
Vitali Liauchuk [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] in the submission of the previous year's ImageCLEF Medical
Tuberculosis task. For our submissions, we use the rst mask [12] for
predicting LungA ected, Caverns and the second mask [13] for detecting Pleurisy. The
3D NIfTI image is compressed into a pseudo-RGB image where the rst (red)
channel contains mean values, second (green) channel contains maximum
values and the third (blue) channel comprises standard deviation values. For each
lung, these pseudo-RGB images are generated for three planes - XY, YZ and
XZ. After the projection process, we cut the images in half, separating the two
lungs. Thus, each of the 3D image is mapped to six 2D projections: XY Left,
XY Right, YZ Left, YZ Right, XZ Left, XZ Right.
        </p>
        <p>Using random selection on generated images, we take a ratio of 3:1 for training
and validation sets, respectively. We observed that the data is skewed, especially
with pleurisy where the number of una ected samples outweigh the a ected
ones. We augment the images using random ipping and rotation to balance the
dataset wherever possible.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Proposed Model Architecture</title>
        <p>The 2D projections of the lung images does the heavy-lifting of extracting the
important features required to mark the lesion or deformity in the lung. These
features can easily be detected by a simple neural network. It would thus reduce
the problem to detection of these lesions in the 2D projections. After
prepossessing and data augmentation, we observed that recent state-of-the-art image
classi cation neural networks over t quickly because of their inherent complex
feature extraction. Hence we choose a relatively simple AlexNet-based solution
to train for prediction.</p>
        <p>AlexNet [14] is a CNN with stacked convolutional layers. It consists of 11 11,
5 5, 3 3, convolutions, max pooling, dropout, data augmentation, ReLU
activations and SGD with momentum. It attaches ReLU activations after every
convolutional and fully-connected layer.</p>
        <p>Keeping the underlying AlexNet structure, we changed the number of out
channels for ve ConvBlocks to 64, 128, 256, 512, 512 respectively, each one of
the ConvBlock is then followed by ReLU and MaxPooling of stride 2.</p>
        <p>The basic block of our architecture, ConvBlock, is a simple 2D convolutional
layer, followed be a ReLU layer and a MaxPool2D layer. The simple 2D
convolutional layer has a kernel size of 2 and stride of 2, and the MaxPool2D layer has
a kernel size of 3 and stride of 1. We added Dropout, linear, ReLU and again
Linear layer after the ve ConvBlocks, in order to build a suitable model for
Binary classi cation, which is then changed into probability distribution.</p>
        <p>The architecture of the proposed model used for this task is shown in Fig 1.
The input to this model is a pseudo-RGB image which consists of three channels.
The output is the probability distribution of the target variable that signi es the
probability of lung being a ected. We use PyTorch [16] to construct and train
the model. Starting from left, the number in parentheses indicates the number of
output channels for each of the ConvBlocks. Following ConvBlocks is the dropout
layer, which has an activation probability of 0.5. The following Linear layer is a
fully connected layer, the number of nodes present in whom are represented by
the bracket enclosed numbers. To introduce non linearity among the two Linear
layers, we use ReLU activation function. SoftMax layer follows the nal Linear
layer, which converts the output of the model into a probability distribution.
We trained separate models for Lung, Caverns and Pleurisy respectively. We
observed that the patterns associated with the disease do not change with respect
to the side of the lung, whether left or right. Therefore, we use left and right
a ected lung samples from the dataset collectively, to train one model.
Separate models were trained for all 3 di erent 2D projections, thus for each subtask
say, LungA ected, we train one model for XY-projection, one for YZ-projection
and one for ZX-projection. We approach the training as a classi cation task.
We used Kaiming initialization [17] to initialize the weights of the network and
trained the model using a mini-batch size of 4. We used Categorical cross entropy
loss and a learning rate of 10-5 while training. We used Adam optimizer with
weight decay of 0.0005. The loss and accuracy saturated after 5 to 7 epochs for
all the training sets. In the end, we have a total of 9 trained models, XY, YZ,
ZX - projection model for each of Lung, Caverns and Pleurisy detection subtask.</p>
        <p>We performed test time augmentation while predicting the probability
values. The XY, YZ and ZX projections were taken and given as input to the model
separately. The trained model does not di erentiate in left and right lung, so we
pass both projections one after the other to the same model and store the
prediction values separately. The resulting outputs were taken and given as input to
a SoftMax layer, which converts them into a probability distribution. This
provides us with the probabilities for the predictions. While predicting the values
individually for XY, YZ and ZX projections, the prediction scores for
LungAffected and Caverns were almost similar, but the scores of Pleurisy values for
the XY and YZ axes were low while the ZX-axis score was substantially higher.
Hence, we used only the ZX-projection for Pleurisy and mean of all the
projections for others.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Experimental Results</title>
      <p>We did two submissions for the task. The primary di erence between the two
models was the number of channels and inclusion of test time augmentation,
which gave a signi cant improvement to the scores, bringing the mean AUC to
0.601 shown in Table 1. We mentioned the approach and methodology in section
3. With our best prediction, we achieved the eighth place out of the nine teams
who made a submission and the results are shown in Table 2. Our best
submission, Run 2 (JBTTM) achieved a mean AUC score of 0.601 and min AUC score
of 0.432, Run 1 achieved better min AUC score of 0.471 but poor Mean AUC
score of 0.484. The top ranking team SenticLab.UAIC scored 0.924 and 0.885
on Mean and Min AUC respectively.</p>
      <p>The method of using AlexNet on 2D projections suggested promising results.
We detected TB lesions with a substantial certainty and performance in case of
caverns is good. However, it did not perform as expected in detecting pleurisy,
partly because it is a non-localized phenomenon like a TB lesion. We can also
attribute the reason for a lower score to the large number of channels in the
network which added unnecessary complexity to the model and caused it to
over t that resulted in poor performance of our model, especially in detecting
pleurisy.</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>In this work, we experimented with an AlexNet Based Model to predict the
probability of a lung a ected by TB, the probability of the presence of Caverns,
and the probability of the presence of Pleurisy respectively. Data preprocessing
and augmentations are done as described in the previous sections to prepare the
images for the deep neural network. The performance of the models submitted
are measured using mean AUC and minimum AUC. Our second submission
run has achieved 0.602 and 0.432 for the speci ed measures. The performance
when compared with the other submissions of this task has shown reasonable
yet improvable results. Our team, JBTTM, has achieved eighth place out of the
nine teams who have submitted their runs. We observed that the increased model
complexity led to over tting and thereby pulling down the model performance.
A smaller and simpler model, along with proper regularization techniques, could
be used in order to achieve a better result.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. WHO Fact Sheet:
          <volume>24</volume>
          /03/2020: Tuberculosis. World Health Organization, https://www.who.int/news-room/fact-sheets/detail/tuberculosis.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Suleiman</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lessem</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          :
          <article-title>An Activist's Guide To Tuberculosis Drugs</article-title>
          . Treatment Action Group (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Forero</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cristobal</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alvarez-Borrego</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Automatic identi cation techniques of tuberculosis bacteria</article-title>
          .
          <source>In: Proceedings of SPIE - The International Society for Optical Engineering</source>
          , Vol.
          <volume>5203</volume>
          (
          <year>2003</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Veropoulos</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Campbell</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Learmonth</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Knight</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Simpson</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>The automatic identi cation of tubercle bacilli using image processing and neural computing techniques</article-title>
          .
          <source>In: Proceedings of the eighth international conference on arti cial neural networks</source>
          , Vol.
          <volume>2</volume>
          (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Zaidi</surname>
            ,
            <given-names>S.M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Habib</surname>
            ,
            <given-names>S.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Van Ginneken</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Evaluation of the diagnostic accuracy of Computer-Aided Detection of tuberculosis on Chest radiography among private sector patients in Pakistan</article-title>
          .
          <source>Sci Rep</source>
          <volume>8</volume>
          ,
          <issue>12339</issue>
          (
          <year>2018</year>
          ). https://doi.org/10.1038/s41598- 018-30810-1
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Faiz</given-names>
            <surname>Ahmad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Pande</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Tessema</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Song</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Benedetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Pai</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          ,
          <string-name>
            <surname>L</surname>
          </string-name>
          onnroth, M.,
          <string-name>
            <given-names>M.</given-names>
            <surname>Denkinger</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          :
          <article-title>Computer-aided reading of tuberculosis chest radiography: moving the research agenda forward to inform policy</article-title>
          .
          <source>European Respiratory Journal Jul</source>
          <year>2017</year>
          ,
          <volume>50</volume>
          (
          <issue>1</issue>
          ) 1700953; DOI: 10.1183/13993003.
          <fpage>00953</fpage>
          -
          <lpage>2017</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Murphy</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Habib</surname>
            ,
            <given-names>S.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zaidi</surname>
            ,
            <given-names>S.M.A.</given-names>
          </string-name>
          :
          <article-title>Computer aided detection of tuberculosis on chest radiographs: An evaluation of the CAD4TB v6 system</article-title>
          .
          <source>Sci Rep</source>
          <volume>10</volume>
          ,
          <issue>5492</issue>
          (
          <year>2020</year>
          ). https://doi.org/10.1038/s41598-020-62148-y
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Stop TB Partnership - Fact Sheet</surname>
          </string-name>
          :
          <source>The Missing 3 Million</source>
          , http://www.stoptb.org/assets/documents/resources/factsheets/ Stop%20TB
          <source>%20infographic%20Missing%203%20Million.pdf.</source>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Kozlovski</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liauchuk</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dicente</surname>
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yashin</surname>
            , Tarasau,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kovalev</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Muller</surname>
          </string-name>
          , H.:
          <article-title>Overview of ImageCLEFtuberculosis 2020 - Automatic CT-based Report Generation and Tuberculosis Severity Assessment</article-title>
          . CLEF working notes,
          <source>CEUR</source>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Liauchuk</surname>
          </string-name>
          , V.:
          <article-title>Projection-based CT Image Analysis for TB Severity Scoring and CT Report Generation</article-title>
          .
          <source>In: CLEF2017 Working Notes. CEUR Workshop Proceedings</source>
          , Dublin, Ireland (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>