<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>ORCID:</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Segmentation of Seismic Images</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ekaterina Tolstaya</string-name>
          <email>ekaterina.tolstaya@aramcoinnovations.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anton Egorov</string-name>
          <email>anton.egorov@aramcoinnovations.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Aramco Research Center, Moscow, Aramco Innovations LLC</institution>
          ,
          <addr-line>Leninskie Gory, 1 bld. 75b, Moscow, 119234</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Seismic Facies Labeling, UNet</institution>
          ,
          <addr-line>Domain Specific Augmentation, Pseudo Labels</addr-line>
        </aff>
      </contrib-group>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>In this paper we propose a method of seismic facies labeling. Given the three-dimensional image cube of seismic sounding data, labeled by a geologist, we first train on the part of the cube, then we propagate labels to the rest of the cube. We use open-source fully annotated 3D geological model of the Netherlands F3 Block. We apply state-of-the-art deep network architecture, adding on top a 3D fully connected conditional random field (CRF) layer. This allows to get smoother labels on data cube cross-sections. Pseudo labeling technique is used to overcome training data scarcity and predict more reliable labels for geological units. Additional data augmentation allows also to enlarge training dataset. The results show superior network performance over existing baseline model.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Seismic sounding of the Earth gives some insight on the geological structure of the formation and
allows predicting the presence of oil or gas traps inside. Usually, seismic sounding is carried out at the
early stage of potential reservoir exploration, to mark prospective locations of exploration and
production wells.</p>
      <p>Seismic facies labeling task consists of assigning specific geological rock types to the seismic data.
When done manually, this work is tedious and time-consuming. That is why automation of this
procedure can save a lot of time and effort of geologists. A lot of work is already invested in the
automation of seismic data labeling. The recent trend includes using modern approaches based on deep
learning and semantic segmentation.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Prior work</title>
      <p>
        [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]).
      </p>
      <p>
        A lot of researcher efforts are now aimed at automating the process of seismic labeling task ([
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ],
One of the first attempts of seismic image analysis and geological features detection (gas chimneys
and faults) with use of neural network was made by [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. The authors applied image processing
techniques to analyze seismic data represented by sections of 3D cubes containing information related
to seismic attributes – measured time, amplitude, frequency and attenuation of reflected seismic waves.
Authors of [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] applied competitive networks for labeling seismic facies known from well information
and interpolating known facies from wells to the rest of the reservoir region. Later a lot of researchers
continued this work using different network architectures.
      </p>
      <p>
        A newly presented by [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] UNet architecture showed higher performance in the tasks of image
segmentation. Authors of [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] compared performance of multi-layer perceptron and Unet for the task of
facies labeling. Their conclusion was that Unet performs better. This architecture was also applied to
seismic labeling by [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>2021 Copyright for this paper by its authors.</p>
      <p>In case when no labels for facies are available, unsupervised learning methods can be applied. For
example, [9] proposed to use deep convolutional autoencoders and clustering of deep-feature vectors.
This approach allows performing fast analysis of geological patterns.</p>
      <p>Recently self-supervised learning methods were applied on a wide range of different tasks. Starting
from natural language processing, where a lot of data is already available in textual form, this technique
made possible to train large scale neural nets. Similar technique was also applied to image data for
learning representations. This helped to pre-train neural nets for the task where only a small amount of
labeled data is available.</p>
      <p>To overcome the problem of labeled data scarcity, semi-supervised learning techniques were
proposed in the literature, such as
 transfer learning, where the model is first pre-trained on some data and then trained on available
labeled data [10];
 weak labels technique, which allows learning from non-accurate labels [11];
 pseudo-labels – use a trained model to predict labels for unlabeled data, and possibly add these
data to the training set ([12], [13]);
 meta-pseudo labels – state-of-the-art technique where two models are trained in parallel,
teacher and student, first student learns from teacher, and after that teacher is learning on student
outputs from class conditional probabilities of the unlabeled test set.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Proposed solution 3.1. Data</title>
      <p>
        In this work we consider an open-source fully annotated 3D geological model of the Netherlands F3
Block [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. This model is based on the study of the 3D seismic data in addition to 26 well logs and
contains seismic data and labeled geological structures. Training data is an image cube of size
401×701×255 cells, test set #1 contains cube of size 201×701×255 cells. The figures below illustrate
the data:
      </p>
      <p>Figure 1 shows full seismic image with labels, Figure 2 shows spatial position of training and testing
cubes.
inline slices
test
train
t
e
s
t
s
e
T</p>
      <p>Train set
300
100
300</p>
      <p>inline
700</p>
      <p>The train and test sets contain six groups of lithostratigraphic units. Table 1 below illustrates the
percentage of present classes in the sets:</p>
      <p>The test data is adjacent to the training cube, so, the slices that are closer to the boundary will be
predicted better than the ones that are further away.
3.2.</p>
    </sec>
    <sec id="sec-4">
      <title>Model</title>
      <p>We apply the well-known UNet architecture with Efficient net B1 as backbone ([14], [15], [16]) and
five pooling levels, including scSE (Concurrent Spatial and Channel “Squeeze &amp; Excitation”) attention
layer [17]. The model has ~9M trainable parameters.</p>
      <p>The model is two-dimensional, it takes patches of size 256 by 256 pixels as input and predicts the
labels of the geological structures. Additional channel with depth gradient is introduced to the seismic
image patches, so that the model does not mix up deep and shallow layers (Figure 3).
a)
b)
c)
d)
simulate stretching and faults in the data (shown in Figure 4).
Therefore, it was decided to use the approach of pseudo-labels.</p>
      <p>Firstly, a model is trained. After that, a part of labels is predicted and added to the training set. Then
the model is re-trained on the expanded dataset as shown in Figure 5.</p>
      <sec id="sec-4-1">
        <title>Train UNet</title>
        <p>Make inference
of class
probabilities by</p>
      </sec>
      <sec id="sec-4-2">
        <title>UNet</title>
      </sec>
      <sec id="sec-4-3">
        <title>Train 3D CRF model</title>
      </sec>
      <sec id="sec-4-4">
        <title>Predict part of</title>
        <p>3D step with CRF model is added to force label continuity [18], because very often inline label slices
have a lot of artifacts near the class boundaries, where the model is not confident enough in prediction,
and assigns probabilities close to 0.5 to two different classes. CRF layer helps to smooth these
discontinuities and provide smoother labels as pseudo-labels for model re-training. We also apply an
idea from ([19] and [20]), where authors propose to use LSTM block in the bottleneck of the UNet
architecture but did not gain success.</p>
        <p>Loss function is a combination of cross-entropy (  ), dice loss ( 
) and total variation (  ) loss
 =    
+  
where C is number of classes, N is number of observations (e.g. pixels in the image), and   , are output
class probabilities and   , is target distribution for observation i and class c. All loss functions use class
weights   , which are inversely proportional to the class sizes, with their sum equal to 1.</p>
        <p>Dice loss usually performs better for imbalanced classes, which is often the case for the segmentation
tasks, where background (non-object) area is larger than the area occupied by the object. Dice loss was
(1)
(2)
proposed in the paper by [21], where it was stated, that it works better than logistic loss with class
weights. We apply multi-class dice loss function  
as follows:
1 
∑</p>
        <p>=1   ∑ =1(| 1(  , )| + | 2(  , )|),
where ε is an additional term that solves the division-by-zero problem (it is set to be a small number).</p>
        <p>TV loss is computed as
predicted labels.
3.3.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Results</title>
      <p>where  1 and  2 are finite difference operations along the first and second spatial dimensions. This
term enforces the predicted probabilities to be more ‘blocky’, which potentially leads to smoother
The weights for loss terms were chosen empirically:  
= 0.25,  
= 0.65 and  
= 0.1.</p>
      <p>The results show improvement after the application of the pseudo labeling technique. The test dataset
was divided in two parts, after the first training round the labels of the first half of test cube were
predicted, and a half of test set was added to the training set. Then the model was re-trained. The metrics
(3)
(4)
are shown in the Table 2 below.</p>
    </sec>
    <sec id="sec-6">
      <title>4. Discussion</title>
      <p>In this work we applied the pseudo labeling technique to the seismic facies segmentation task. We
show that adding predictions of slices that are close to the training cube can improve network’s overall
performance on the test set. We also applied the TV loss component that forced the predictions to be
smooth, and the special type of non-linear warping of patches, to increase the diversity in the training
data. The results show superior performance over reported techniques.
5. References
[9] V. Puzyrev, C. Elders, Unsupervised seismic facies classification using deep convolutional
autoencoder. arXiv preprint arXiv:2008.01995 (2020). URL:https://arxiv.org/abs/2008.01995
[10] D. Chevitarese, D. Szwarcman, R. M. D. Silva, E. V. Brazil, Transfer learning applied to seismic
images classification. AAPG Annual and Exhibition (2018).
URL:https://www.searchanddiscovery.com/documents/2018/42285chevitarese/ndx_chevitarese.p
df
[11] Y. Alaudah, S. Gao, G. AlRegib, Learning to label seismic structures with deconvolution networks
and weak labels, in: SEG Technical Program Expanded Abstracts 2018, pp. 2121-2125.
doi:10.1190/segam2018-2997865.1
[12] A. Saleem, J. Choi, D. Yoon, J. Byun, Facies classification using semi-supervised deep learning
with pseudo-labeling strategy, in: SEG Technical Program Expanded Abstracts 2019, pp.
31713175. doi:10.1190/segam2019-3216086.1
[13] Y. Babakhin, A. Sanakoyeu, and H. Kitamura, “Semi-supervised Segmentation of Salt Bodies in
Seismic Images Using an Ensemble of Convolutional Neural Networks,” Pattern Recognition
2019, pp. 218–231. doi:10.1007/978-3-030-33676-9_15
[14] M. Tan, Q. Le, Efficientnet: Rethinking model scaling for convolutional neural networks,
in: Proceedings of International Conference on Machine Learning 2019, pp. 6105-6114
[15] B. Baheti, S. Innani, S. Gajre, S. Talbar, Eff-unet: A novel architecture for semantic segmentation
in unstructured environment, in: Proceedings of the IEEE/CVF Conference on Computer Vision
and Pattern Recognition Workshops 2020, pp. 358-359. doi:10.1109/cvprw50498.2020.00187
[16] L. D. Huynh, N. Boutry, A U-Net++ With pre-trained EfficientNet backbone for segmentation of
diseases and artifacts in endoscopy images and videos, in: EndoCV@ ISBI 2020, pp. 13-17.</p>
      <p>URL:http://ceur-ws.org/Vol-2595/endoCV2020_paper_id_11.pdf
[17] A. G. Roy, N. Navab, C. Wachinger, Concurrent spatial and channel ‘squeeze &amp; excitation’in fully
convolutional networks, in: Proceedings of International conference on medical image computing
and computer-assisted intervention 2018, pp. 421-429. doi:10.1007/978-3-030-00928-1_48
[18] L. C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, A. L.Yuille, Semantic image segmentation
with deep convolutional nets and fully connected CRFs. arXiv preprint arXiv:1412.7062 (2014).</p>
      <p>URL:https://arxiv.org/abs/1412.7062
[19] A. A. Novikov, D. Major, M. Wimmer, D. Lenis, K. Bühler, Deep sequential segmentation of
organs in volumetric medical scans. IEEE transactions on medical imaging, 38(5), (2018)
12071215. doi:10.1109/tmi.2018.2881678
[20] A. Pfeuffer, K. Schulz, K. Dietmayer, Semantic segmentation of video sequences with
convolutional lstms, in: Proceedings of IV IEEE Intelligent Vehicles Symposium 2019, pp.
14411447. doi:10.1109/ivs.2019.8813852
[21] F. Milletari, N. Navab, S. A. Ahmadi, V-net: Fully convolutional neural networks for volumetric
medical image segmentation, in: Proceedings of 4th international conference on 3D vision 2016,
pp. 565-571. doi:10.1109/3dv.2016.79
[22] M. Q. Nasim, T. Maiti, A. Shrivastava, T. Singh, J. Mei, Seismic facies analysis: a deep domain
adaptation approach. arXiv preprint arXiv:2011.10510 (2020).</p>
      <p>URL:https://arxiv.org/abs/2011.10510
[23] M. H. Vu, G. Grimbergen, T. Nyholm, T. Löfstedt, Evaluation of multislice inputs to convolutional
neural networks for medical image segmentation. Medical Physics, 47(12), (2020) 6216–6231.
doi:10.1002/mp.14391</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Alaudah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Michałowicz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Alfarraj</surname>
          </string-name>
          ,
          <string-name>
            <surname>G. AlRegib,</surname>
          </string-name>
          <article-title>A machine-learning benchmark for facies classification</article-title>
          .
          <source>Interpretation</source>
          ,
          <volume>7</volume>
          (
          <issue>3</issue>
          ), (
          <year>2019</year>
          ),
          <fpage>SE175</fpage>
          -
          <lpage>SE187</lpage>
          . doi:
          <volume>10</volume>
          .1190/INT-2018
          <source>-0249.1</source>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>L.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Dong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.E.</given-names>
            <surname>Clee</surname>
          </string-name>
          ,
          <article-title>A scalable deep learning platform for identifying geologic features from seismic attributes</article-title>
          .
          <source>The Leading Edge</source>
          ,
          <volume>36</volume>
          (
          <issue>3</issue>
          ), (
          <year>2017</year>
          )
          <fpage>249</fpage>
          -
          <lpage>256</lpage>
          . doi:
          <volume>10</volume>
          .1190/tle36030249.1
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Alaudah</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          and
          <string-name>
            <surname>AlRegib</surname>
          </string-name>
          , G.:
          <article-title>Weakly-supervised labeling of seismic volumes using reference exemplars</article-title>
          ,
          <source>in: Proceedings of the 2016 IEEE International Conference on Image Processing</source>
          <year>2016</year>
          , pp.
          <fpage>4373</fpage>
          -
          <lpage>4377</lpage>
          . doi:
          <volume>10</volume>
          .1109/icip.
          <year>2016</year>
          .7533186
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>P.</given-names>
            <surname>Meldahl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Heggland</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Bril</surname>
          </string-name>
          , P. de Groot,
          <article-title>Identifying faults and gas chimneys using multiattributes and neural networks</article-title>
          .
          <source>The Leading Edge</source>
          ,
          <volume>20</volume>
          (
          <issue>5</issue>
          ), (
          <year>2001</year>
          )
          <fpage>474</fpage>
          -
          <lpage>482</lpage>
          . doi:
          <volume>10</volume>
          .1190/1.1438976
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>M. M. Saggaf</surname>
            ,
            <given-names>M. N.</given-names>
          </string-name>
          <string-name>
            <surname>Toksöz</surname>
            , and
            <given-names>M. I. Marhoon</given-names>
          </string-name>
          ,
          <article-title>Seismic facies classification and identification by competitive neural networks</article-title>
          .
          <source>Geophysics</source>
          ,
          <volume>68</volume>
          (
          <issue>6</issue>
          ), (
          <year>2003</year>
          )
          <fpage>1984</fpage>
          -
          <lpage>1999</lpage>
          . doi:
          <volume>10</volume>
          .1190/1.1635052
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>O.</given-names>
            <surname>Ronneberger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Fischer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Brox</surname>
          </string-name>
          , U-Net:
          <article-title>Convolutional Networks for Biomedical Image Segmentation</article-title>
          .
          <source>arXiv:1505.04597</source>
          (
          <year>2015</year>
          ). URL:https://arxiv.org/abs/1505.04597
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>H.</given-names>
            <surname>Di</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>AlRegib, Why using CNN for seismic interpretation? An investigation</article-title>
          ,
          <source>in: SEG Technical Program Expanded Abstracts</source>
          <year>2018</year>
          , pp.
          <fpage>2216</fpage>
          -
          <lpage>2220</lpage>
          . doi:
          <volume>10</volume>
          .1190/segam2018-
          <fpage>2997155</fpage>
          .
          <fpage>1</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Alfarhan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Deriche</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Maalej</surname>
          </string-name>
          ,
          <article-title>Robust concurrent detection of salt domes and faults in seismic surveys using an improved UNet architecture</article-title>
          .
          <source>IEEE Access</source>
          .
          <article-title>(</article-title>
          <year>2020</year>
          ). doi:
          <volume>10</volume>
          .1109/access.
          <year>2020</year>
          .3043973
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>