<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Crack Extension Life and Critical Crack Length Prediction Based on XGBoost</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Yu Liu</string-name>
          <email>liuyu1@comac.cc</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kaixing Zhao</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fusheng Hou</string-name>
          <email>houfusheng@comac.cc</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xiaohui Hao</string-name>
          <email>haoxiaohui@comac.cc</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>COMAC Shanghai Aircraft Design &amp; Research Institute</institution>
          ,
          <addr-line>No.5188 Jinke Road, Shanghai, 200120</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Northwestern Polytechnical University</institution>
          ,
          <addr-line>127 West Youyi Road, Beilin District, Xi'an Shaanxi, 710072</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <fpage>86</fpage>
      <lpage>91</lpage>
      <abstract>
        <p>Damage tolerance design can ensure the structural safety of civil aircraft throughout its life cycle, which requires accurate analysis of crack extension life and crack extension length. This paper proposes an XGBoost-based crack extension life and crack extension length prediction method for civil aircraft structures. The method uses machine learning algorithms to train the structural state prediction model. The advantage of this method is that it can quickly determine the crack life and crack length without relying on the processing technology and engineering diagnosis experience of a large amount of collected data, which provides a more flexible method to determine the crack extension life and crack extension length under various influencing factors. By comparing a variety of machine learning algorithms, XGBoost model obtained the highest test scores, the experimental results show that the method achieves accurate prediction of crack extension life and critical crack length, which can be used for rapid engineering evaluation.</p>
      </abstract>
      <kwd-group>
        <kwd>1 Damage tolerance</kwd>
        <kwd>crack extension</kwd>
        <kwd>machine learning</kwd>
        <kwd>XGBoost</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Damage tolerance design is a modern fatigue fracture control method developed and progressively
applied since the 1970s to ensure the structural safety of structures throughout their life cycle. The
theory assumes that each structural material has internal defects in the course of processing or using, so
the rate of expansion of these defects and the remaining strength of the structure needs to be determined
using various damage theories (such as fracture mechanics) and a given external load [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Damage
tolerance is also a major component of the assessment for strength certification of aircraft by NASA,
FAA, and other agencies. In the damage tolerance analysis, in addition to determining the critical parts
of the damage tolerance and their sensitive parts, the accuracy of the crack extension life calculation
and crack extension length calculation for this part is an important part to ensure the correct conclusion
of the analysis [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. However, there are various types of structural details in the airframe structure and
their load states are complex, while the material of each part of the structure are different [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Therefore,
in addition to the traditional fracture mechanics-based analysis methods, a tool that can quickly evaluate
the crack extension life and the final crack length of the corresponding structure is also needed.
      </p>
      <p>
        XGBoost, which is short for eXtreme Gradient Boosting, is an algorithmic toolkit based on the
Boosting framework and is very powerful in parallel computation efficiency, missing value handling,
and prediction performance [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In data science, XGBoost is well suited for performing data mining; in
industrial large-scale data, the distributed version of XGBoost has extensive portability and supports
running on various distributed environments such as Kubernetes and Hadoop, making it a good solution
for industrial large-scale data.
      </p>
      <p>
        The current theories based on data-driven deep learning and machine learning, as the latest research
results in pattern recognition, have achieved fruitful results in big data processing in various fields with
powerful modeling and characterization capabilities. Therefore, using XGBoost-based crack extension
prediction can get rid of the reliance on the traditional analysis method of mechanism-based fracture
mechanics, complete the adaptive extraction of structural states and the construction of complex models,
and finally realize end-to-end modeling to complete the prediction of complex indicators [
        <xref ref-type="bibr" rid="ref5 ref6 ref7">5, 6, 7</xref>
        ].
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Crack extension dataset creation</title>
      <p>As a part of the aircraft, the fuselage skin is an important part of the whole aircraft structure. In this
paper, crack expansion prediction is performed for the fuselage and wing skin structure, which is usually
considered as the central crack of an infinite plate, and the simplified model is shown below.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Crack extension cycle and critical crack prediction based on XGBoost</title>
      <p>This section covers the modeling and prediction of crack extension life/critical crack length based
on XGBoost, and also includes the modeling process, data preprocessing, parameter optimization,
model training, model testing results and comparison of other algorithms.
3.1.</p>
    </sec>
    <sec id="sec-4">
      <title>Modeling Process</title>
      <p>The overall XGBoost prediction model can be divided into several parts: data reading and
preprocessing, parameter optimization for cross-validation, model training, model testing, and result
output. The flow is shown in Figure 2.</p>
    </sec>
    <sec id="sec-5">
      <title>Data preprocessing and parameter optimization</title>
      <p>The original data was saved in csv format, and after removing the samples containing missing values
and samples with too high lifetimes, the final 900 data sets were left. The total data set was partitioned
8:2, in which 80% of the training set (720 samples) was used for training the xgboost model and 20%
of the test set (180 samples) was used for performance testing.</p>
      <p>In modeling and training with xgboost, where the boosting round as an important parameter can be
searched using the cross-validation function that comes with the model. In this paper, we use Random
Search to find the optimal boosting round, with the upper limit of 500 cycles, Early stopping and
automatic termination after 5 times of no change in the optimization index, tolerance of 0.01,
crossvalidation of 10fold, and optimization index of MSE. The final best The final best value of boosting
round is 143.
3.3.</p>
    </sec>
    <sec id="sec-6">
      <title>Model training and model testing</title>
      <p>
        The XGBoost model is trained using the training dataset, where the boosting round is 143. the rest
of the model parameters are listed in the following table [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
      </p>
      <sec id="sec-6-1">
        <title>Parameters</title>
        <p>lambda</p>
        <p>eta
grow_policy
alpha
objective</p>
      </sec>
      <sec id="sec-6-2">
        <title>Value</title>
        <p>1.0
0.3
depthwise</p>
        <p>1.0
reg:squared error</p>
        <p>The model performance was tested using the full test set data, and the test set prediction scores are
shown in the figure. After 143boosting rounds of training, the performance metrics of the model are
shown in the following table.</p>
        <p>The predicted model weights are analyzed. As shown in figure 5, the weights vary widely among
the different attributes, which is of greater significance for analyzing the key influencing factors and
optimizing the model inputs, and the relatively high-weighted attribute such as width may provide more
information about the overall structural state of the aircraft.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Comparison with other algorithms</title>
      <p>
        This study also applies other machine learning methods for modeling, including generalized linear
regression models, decision tree, and support vector machine, to model the prediction of crack
expansion number and critical crack length as well. The prediction models were all trained using the
same 80% dataset (720 samples) and tested using 20% dataset (180 samples). One of the decision tree
models, max_depth, was kept consistent with the XGBoost parameters above. The corresponding
evaluation indexes are given for the performance of XGBoost and the other three prediction models in
the test set. The prediction result scores for the number of crack expansion cycles are shown in Table
3, and the prediction result scores for the critical crack length are shown in Table 4 [
        <xref ref-type="bibr" rid="ref12">12, 13, 14</xref>
        ].
      </p>
    </sec>
    <sec id="sec-8">
      <title>4. Conclusion</title>
      <p>The XGBoost model has the best test score regardless of the number of crack expansion cycles or
the length of critical cracks, and the model accuracy can meet engineering applications. Among the
other tested models, the decision tree model has the closest score to Xgboost, which is also consistent
with the performance consistency of the same tree model on the same data set.</p>
      <p>In this paper, the Xgboost-based model achieves the prediction of the extended life and critical crack
length of the center penetration crack of a flat plate. The model can be applied to the rapid assessment
of damage tolerance in engineering. However, the model data is obtained based on fracture mechanics
software simulations, which will have some deviation when applied to real aircraft structures, and can
be combined with crack extension test data for further migration learning.</p>
    </sec>
    <sec id="sec-9">
      <title>5. References</title>
      <p>[13] Myles A J, Feudale R N, Liu Y, et al. An introduction to decision tree modeling[J]. Journal of</p>
      <p>Chemometrics: A Journal of the Chemometrics Society, 2004, 18(6): 275-285.
[14] Nelder J A, Wedderburn R W M. Generalized linear models[J]. Journal of the Royal Statistical
Society: Series A (General), 1972, 135(3): 370-384.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Reifsnider</surname>
            <given-names>K L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Case</surname>
            <given-names>S W.</given-names>
          </string-name>
          <article-title>Damage tolerance and durability of material systems</article-title>
          [M].
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Jones</surname>
            <given-names>R.</given-names>
          </string-name>
          <article-title>Fatigue crack growth and damage tolerance</article-title>
          [J].
          <source>Fatigue &amp; Fracture of Engineering Materials &amp; Structures</source>
          ,
          <year>2014</year>
          ,
          <volume>37</volume>
          (
          <issue>5</issue>
          ):
          <fpage>463</fpage>
          -
          <lpage>483</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Toor P M.</surname>
          </string-name>
          <article-title>A review of some damage tolerance design approaches for aircraft structures</article-title>
          [J].
          <source>Engineering fracture mechanics</source>
          ,
          <year>1973</year>
          ,
          <volume>5</volume>
          (
          <issue>4</issue>
          ):
          <fpage>837</fpage>
          -
          <lpage>880</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Chen</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Benesty</surname>
            <given-names>M</given-names>
          </string-name>
          , et al.
          <article-title>Xgboost: extreme gradient boosting</article-title>
          [J].
          <source>R package version 0</source>
          .4-
          <issue>2</issue>
          ,
          <year>2015</year>
          ,
          <volume>1</volume>
          (
          <issue>4</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>4</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Konda</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Verma</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jayaganthan</surname>
            <given-names>R</given-names>
          </string-name>
          .
          <source>Machine Learning Based Predictions of Fatigue Crack Growth Rate of Additively Manufactured Ti6Al4V[J]. Metals</source>
          ,
          <year>2021</year>
          ,
          <volume>12</volume>
          (
          <issue>1</issue>
          ):
          <fpage>50</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Pierson</surname>
            <given-names>K</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rahman</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spear A D. Predicting</surname>
          </string-name>
          microstructure
          <article-title>-sensitive fatigue-crack path in 3D using a machine learning framework</article-title>
          [J].
          <source>Jom</source>
          ,
          <year>2019</year>
          ,
          <volume>71</volume>
          (
          <issue>8</issue>
          ):
          <fpage>2680</fpage>
          -
          <lpage>2694</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Raja</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chukka</surname>
            <given-names>S T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jayaganthan R</surname>
          </string-name>
          .
          <article-title>Prediction of fatigue crack growth behaviour in ultrafine grained al 2014 alloy using machine learning[</article-title>
          <source>J]. Metals</source>
          ,
          <year>2020</year>
          ,
          <volume>10</volume>
          (
          <issue>10</issue>
          ):
          <fpage>1349</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Wang</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhao</surname>
            <given-names>W</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Du</surname>
            <given-names>Y</given-names>
          </string-name>
          , et al.
          <article-title>Prediction of fatigue stress concentration factor using extreme learning machine</article-title>
          [J].
          <source>Computational Materials Science</source>
          ,
          <year>2016</year>
          ,
          <volume>125</volume>
          :
          <fpage>136</fpage>
          -
          <lpage>145</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Mettu</surname>
            <given-names>S R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shivakumar</surname>
            <given-names>V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beek J M</surname>
          </string-name>
          , et al.
          <source>NASGRO 3</source>
          .
          <article-title>0: A software for analyzing aging aircraft</article-title>
          [C]//The Second Joint NASA/FAA/DoD Conference on Aging Aircraft.
          <source>1999 (Pt. 2).</source>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>NASGRO F M. Fatigue Crack Growth Analysis Software</surname>
          </string-name>
          [J].
          <source>Reference manual</source>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Brownlee J. XGBoost</surname>
          </string-name>
          <article-title>With python: Gradient boosted trees with XGBoost and scikit-learn[M]</article-title>
          .
          <source>Machine Learning Mastery</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Noble</surname>
            <given-names>W S.</given-names>
          </string-name>
          <article-title>What is a support vector machine?</article-title>
          [J].
          <source>Nature biotechnology</source>
          ,
          <year>2006</year>
          ,
          <volume>24</volume>
          (
          <issue>12</issue>
          ):
          <fpage>1565</fpage>
          -
          <lpage>1567</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>