<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Hybridizing AI and Domain Knowledge in Nanotechnology: the Example of Surface Roughness Efects on Weting Behavior</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Antonios Stellas</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>George Giannakopoulos</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vassilios Constantoudis</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Wetting roughness, mathematical modeling, Wenzel model, contact</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Mathematics and</institution>
          ,
          <addr-line>Computer Science</addr-line>
          ,
          <institution>Technical University of Eindhoven</institution>
          ,
          <country country="NL">Netherlands</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institute of Informatics and</institution>
          ,
          <addr-line>Telecommunications, NCSR Demokritos, and, SciFY P.N.P.C.</addr-line>
          ,
          <country country="GR">Greece</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Institute of Nanoscience and</institution>
          ,
          <addr-line>Nanotechnology, NCSR Demokritos, and, Nanometrisis P.C.</addr-line>
          ,
          <country country="GR">Greece</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>angle</institution>
          ,
          <addr-line>Machine learning, Artificial Intelligence, Nanotechnology, Rough Surfaces</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper, we propose a scheme for the hybridization of domain modeling and theoretical knowledge in nanotechnology with Artificial Intelligence(AI) techniques and evaluate the success of its application to predict the relationship between nanosurface morphology and wettability. We utilize domain knowledge consisting of two parts. The first part is a mathematical modeling based on the inverse Fourier transform for the generation of rough surfaces with Gaussian or non-Gaussian height distributions, characterized by their first moments (Rms, skewness, kurtosis) and the correlation lengths along x and y-axes. The second part lies in the assumption that the Wenzel scenario for wetting of rough surfaces holds where the critical parameter for contact angle determination is the roughness ratio  , defined as the ratio of true (active) area of the solid surface to the apparent (projected) area. By creating diferent types of surfaces with a variety of input parameters, we create a database linking surface roughness parameters to the ratio  . This database is used to train Machine Learning (ML) models and validate them appropriately. Specifically, we train deep, feed-forward neural networks and random forest models and validate them on a separate (held-out) test dataset. We investigate systematically the amount of input data needed to get accurate predictions on the test data. We also evaluate the importance of diferent input roughness parameters with respect to their efects on surface wettability. To this end, we study the weights that the learning AI models assigned to roughness parameters through training and discuss the findings with respect to experimental expectations.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        Nanostructuring plays a fundamental role in nanotechnology since
it enables new properties and functionalities of material surfaces.
In order to provide a quantitative link between the geometry of
nanostructure morphologies and the induced surface properties, we
ifrst need to find the proper mathematical tools to describe surface
morphology. To this end, several parameters and metrics have been
proposed for the quantitative characterization of nanostructure
morphology. Some of them are more closely linked to the
fabrication process, while others more directly fit to the critical property
of a targeted application. For example, surfaces with stochastic
morphologies (rough surfaces) are widely used in the strong
modification of the wetting behaviour of materials. According to the first
scenario (Wenzel model) for the impact of surface roughness on
wetting and contact angle, the critical parameter of surface
nanoroughness is the roughness ratio  [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], defined as the ratio of true
(active) area of the solid surface to the apparent (projected) area
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. On the other side, the fabrication of surfaces is more usually
related with the surface Rms, correlation length or other surface
height parameters. Furthermore, the measurement of the latter is
more straightforward and accurate with respect to the full active
surface area [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Therefore, an estimated function connecting
fabrication parameter of full active area) (and thus, the roughness ratio
 and contact angle) to roughness parameters (Rms,  and other
moments), can significantly help the fabrication parameter selection
process.
      </p>
      <p>
        Up until now, theoretical modelling and experiments have been
used to quantify these links [
        <xref ref-type="bibr" rid="ref3 ref4 ref5 ref6">3–6</xref>
        ]. However, these methods are
time-consuming and their results are limited to the specific cases
they investigate. Could integrating AI techniques improve the
timeeficiency of these methods, making them more applicable in an
industrial environment? If yes, how can we incorporate domain
knowledge coming from both modeling and experimental results in
AI models to achieve a hybridization of both, improve the accuracy
of results and the success of the AI predictions? By the term domain
knowledge, we refer to the specific scientific area of used data which,
in our case is nanoscience/nanotechnology.
      </p>
      <p>
        During the last decade, several nanotechnology areas have started
to benefit from AI techniques aiming for example, to predict the
properties of new nanomaterials, to enhance microscopy results,
to accelerate simulations, to link manufacturing conditions with
nanostructure morphology and then with the properties and
performance of the nanostructured devices [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Due to the scientific and
technological nature of data in these applications, there is an
increasing need to devise ways to match AI methods with the domain
knowledge, as reported in the relevant theoretical and experimental
works.
      </p>
      <p>The overall aim of this paper is to propose a hybridization scheme
to facilitate the synergy of data-centric methods with domain
modelling and theoretical knowledge concerning the link of the
morphology of nanostructured surfaces with their properties and
functionalities. The key ideas of this approach are: a) to use properly
designed modeling results to train and validate AI methods along
with experimental data when they are available and b) to exploit
the ability of ML techniques to reverse input and output so that
the targeted design of a nanostructured product dictates the choice
of nanostructure geometry and manufacturing conditions. We will
focus on the first idea and we will apply it to evaluate the success of
AI techniques hybridized with domain modeling results to predict
the relationship between nanosurface morphology and wettability.
More specifically, the contributions of the paper can be outlined as
follows:
• An implementation of domain modeling results in AI
techniques is realized for training and validation.
• A comparison of diferent AI models is performed based on
the physical modeling results.
• A study on the numerosity of required data (rough surfaces),
to train suficiently accurate AI methods.
• An estimate of the relative importance of input roughness
parameters on wetting behavior, supported by a discussion
to feed domain decisions and evaluations.</p>
      <p>The paper is structured as follows. We begin, in section 2, with
a presentation of the related recent work and the need to
investigate further the predictive capability of AI techniques in the
nanotechnology applications and specifically wetting behavior. The
mathematical modeling methodology for the generation of rough
surfaces to train AI models as well as the AI techniques used in the
paper is the subject of the section 3. Section 4 presents the results
of AI techniques and their comparison. The paper closes with the
summary in the final section 5.
2</p>
    </sec>
    <sec id="sec-2">
      <title>RELATED WORK</title>
      <p>
        The applications of AI to physical problems and especially materials
science have been studied in many contexts during the last decade,
creating a novel research framework termed ”data-driven materials
science” (Teng Zhou et al., 2019 [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] ). Teng Zhou highlighted the new
opportunities that a data-driven approach can provide in the study
of materials and named it the 4th paradigm in materials science.
The previous three paradigms are assumed to be the empirical,
theoretical and computational ones respectively. In the framework
of this data-driven materials science, several studies have achieved
to make significant contributions to the design and development of
new materials (Sutton, C. et al., 2019 [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]), the prediction of material
properties (electronic, mechanic, thermal,. . . ) (Schütt, K. T. et al.,
2014 [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]) or the evaluation of the importance of manufacturing
and structure parameters on material surface functionalities such
as wetting behavior (Amir Kordijazi et al., 2020 [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]). Furthermore,
a critical question in this framework has been the incorporation of
the domain knowledge of materials science (theoretical concepts
and laws, modeling and simulation results) coming from previous
paradigms in the data-driven algorithms, to avoid significant errors
in the provided predictions. To this end, Sutton, C. et al., (2019) and
Schütt, K. T. et al., (2014) applied the data-driven science paradigm
scheme, using simulation data to train a ML algorithm to predict
faster solid-state properties. On the other hand, in the work of M
Aziar Raissi and George Karniadakis et al. in 2019 [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], we find an
elaborated methodology in which, physical modeling assists the
AI research by exploration of physics-informed neural network
algorithms.
      </p>
      <p>
        Regarding the prediction of wetting behaviour, Amir Kordijazi
et al., (2020) used ML techniques to predict the water contact
angle on surfaces of ductile iron. They used a set of experimental
measurements with input parameters the material composition,
droplet size, the surface grit size and roughness and the time of the
exposure to the liquid. The authors also evaluated the importance
of each input parameter on the value of contact angle and they
justified the primary role of surface roughness determined by the
grit size. However, it was not specified which aspect of surface
roughness is more critical. Given that the roughness of surfaces is
a complex multifaceted phenomenon characterized by a plethora
of parameters, it is worth questioning the relative importance of
roughness parameters on surface wetting behavior. In literature,
one can find interesting results coming from both experimental and
computational approaches exploring the impact of surface
roughness parameters on contact angle and hysteresis [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>In our work, we follow the data-driven approach of the 4th
paradigm of science endowed by theoretical and computational
modelling knowledge of the 2nd and 3rd paradigms. The aim is to
investigate the prediction performance of these AI models on the
efects of roughness parameters on the wetting behavior of solid
surfaces. We assume that the Wenzel model assumption holds: the
contact angle of droplets posed on rough surfaces is determined
by the roughness ratio  and especially the full active surface area.
A hybridization framework is implemented, in which simulated
rough surfaces with a wide spectrum of parameters and appearances
are used to train and evaluate the AI models and explore their
performance versus the simulation cost. We also use the capability
of the developed AI models to reveal each roughness parameter
importance on the observed wetting behavior.
3
3.1</p>
    </sec>
    <sec id="sec-3">
      <title>METHODOLOGY</title>
    </sec>
    <sec id="sec-4">
      <title>Mathematical modelling of rough surface generation.</title>
      <p>
        In this section, we describe the methodology we used for generating
simulated rough surfaces with similar characteristics with a large
variety of experimental ones. These surfaces will be used to enrich
the dataset for training and validating the ML models. We begin
by describing the methodology for generating Gaussian and
nonGaussian surfaces with controlled spatial correlations.
Gaussian surfaces: We produce three-dimensional Gaussian
surfaces by inputting the Rms of the height distribution and the
correlation lengths along the x and y axes ( and  ) of the surface
(Table 1). (Figure 1a and 1b). The heights of the generated surfaces
are calculated on a square lattice  ×  points and area   ×  .
Therefore, the spacing in x and y direction corresponds the ratio:
 . The methodology for simulating the Gaussian surfaces is
ba−s1ed on the work of Garcia et.al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. First, we produce a white
Surface Type Input Parameters Output Parameter
      </p>
      <p>Gaussian Rms,  ,  active area</p>
      <p>Non-Gaussian Rms,  , , skewness, kurtosis active area
Table 1: Input parameters used for generating the active
area of Gaussian and non-Gaussian surfaces. The input
(roughness) parameters for the Gaussian consist of the Rms
heights and the correlation lengths () in x and y directions.
For the Non-Gaussian surfaces the inputs include
additionally the skewness and kurtosis.
noise   distribution with mean value zero and standard
deviation equal to the input Rms value. By applying the Gaussian filter
 , described in the following equation (Eq. 1) to the distribution
we add the desired correlations along x and y axis.</p>
      <p>2 2
 =  (−(2 2 + 2 2 ))
(1)</p>
      <p>
        Then, we take the Inverse Fourier Transform of the product of
Fourier transforms of   and  and multiply with normalization
factors to generate correlated isotropic and anisotropic Gaussian
surfaces After producing the surface, their Rms,  and  are
compared with the inputs to check possible divergence. The divergence
is related with limitations imposed by the discrete sampling and
ifnite range of surfaces. In such case, the algorithm of the surface
generation is repeated until the input parameters are converged.
Non-Gaussian surfaces: The method we used to model non-Gaussian
surfaces is based on the work of Yang et al. [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] where, the
Johnson and Pearson transformations systems are used to transform
random Gaussian noise with specified average height and Rms into
non-Gaussian noise with user-defined skewness, and kurtosis
(Table 1). The steps of the method are as follows: First, we generate
a random two-dimensional non-Gaussian noise via Johnson
transform system giving as input parameters the first four statistical
moments (mean, Rms, skewness and kurtosis). If the distribution
parameters cannot converge, we use the Pearson transformation
system. Then, we measure the skewness and kurtosis of the surface
to satisfy the chosen precision conditions. If the conditions are not
met, we repeat the generation of the surface and validation. Finally,
the surface becomes correlated by reconstructing and rearranging
the height sequence in the x and y directions imitating a known
Gaussian correlated surface with correlation lengths  and 
along axes x and y respectively. Yang’s method is characterized by
its eficiency as diferent internal fitting methods are used for
convergence. Thus, we can create surfaces with skewness and kurtosis
inputs that can successfully (with low error) cover every point in
the skewness-kurtosis plane  − 2 − 1 ≥ 0.
      </p>
      <p>Subsequently, the active area was measured by integrating the
secant of the angle  between the surface normal and its z-direction
normal.</p>
      <p>∫ ∫
 =
∫ ∫
 ( )
(2)</p>
      <p>Where,  is defined as the angle that the z-axis makes with the
normal vector of the diferential surface dA.
In this section, we outline the AI methods we utilized in this work.
We begin by overviewing the main points describing the methods
and then we align these descriptions with our work.</p>
      <p>A physical model is a domain-driven model that is created through
functions that follow underlying physical laws to predict a property.
When using those models, the relation between the input
parameters and the output value is already known and used for predictions.
A ML model is a data-driven model that is used to find an
appropriate (originally unknown) function that reflects the connection of
an input (given) property to an output property that we want to
predict. Even though the actual relation is unknown, given
suficient values for input and output values from experiments, we can
create a ML (statistical) model that approximates the relation. Along
with pre-existing human expertise, the approximations could add
value to the manufacturer that aims to predict physical properties,
without the need to exhaustively perform experiments, given the
cost of such experiments in time and money.</p>
      <p>
        In this work, we are using three ML models: 1) Linear
Regression 2) Random Forests 3) Neural Networks. We try these diferent
families of learning, since this problem has no prior indication of
the underlying input-output relation, which may afect the method
selection. For example, linear regression models will assume a
linear relationship between the input and the output. On the other
hand, Random forests and Neural network models use diferent
methodologies to approximate/learn non-linear relationships
between (even high dimensional) input parameters and a predicted
output property. We stress that there is no single, overall better
method for all estimation problems. This statement is better known
as the "no-free-lunch theorem" [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
      </p>
      <p>
        Linear regression models [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] (or in our case a multiple linear
regression model) approximate the relationship between the input
space and the output variable by fitting a linear equation. Those
models are characterized by their simplicity and speed and are
oftentimes used as a benchmark, to allow comparison to other,
more complex models.
      </p>
      <p>
        Random Forests [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] are ensemble methods that use multiple
decision trees. A decision tree partitions the input space into
subspaces and maps each subspace to a predicted output value. The
division of the space is done using training (example) data from
the dataset. The algorithm searches for an appropriate partitioning,
which reduces the error of the predicted value across all training
examples. Normally, each decision tree is fed by all the available
training data. However, in the Random forest approach, the learning
creates several trees, each applied on a randomly selected subset of
the full training data. The model predicts the output by considering
all of the predictions of each decision tree. This approach has been
shown to be more efective and also generalize better with respect
to unseen examples [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
      </p>
      <p>
        Neural Networks (NNs) are models that learn complex functions,
combined in a non-linear manner over several layers. As such, NNs
have been shown to be excellent approximators of many families
of functions [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], meaning that they can mimic many underlying
functions very eficiently, when given enough training data. NNs
include neurons (or layers of nodes) that are combined to solve
complex problems. A NN consists of an input layer, one or more
hidden layers and an output layer. The input layer is fed a
representation of the independent variables that (we expect) define the
output. The output layer is expected to deliver the estimation of
the output, dependent variable. Intermediate layers of nodes take
as input the output of previous layers and transform it (i.e. apply a
function on it). In essence, each node of a NN is a linear function of
its input, passed through a non-linear operator. The intermediate
layers end up forming a (sometimes very complex) function that
connects input to output.
      </p>
      <p>NN models with many layers are called Deep Neural Networks.
Such a network may contain millions of parameters, identifying
the function the network represents. No matter whether we have a
deep network or not, these parameters are optimized to minimize
the estimation error. In other words, we take each training example,
provide it as input to the network and get its output prediction.
Based on the real value it should have output, we change (i.e.
optimize) the parameters of the network to minimize the prediction
error. A number of optimization methods can be used to infer these
parameters from the training data, the most well-known being
back-propagation.</p>
      <p>
        There is no a-priori best way to create a NN. However, standard
practices can be applied to create such an architecture [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ].
Generally, a Neural Network can be made deeper by adding more hidden
layers or more nodes, that however may lead the function to
overiftting, i.e. reducing generalization ability to unseen input, which
in turns increases the error when using the network for prediction.
Thus, the definition of an architecture can be a challenge in itself.
However, techniques such as dropout [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] have been proved to be
efective.
(3)
(4)
      </p>
      <p>Linear regression models are faster that both Neural networks
and Random Forests but have less accuracy in cases where
nonlinear inner dependencies appear in data. Both, Neural Networks
(NN) and Random Forests ofer good levels of performance in
different application areas. However, diferent methods ofer diferent
potential for learning and approximating, coupled with diferent
processing requirements (in time and memory). Random forests
training costs less time (when compared to NNs in a generic setting)
and after training, they can be more interpretable than the average
NN. On the other hand, the accuracy a neural network can reach
is higher, if we have access to the required volume and diversity
of data (more data is usually needed to tune more parameters).
Essentially, the selection of training data is a defining factor for ML,
beyond the algorithms and the corresponding architecture. These
data should satisfy three basic requirements in order to make a good
approximation through a ML model. Those requirement are related
to the: 1) quantity 2) diversity 3) quality of data. Regarding our
work, we aimed to satisfy these three requirements while creating
the training and testing datasets before using the models.
By applying the methodology described in section 3.1, we
generated two databases for the training and the validation of the ML
models respectively. The databases consist of surfaces with diverse
combinations of roughness parameters (Rms, correlation lengths,
Skewness and Kurtosis) and the corresponding functional
parameter (active area). The distribution of the training and validation
datasets are shown in Figure 2 a) and b):</p>
      <p>The total volume of training data-surfaces has been 3000 surfaces
while for validating reached 15000. We trained each model with
diferent percentages of surfaces from the train dataset to examine
the efects of the training data size on model success. Also, for every
percentage of the training set, we applied the training procedure
10 times with randomly selected surfaces from the database.
4.2</p>
    </sec>
    <sec id="sec-5">
      <title>Model evaluation metric</title>
      <p>For the validation of the predictability of models, we evaluated
the RMSRE (Root Mean Square Relative Error). While the RMSE
(Root Mean Square Error) can indicate successfully the appearance
of outliers, the relative value of RMSE has no units. RMSE is the
squared root error of the average predicted active area  from the
average actual active area  .</p>
      <p>=</p>
      <p />
    </sec>
    <sec id="sec-6">
      <title>Machine Learning Results</title>
      <sec id="sec-6-1">
        <title>Linear Regression.</title>
        <p>We trained a set of Linear Regression models as a basis
comparison with the rest of models. Figure 3 shows the RMSRE of the
trained models within a volume range of training data (surfaces).
The linear models reached a plateau of RMSRE 9.6% using the
validation dataset after approximately 100 training data points-surfaces.
The model predictions diverge from the true values for high and
low active area values, as seen in Figure 4.</p>
      </sec>
      <sec id="sec-6-2">
        <title>Random Forests.</title>
        <p>We used random forest estimators fitting 25 regression decision
trees each. Each tree averages the results to improve the predictive
accuracy and control over-fitting. The decision trees used for
training had a maximum depth of 25 branches with a minimum number
of internal node splits of two and a mean squared error criterion
for splitting. Random Forest models outperforms the Linear
Regression models after approximately 120 data-surfaces (Figure 3) and
achieved less than 4.0% RMSRE after 2100 training data-surfaces.
The standard deviation of the RMSRE decreases as the training
dataset becomes larger. Figure 5 shows that the models were able
to predict the high and low active areas of the surfaces with less
error as compared to the the linear regression models. However,
there is still a significant amount of error for high active areas.</p>
        <p>
          We used a set of neural network (NN) and a deep neural network
(DNN) models for the prediction of active areas. NN models were
trained using the adam [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ] solver with a five-layer architecture
consisting of 15,25,40,25,15 nodes respectively. The DNN models
were trained through rmsprop optimizer with a two-layer
architecture consisting of 200 nodes each. To overcome any overfitting
tendency, a 50% dropout was established between the two layers
and an additive zero-centered Gaussian noise of  = 1 was added
to the last layer. Both models had activation function the Rectified
Linear unit (ReLu).
        </p>
        <p>
          For training datasets with a size of less than 300 surfaces, the NNs
performed worse as compared to the linear regression models
(Figure 3). After 300 surfaces, they outperformed the linear regression
models reaching a  = 6%. It is noted that in this case as
well, the standard deviation increases with the size of the training
data. The performance of DNNs is comparable to the Random forest
model. For 2700 surfaces of training data, the DNN models show
an average error of  = 2% (Figure 6). Therefore, the DNNs
generally show a significantly lower error compared to the NNs.
By taking the average of the absolute value of the weights between
the input layer and the first layer of the DNN, we can identify the
most important features of the model. Figure 7 shows that Rms has
the highest weight intensity, followed by the correlation lengths in
x and y axis ( and  resp). This results is in harmony with
previous findings based on computational analysis and experimental
measurements [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]
        </p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>5 SUMMARY</title>
      <p>
        In our work, we proposed a hybridization scheme that combines
data-driven methodologies (4th paradigm [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]) with domain
theoretical and modeling knowledge as an alternative (2nd and 3rd
paradigm) link between the configuration of nanosurface rough
morphology and prediction of wetting behaviour.
      </p>
      <p>In particular, we trained Linear Regression, Random Forest,
Neural Network and Deep Neural Network models with simulated
nanosurfaces in order to predict the true (active) area of the surface,
a critical parameter for wetting when Wenzel model is assumed. We
then compared their performances in relation to the required data
(simulation cost to produce rough surfaces). Random Forests and
Deep Neural Networks showed the highest performance reaching
4 % of RMSRE after 1000 training data-surfaces. The models and
particularly the Deep Neural Networks indicate that Rms has the
highest importance in wetting behavior. The correlation lengths in
the x and y axis showed lower but significant importance as well
whereas skewness and kurtosis play a minor though detectable
role.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Y.-P.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <article-title>Characterization of Amorphous and Crystalline Rough Surface: Principles</article-title>
          and Applications. San Diego, CA: Elsevier,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R. N.</given-names>
            <surname>Wenzel</surname>
          </string-name>
          , “
          <article-title>Resistance of solid surfaces to wetting by water</article-title>
          ,
          <source>” Industrial &amp; Engineering Chemistry</source>
          , vol.
          <volume>28</volume>
          , no.
          <issue>8</issue>
          , pp.
          <fpage>988</fpage>
          -
          <lpage>994</lpage>
          ,
          <year>1936</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>L.</given-names>
            <surname>Lai</surname>
          </string-name>
          and
          <string-name>
            <given-names>E. A.</given-names>
            <surname>Irene</surname>
          </string-name>
          , “
          <article-title>Area evaluation of microscopically rough surfaces</article-title>
          ,
          <source>” Journal of Vacuum Science &amp; Technology B: Microelectronics and Nanometer Structures Processing, Measurement, and Phenomena</source>
          , vol.
          <volume>17</volume>
          , no.
          <issue>1</issue>
          , pp.
          <fpage>33</fpage>
          -
          <lpage>39</lpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>F.</given-names>
            <surname>Foadi</surname>
          </string-name>
          ,
          <string-name>
            <surname>G. H.</surname>
          </string-name>
          <article-title>ten</article-title>
          <string-name>
            <surname>Brink</surname>
            ,
            <given-names>M. R.</given-names>
          </string-name>
          <string-name>
            <surname>Mohammadizadeh</surname>
          </string-name>
          , and G. Palasantzas, “
          <article-title>Roughness dependent wettability of sputtered copper thin films: The efect of the local surface slope</article-title>
          ,
          <source>” Journal of Applied Physics</source>
          , vol.
          <volume>125</volume>
          , no.
          <issue>24</issue>
          , p.
          <fpage>244307</fpage>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M. B.</given-names>
            <surname>Stearns</surname>
          </string-name>
          , “
          <article-title>Magnetization and structure of mn-ni and mn-co layered magnetic thin films</article-title>
          ,
          <source>” Journal of Applied Physics</source>
          , vol.
          <volume>53</volume>
          , no.
          <issue>3</issue>
          , pp.
          <fpage>2436</fpage>
          -
          <lpage>2438</lpage>
          ,
          <year>1982</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Y. Zhang,</surname>
          </string-name>
          <article-title>The efect of surface roughness parameters on contact and wettability of solid surfaces</article-title>
          .
          <source>PhD thesis</source>
          , Iowa State University,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>G. M.</given-names>
            <surname>Sacha</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Varona</surname>
          </string-name>
          , “Artificial intelligence in nanotechnology,” Nanotechnology, vol.
          <volume>24</volume>
          , p.
          <volume>452002</volume>
          , oct
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>T.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Song</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Sundmacher</surname>
          </string-name>
          , “
          <article-title>Big data creates new opportunities for materials research: A review on methods and applications of machine learning for materials design</article-title>
          ,” Engineering, vol.
          <volume>5</volume>
          , no.
          <issue>6</issue>
          , pp.
          <fpage>1017</fpage>
          -
          <lpage>1026</lpage>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>C.</given-names>
            <surname>Sutton</surname>
          </string-name>
          , “
          <article-title>Crowd-sourcing materials-science challenges with the nomad 2018 kaggle competition</article-title>
          ,
          <source>” npj Computational Materials</source>
          , vol.
          <volume>5</volume>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>K. T. Schütt</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Glawe</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Brockherde</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Sanna</surname>
            ,
            <given-names>K. R.</given-names>
          </string-name>
          <string-name>
            <surname>Müller</surname>
          </string-name>
          , and E. K. U. Gross, “
          <article-title>How to represent crystal structures for machine learning: Towards fast prediction of electronic properties,” Physical Review B</article-title>
          , vol.
          <volume>89</volume>
          , May
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kordijazi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. M.</given-names>
            <surname>Roshan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Dhingra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Povolo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. K.</given-names>
            <surname>Rohatgi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Nosonovsky</surname>
          </string-name>
          , “
          <article-title>Machine-learning methods to predict the wetting properties of iron-based composites</article-title>
          ,
          <source>” Surface Innovations</source>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>9</lpage>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Raissi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Perdikaris</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G. E.</given-names>
            <surname>Karniadakis</surname>
          </string-name>
          , “
          <article-title>Physics informed deep learning (part i): Data-driven solutions of nonlinear partial diferential equations</article-title>
          ,”
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>N.</given-names>
            <surname>Garcia</surname>
          </string-name>
          and E. Stoll, “
          <article-title>Monte carlo calculation for electromagnetic-wave scattering from random rough surfaces</article-title>
          ,
          <source>” Phys. Rev. Lett.</source>
          , vol.
          <volume>52</volume>
          , pp.
          <fpage>1798</fpage>
          -
          <lpage>1801</lpage>
          , May
          <year>1984</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>G.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Hong</surname>
          </string-name>
          , “
          <article-title>Numerical simulation of 3d rough surfaces and analysis of interfacial contact characteristics,” CMES - Computer Modeling in Engineering and Sciences</article-title>
          , vol.
          <volume>103</volume>
          , pp.
          <fpage>251</fpage>
          -
          <lpage>279</lpage>
          ,
          <year>12 2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>D. H.</given-names>
            <surname>Wolpert</surname>
          </string-name>
          , “
          <article-title>The lack of a priori distinctions between learning algorithms</article-title>
          ,
          <source>” Neural Computation</source>
          , vol.
          <volume>8</volume>
          , no.
          <issue>7</issue>
          , pp.
          <fpage>1341</fpage>
          -
          <lpage>1390</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>D. W. Gareth</given-names>
            <surname>James</surname>
          </string-name>
          and
          <string-name>
            <given-names>H.</given-names>
            <surname>Tibshirani</surname>
          </string-name>
          , An Introduction to Statistical Learning pp
          <fpage>59</fpage>
          -
          <lpage>126</lpage>
          and pp. New York, NY: Springer,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>Tin</given-names>
            <surname>Kam</surname>
          </string-name>
          <string-name>
            <surname>Ho</surname>
          </string-name>
          , “
          <article-title>Random decision forests,”</article-title>
          <source>in Proceedings of 3rd International Conference on Document Analysis and Recognition</source>
          , vol.
          <volume>1</volume>
          , pp.
          <fpage>278</fpage>
          -
          <lpage>282</lpage>
          vol.
          <volume>1</volume>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>Tin</given-names>
            <surname>Kam</surname>
          </string-name>
          <string-name>
            <surname>Ho</surname>
          </string-name>
          , “
          <article-title>The random subspace method for constructing decision forests,”</article-title>
          <source>IEEE Transactions on Pattern Analysis and Machine Intelligence</source>
          , vol.
          <volume>20</volume>
          , no.
          <issue>8</issue>
          , pp.
          <fpage>832</fpage>
          -
          <lpage>844</lpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Nielsen</surname>
          </string-name>
          , “
          <article-title>Neural networks and deep learning</article-title>
          ,”
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>P.</given-names>
            <surname>Benardos and G.-C. Vosniakos</surname>
          </string-name>
          , “
          <article-title>Optimizing feedforward artificial neural network architecture,”</article-title>
          <source>Engineering Applications of Artificial Intelligence</source>
          , vol.
          <volume>20</volume>
          , no.
          <issue>3</issue>
          , pp.
          <fpage>365</fpage>
          -
          <lpage>382</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>N.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Hinton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Krizhevsky</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Sutskever</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Salakhutdinov</surname>
          </string-name>
          , “
          <article-title>Dropout: A simple way to prevent neural networks from overfitting,”</article-title>
          <string-name>
            <given-names>J.</given-names>
            <surname>Mach</surname>
          </string-name>
          .
          <source>Learn. Res.</source>
          , vol.
          <volume>15</volume>
          , p.
          <fpage>1929</fpage>
          -
          <lpage>1958</lpage>
          , Jan.
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>D. P.</given-names>
            <surname>Kingma</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Ba</surname>
          </string-name>
          , “
          <article-title>Adam: A method for stochastic optimization</article-title>
          ,”
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>