<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Hybridizing AI and Domain Knowledge in Nanotechnology: the Example of Surface Roughness Effects on Wetting Behavior</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Antonios</forename><surname>Stellas</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Mathematics and Computer Science</orgName>
								<orgName type="institution">Technical University of Eindhoven Netherlands</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Scify</forename><forename type="middle">P N P C</forename><surname>Greece</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">George Giannakopoulos Institute of Informatics and Telecommunications</orgName>
								<orgName type="institution">NCSR Demokritos</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Vassilios</forename><surname>Constantoudis</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">George Giannakopoulos Institute of Informatics and Telecommunications</orgName>
								<orgName type="institution">NCSR Demokritos</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Nanometrisis</forename><forename type="middle">P C</forename><surname>Greece</surname></persName>
							<affiliation key="aff2">
								<orgName type="department">Institute of Nanoscience and Nanotechnology</orgName>
								<orgName type="institution">NCSR Demokritos</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Hybridizing AI and Domain Knowledge in Nanotechnology: the Example of Surface Roughness Effects on Wetting Behavior</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">5C274F22619280F8B93B04E1A1BF4682</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T00:01+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Wetting roughness</term>
					<term>mathematical modeling</term>
					<term>Wenzel model</term>
					<term>contact angle</term>
					<term>Machine learning</term>
					<term>Artificial Intelligence</term>
					<term>Nanotechnology</term>
					<term>Rough Surfaces</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this paper, we propose a scheme for the hybridization of domain modeling and theoretical knowledge in nanotechnology with Artificial Intelligence(AI) techniques and evaluate the success of its application to predict the relationship between nanosurface morphology and wettability. We utilize domain knowledge consisting of two parts. The first part is a mathematical modeling based on the inverse Fourier transform for the generation of rough surfaces with Gaussian or non-Gaussian height distributions, characterized by their first moments (Rms, skewness, kurtosis) and the correlation lengths along x and y-axes. The second part lies in the assumption that the Wenzel scenario for wetting of rough surfaces holds where the critical parameter for contact angle determination is the roughness ratio 𝑟 , defined as the ratio of true (active) area of the solid surface to the apparent (projected) area. By creating different types of surfaces with a variety of input parameters, we create a database linking surface roughness parameters to the ratio 𝑟 . This database is used to train Machine Learning (ML) models and validate them appropriately. Specifically, we train deep, feed-forward neural networks and random forest models and validate them on a separate (held-out) test dataset. We investigate systematically the amount of input data needed to get accurate predictions on the test data. We also evaluate the importance of different input roughness parameters with respect to their effects on surface wettability. To this end, we study the weights that the learning AI models assigned to roughness parameters through training and discuss the findings with respect to experimental expectations.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">INTRODUCTION</head><p>Nanostructuring plays a fundamental role in nanotechnology since it enables new properties and functionalities of material surfaces. In order to provide a quantitative link between the geometry of nanostructure morphologies and the induced surface properties, we first need to find the proper mathematical tools to describe surface morphology. To this end, several parameters and metrics have been proposed for the quantitative characterization of nanostructure morphology. Some of them are more closely linked to the fabrication process, while others more directly fit to the critical property of a targeted application. For example, surfaces with stochastic morphologies (rough surfaces) are widely used in the strong modification of the wetting behaviour of materials. According to the first scenario (Wenzel model) for the impact of surface roughness on wetting and contact angle, the critical parameter of surface nanoroughness is the roughness ratio 𝑟 <ref type="bibr" target="#b0">[1]</ref>, defined as the ratio of true (active) area of the solid surface to the apparent (projected) area <ref type="bibr" target="#b1">[2]</ref>. On the other side, the fabrication of surfaces is more usually related with the surface Rms, correlation length or other surface height parameters. Furthermore, the measurement of the latter is more straightforward and accurate with respect to the full active surface area <ref type="bibr" target="#b2">[3]</ref>. Therefore, an estimated function connecting fabrication parameter of full active area) (and thus, the roughness ratio 𝑟 and contact angle) to roughness parameters (Rms, 𝜉 and other moments), can significantly help the fabrication parameter selection process.</p><p>Up until now, theoretical modelling and experiments have been used to quantify these links <ref type="bibr" target="#b2">[3]</ref><ref type="bibr" target="#b3">[4]</ref><ref type="bibr" target="#b4">[5]</ref><ref type="bibr" target="#b5">[6]</ref>. However, these methods are time-consuming and their results are limited to the specific cases they investigate. Could integrating AI techniques improve the timeefficiency of these methods, making them more applicable in an industrial environment? If yes, how can we incorporate domain knowledge coming from both modeling and experimental results in AI models to achieve a hybridization of both, improve the accuracy of results and the success of the AI predictions? By the term domain knowledge, we refer to the specific scientific area of used data which, in our case is nanoscience/nanotechnology.</p><p>During the last decade, several nanotechnology areas have started to benefit from AI techniques aiming for example, to predict the properties of new nanomaterials, to enhance microscopy results, to accelerate simulations, to link manufacturing conditions with nanostructure morphology and then with the properties and performance of the nanostructured devices <ref type="bibr" target="#b6">[7]</ref>. Due to the scientific and technological nature of data in these applications, there is an increasing need to devise ways to match AI methods with the domain knowledge, as reported in the relevant theoretical and experimental works.</p><p>The overall aim of this paper is to propose a hybridization scheme to facilitate the synergy of data-centric methods with domain modelling and theoretical knowledge concerning the link of the morphology of nanostructured surfaces with their properties and functionalities. The key ideas of this approach are: a) to use properly designed modeling results to train and validate AI methods along with experimental data when they are available and b) to exploit the ability of ML techniques to reverse input and output so that the targeted design of a nanostructured product dictates the choice of nanostructure geometry and manufacturing conditions. We will focus on the first idea and we will apply it to evaluate the success of AI techniques hybridized with domain modeling results to predict the relationship between nanosurface morphology and wettability. More specifically, the contributions of the paper can be outlined as follows:</p><p>• An implementation of domain modeling results in AI techniques is realized for training and validation. • A comparison of different AI models is performed based on the physical modeling results. • A study on the numerosity of required data (rough surfaces), to train sufficiently accurate AI methods. • An estimate of the relative importance of input roughness parameters on wetting behavior, supported by a discussion to feed domain decisions and evaluations.</p><p>The paper is structured as follows. We begin, in section 2, with a presentation of the related recent work and the need to investigate further the predictive capability of AI techniques in the nanotechnology applications and specifically wetting behavior. The mathematical modeling methodology for the generation of rough surfaces to train AI models as well as the AI techniques used in the paper is the subject of the section 3. Section 4 presents the results of AI techniques and their comparison. The paper closes with the summary in the final section 5.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">RELATED WORK</head><p>The applications of AI to physical problems and especially materials science have been studied in many contexts during the last decade, creating a novel research framework termed "data-driven materials science" (Teng <ref type="bibr">Zhou et al., 2019 [8]</ref> ). Teng Zhou highlighted the new opportunities that a data-driven approach can provide in the study of materials and named it the 4th paradigm in materials science. The previous three paradigms are assumed to be the empirical, theoretical and computational ones respectively. In the framework of this data-driven materials science, several studies have achieved to make significant contributions to the design and development of new materials <ref type="bibr">(Sutton, C. et al., 2019 [9]</ref>), the prediction of material properties (electronic, mechanic, thermal,. . . ) (Schütt, K. T. et al., 2014 <ref type="bibr" target="#b9">[10]</ref>) or the evaluation of the importance of manufacturing and structure parameters on material surface functionalities such as wetting behavior (Amir <ref type="bibr">Kordijazi et al., 2020 [11]</ref>). Furthermore, a critical question in this framework has been the incorporation of the domain knowledge of materials science (theoretical concepts and laws, modeling and simulation results) coming from previous paradigms in the data-driven algorithms, to avoid significant errors in the provided predictions. To this end, Sutton, C. et al., (2019) and Schütt, K. T. et al., (2014) applied the data-driven science paradigm scheme, using simulation data to train a ML algorithm to predict faster solid-state properties. On the other hand, in the work of M Aziar Raissi and George Karniadakis et al. in 2019 <ref type="bibr" target="#b11">[12]</ref>, we find an elaborated methodology in which, physical modeling assists the AI research by exploration of physics-informed neural network algorithms.</p><p>Regarding the prediction of wetting behaviour, Amir Kordijazi et al., (2020) used ML techniques to predict the water contact angle on surfaces of ductile iron. They used a set of experimental measurements with input parameters the material composition, droplet size, the surface grit size and roughness and the time of the exposure to the liquid. The authors also evaluated the importance of each input parameter on the value of contact angle and they justified the primary role of surface roughness determined by the grit size. However, it was not specified which aspect of surface roughness is more critical. Given that the roughness of surfaces is a complex multifaceted phenomenon characterized by a plethora of parameters, it is worth questioning the relative importance of roughness parameters on surface wetting behavior. In literature, one can find interesting results coming from both experimental and computational approaches exploring the impact of surface roughness parameters on contact angle and hysteresis <ref type="bibr" target="#b3">[4]</ref>.</p><p>In our work, we follow the data-driven approach of the 4th paradigm of science endowed by theoretical and computational modelling knowledge of the 2nd and 3rd paradigms. The aim is to investigate the prediction performance of these AI models on the effects of roughness parameters on the wetting behavior of solid surfaces. We assume that the Wenzel model assumption holds: the contact angle of droplets posed on rough surfaces is determined by the roughness ratio 𝑟 and especially the full active surface area. A hybridization framework is implemented, in which simulated rough surfaces with a wide spectrum of parameters and appearances are used to train and evaluate the AI models and explore their performance versus the simulation cost. We also use the capability of the developed AI models to reveal each roughness parameter importance on the observed wetting behavior.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">METHODOLOGY 3.1 Mathematical modelling of rough surface generation.</head><p>In this section, we describe the methodology we used for generating simulated rough surfaces with similar characteristics with a large variety of experimental ones. These surfaces will be used to enrich the dataset for training and validating the ML models. We begin by describing the methodology for generating Gaussian and non-Gaussian surfaces with controlled spatial correlations.</p><p>Gaussian surfaces: We produce three-dimensional Gaussian surfaces by inputting the Rms of the height distribution and the correlation lengths along the x and y axes (𝜉 𝑥 and 𝜉 𝑦 ) of the surface (Table <ref type="table" target="#tab_0">1</ref>). (Figure <ref type="figure" target="#fig_0">1a and 1b</ref>). The heights of the generated surfaces are calculated on a square lattice 𝑁 × 𝑁 points and area 𝑟𝐿 × 𝑟𝐿. Therefore, the spacing in x and y direction corresponds the ratio:</p><formula xml:id="formula_0">𝑟 𝐿 𝑁 −1 .</formula><p>The methodology for simulating the Gaussian surfaces is based on the work of Garcia et.al. <ref type="bibr" target="#b12">[13]</ref>. First, we produce a white For the Non-Gaussian surfaces the inputs include additionally the skewness and kurtosis.</p><p>noise 𝑍 𝑊 𝐺 distribution with mean value zero and standard deviation equal to the input Rms value. By applying the Gaussian filter 𝐹 𝐺 , described in the following equation (Eq. 1) to the distribution we add the desired correlations along x and y axis.</p><formula xml:id="formula_1">𝐹 𝐺 = 𝑒𝑥𝑝 (−(<label>2</label></formula><formula xml:id="formula_2">𝑥 2 𝜉 2 𝑥 + 2 𝑦 2 𝜉 2 𝑦 ))<label>(1)</label></formula><p>Then, we take the Inverse Fourier Transform of the product of Fourier transforms of 𝑍 𝑊 𝐺 and 𝐹 𝐺 and multiply with normalization factors to generate correlated isotropic and anisotropic Gaussian surfaces After producing the surface, their Rms, 𝜉 𝑥 and 𝜉 𝑦 are compared with the inputs to check possible divergence. The divergence is related with limitations imposed by the discrete sampling and finite range of surfaces. In such case, the algorithm of the surface generation is repeated until the input parameters are converged.</p><p>Non-Gaussian surfaces: The method we used to model non-Gaussian surfaces is based on the work of Yang et al. <ref type="bibr" target="#b13">[14]</ref> where, the Johnson and Pearson transformations systems are used to transform random Gaussian noise with specified average height and Rms into non-Gaussian noise with user-defined skewness, and kurtosis (Table 1). The steps of the method are as follows: First, we generate a random two-dimensional non-Gaussian noise via Johnson transform system giving as input parameters the first four statistical moments (mean, Rms, skewness and kurtosis). If the distribution parameters cannot converge, we use the Pearson transformation system. Then, we measure the skewness and kurtosis of the surface to satisfy the chosen precision conditions. If the conditions are not met, we repeat the generation of the surface and validation. Finally, the surface becomes correlated by reconstructing and rearranging the height sequence in the x and y directions imitating a known Gaussian correlated surface with correlation lengths 𝜉 𝑥 and 𝜉 𝑦 along axes x and y respectively. Yang's method is characterized by its efficiency as different internal fitting methods are used for convergence. Thus, we can create surfaces with skewness and kurtosis inputs that can successfully (with low error) cover every point in the skewness-kurtosis plane 𝐾𝑢 − 𝑆𝑘 2 − 1 ≥ 0.</p><p>Subsequently, the active area was measured by integrating the secant of the angle 𝛾 between the surface normal and its z-direction normal.</p><p>∫</p><formula xml:id="formula_3">∫ 𝑑𝐴 = ∫ ∫ 𝑠𝑒𝑐 (𝛾)𝑑𝑥𝑑𝑦<label>(2)</label></formula><p>Where, 𝛾 is defined as the angle that the z-axis makes with the normal vector of the differential surface dA. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">AI techniques</head><p>In this section, we outline the AI methods we utilized in this work. We begin by overviewing the main points describing the methods and then we align these descriptions with our work.</p><p>A physical model is a domain-driven model that is created through functions that follow underlying physical laws to predict a property. When using those models, the relation between the input parameters and the output value is already known and used for predictions. A ML model is a data-driven model that is used to find an appropriate (originally unknown) function that reflects the connection of an input (given) property to an output property that we want to predict. Even though the actual relation is unknown, given sufficient values for input and output values from experiments, we can create a ML (statistical) model that approximates the relation. Along with pre-existing human expertise, the approximations could add value to the manufacturer that aims to predict physical properties, without the need to exhaustively perform experiments, given the cost of such experiments in time and money.</p><p>In this work, we are using three ML models: 1) Linear Regression 2) Random Forests 3) Neural Networks. We try these different families of learning, since this problem has no prior indication of the underlying input-output relation, which may affect the method selection. For example, linear regression models will assume a linear relationship between the input and the output. On the other hand, Random forests and Neural network models use different methodologies to approximate/learn non-linear relationships between (even high dimensional) input parameters and a predicted output property. We stress that there is no single, overall better method for all estimation problems. This statement is better known as the "no-free-lunch theorem" <ref type="bibr" target="#b14">[15]</ref>.</p><p>Linear regression models <ref type="bibr" target="#b15">[16]</ref> (or in our case a multiple linear regression model) approximate the relationship between the input space and the output variable by fitting a linear equation. Those models are characterized by their simplicity and speed and are oftentimes used as a benchmark, to allow comparison to other, more complex models.</p><p>Random Forests <ref type="bibr" target="#b16">[17]</ref> are ensemble methods that use multiple decision trees. A decision tree partitions the input space into subspaces and maps each subspace to a predicted output value. The division of the space is done using training (example) data from the dataset. The algorithm searches for an appropriate partitioning, which reduces the error of the predicted value across all training examples. Normally, each decision tree is fed by all the available training data. However, in the Random forest approach, the learning creates several trees, each applied on a randomly selected subset of the full training data. The model predicts the output by considering all of the predictions of each decision tree. This approach has been shown to be more effective and also generalize better with respect to unseen examples <ref type="bibr" target="#b17">[18]</ref>.</p><p>Neural Networks (NNs) are models that learn complex functions, combined in a non-linear manner over several layers. As such, NNs have been shown to be excellent approximators of many families of functions <ref type="bibr" target="#b18">[19]</ref>, meaning that they can mimic many underlying functions very efficiently, when given enough training data. NNs include neurons (or layers of nodes) that are combined to solve complex problems. A NN consists of an input layer, one or more hidden layers and an output layer. The input layer is fed a representation of the independent variables that (we expect) define the output. The output layer is expected to deliver the estimation of the output, dependent variable. Intermediate layers of nodes take as input the output of previous layers and transform it (i.e. apply a function on it). In essence, each node of a NN is a linear function of its input, passed through a non-linear operator. The intermediate layers end up forming a (sometimes very complex) function that connects input to output.</p><p>NN models with many layers are called Deep Neural Networks. Such a network may contain millions of parameters, identifying the function the network represents. No matter whether we have a deep network or not, these parameters are optimized to minimize the estimation error. In other words, we take each training example, provide it as input to the network and get its output prediction. Based on the real value it should have output, we change (i.e. optimize) the parameters of the network to minimize the prediction error. A number of optimization methods can be used to infer these parameters from the training data, the most well-known being back-propagation.</p><p>There is no a-priori best way to create a NN. However, standard practices can be applied to create such an architecture <ref type="bibr" target="#b19">[20]</ref>. Generally, a Neural Network can be made deeper by adding more hidden layers or more nodes, that however may lead the function to overfitting, i.e. reducing generalization ability to unseen input, which in turns increases the error when using the network for prediction. Thus, the definition of an architecture can be a challenge in itself. However, techniques such as dropout <ref type="bibr" target="#b20">[21]</ref> have been proved to be effective.</p><p>Linear regression models are faster that both Neural networks and Random Forests but have less accuracy in cases where nonlinear inner dependencies appear in data. Both, Neural Networks (NN) and Random Forests offer good levels of performance in different application areas. However, different methods offer different potential for learning and approximating, coupled with different processing requirements (in time and memory). Random forests training costs less time (when compared to NNs in a generic setting) and after training, they can be more interpretable than the average NN. On the other hand, the accuracy a neural network can reach is higher, if we have access to the required volume and diversity of data (more data is usually needed to tune more parameters). Essentially, the selection of training data is a defining factor for ML, beyond the algorithms and the corresponding architecture. These data should satisfy three basic requirements in order to make a good approximation through a ML model. Those requirement are related to the: 1) quantity 2) diversity 3) quality of data. Regarding our work, we aimed to satisfy these three requirements while creating the training and testing datasets before using the models.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">RESULTS</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1">Database Characteristics</head><p>By applying the methodology described in section 3.1, we generated two databases for the training and the validation of the ML models respectively. The databases consist of surfaces with diverse combinations of roughness parameters (Rms, correlation lengths, Skewness and Kurtosis) and the corresponding functional parameter (active area). The distribution of the training and validation datasets are shown in Figure <ref type="figure" target="#fig_1">2 a</ref>) and b):</p><p>The total volume of training data-surfaces has been 3000 surfaces while for validating reached 15000. We trained each model with different percentages of surfaces from the train dataset to examine the effects of the training data size on model success. Also, for every percentage of the training set, we applied the training procedure 10 times with randomly selected surfaces from the database.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">Model evaluation metric</head><p>For the validation of the predictability of models, we evaluated the RMSRE (Root Mean Square Relative Error). While the RMSE (Root Mean Square Error) can indicate successfully the appearance of outliers, the relative value of RMSE has no units. RMSE is the squared root error of the average predicted active area 𝐴 𝑝 from the average actual active area 𝐴 𝑟 .</p><formula xml:id="formula_4">𝑅𝑀𝑆𝐸 = √︄ 𝑁 𝑖=1 (𝐴 𝑝 − 𝐴 𝑟 ) 𝑁<label>(3)</label></formula><p>To produce the RMSRE, the RMSE is normalized by the projected area of our surfaces. In our case, the projected area is 𝐴 𝑝𝑟𝑜 𝑗𝑒𝑐𝑡𝑒𝑑 = 𝑟𝐿 2 = 64𝜇𝑚 2 .Thus, for N surfaces the RMSRE would take the following formation: </p><formula xml:id="formula_5">𝑅𝑀𝑆𝑅𝐸 = 𝑅𝑀𝑆𝐸 𝐴 𝑝𝑟𝑜 𝑗𝑒𝑐𝑡𝑒𝑑 100 (4)</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3">Machine Learning Results</head><p>Linear Regression.</p><p>We trained a set of Linear Regression models as a basis comparison with the rest of models. Figure <ref type="figure" target="#fig_2">3</ref> shows the RMSRE of the trained models within a volume range of training data (surfaces). The linear models reached a plateau of RMSRE 9.6% using the validation dataset after approximately 100 training data points-surfaces. The model predictions diverge from the true values for high and low active area values, as seen in Figure <ref type="figure" target="#fig_3">4</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Random Forests.</head><p>We used random forest estimators fitting 25 regression decision trees each. Each tree averages the results to improve the predictive accuracy and control over-fitting. The decision trees used for training had a maximum depth of 25 branches with a minimum number of internal node splits of two and a mean squared error criterion for splitting. Random Forest models outperforms the Linear Regression models after approximately 120 data-surfaces (Figure <ref type="figure" target="#fig_2">3</ref>) and achieved less than 4.0% RMSRE after 2100 training data-surfaces. The standard deviation of the RMSRE decreases as the training dataset becomes larger. Figure <ref type="figure" target="#fig_4">5</ref> shows that the models were able to predict the high and low active areas of the surfaces with less error as compared to the the linear regression models. However, there is still a significant amount of error for high active areas.   We used a set of neural network (NN) and a deep neural network (DNN) models for the prediction of active areas. NN models were trained using the adam <ref type="bibr" target="#b21">[22]</ref> solver with a five-layer architecture consisting of 15,25,40,25,15 nodes respectively. The DNN models were trained through rmsprop optimizer with a two-layer architecture consisting of 200 nodes each. To overcome any overfitting tendency, a 50% dropout was established between the two layers and an additive zero-centered Gaussian noise of 𝑠𝑡𝑑 = 1 was added to the last layer. Both models had activation function the Rectified Linear unit (ReLu).</p><p>For training datasets with a size of less than 300 surfaces, the NNs performed worse as compared to the linear regression models (Figure <ref type="figure" target="#fig_2">3</ref>). After 300 surfaces, they outperformed the linear regression models reaching a 𝑅𝑀𝑆𝑅𝐸 = 6%. It is noted that in this case as well, the standard deviation increases with the size of the training data. The performance of DNNs is comparable to the Random forest model. For 2700 surfaces of training data, the DNN models show an average error of 𝑅𝑀𝑆𝑅𝐸 = 2% (Figure <ref type="figure">6</ref>). Therefore, the DNNs generally show a significantly lower error compared to the NNs. By taking the average of the absolute value of the weights between the input layer and the first layer of the DNN, we can identify the most important features of the model. Figure <ref type="figure">7</ref> shows that Rms has the highest weight intensity, followed by the correlation lengths in x and y axis (𝜉 𝑥 and 𝜉 𝑦 resp). This results is in harmony with previous findings based on computational analysis and experimental measurements <ref type="bibr" target="#b3">[4]</ref>   In our work, we proposed a hybridization scheme that combines data-driven methodologies (4th paradigm <ref type="bibr" target="#b7">[8]</ref>) with domain theoretical and modeling knowledge as an alternative (2nd and 3rd paradigm) link between the configuration of nanosurface rough morphology and prediction of wetting behaviour.</p><p>In particular, we trained Linear Regression, Random Forest, Neural Network and Deep Neural Network models with simulated nanosurfaces in order to predict the true (active) area of the surface, a critical parameter for wetting when Wenzel model is assumed. We then compared their performances in relation to the required data (simulation cost to produce rough surfaces). Random Forests and Deep Neural Networks showed the highest performance reaching 4 % of RMSRE after 1000 training data-surfaces. The models and particularly the Deep Neural Networks indicate that Rms has the highest importance in wetting behavior. The correlation lengths in the x and y axis showed lower but significant importance as well whereas skewness and kurtosis play a minor though detectable role.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Examples of simulated rough nano-surfaces a) Isotropic Gaussian surface N= 600 points, rL=4 𝜇m, 𝜉 𝑥 =0.5 𝜇m, 𝜉 𝑦 =0.5 𝜇m, Rms=0.1 𝜇m b) Anisotropic Gaussian surface N= 600 points, rL=6 𝜇m, 𝜉 𝑥 =0. 𝜇m, 𝜉 𝑦 =0.6 𝜇m, Rms=0.1 𝜇m c) Isotropic non-Gaussian surface N= 600 points, rL=6 𝜇m, 𝜉 𝑥 =0.5 𝜇m, 𝜉 𝑦 =0.5 𝜇m, Rms=0.05 𝜇m Sk=2, Ku=6 d) Anisotropic non-Gaussian surface N= 600 points, rL=6 𝜇m, 𝜉 𝑥 =0.2 𝜇m, 𝜉 𝑦 =0.6 𝜇m, Rms=0.05 𝜇m Sk=-0.4, Ku=2</figDesc><graphic coords="3,324.69,83.69,226.77,171.37" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: a) Database of the surface roughness parameters used to train the models. b) Database of the roughness parameters used to validate the models. Each box represents a histogram for the specific parameters (Rms, 𝜉 𝑥 ,𝜉 𝑦 , skewness, kurtosis) and the functional parameter of active area.</figDesc><graphic coords="5,67.62,206.11,212.60,122.37" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: RMSRE vs the volume of the training data for Linear Regression, Random Forests, Neural Networks and Deep Neural Networks</figDesc><graphic coords="5,326.19,83.69,223.78,129.17" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: True actual active versus predicted active area for the linear regression model. This model was trained with 610 random data-surfaces</figDesc><graphic coords="5,341.15,271.59,193.86,118.62" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 5 :</head><label>5</label><figDesc>Figure 5: True actual active versus predicted active area from a random forest model that was trained with 2137 random data-surfaces</figDesc><graphic coords="5,338.86,448.94,198.43,153.20" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head></head><label></label><figDesc><ref type="bibr" target="#b2">[3]</ref> </figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_6"><head>Figure 6 :Figure 7 :</head><label>67</label><figDesc>Figure 6: True actual active versus predicted active area from a Deep Neural Network model that was trained with 2700 random data-surfaces</figDesc><graphic coords="6,74.71,428.02,198.44,153.91" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 :</head><label>1</label><figDesc>𝜉 𝑥 , 𝜉 𝑦 active area Non-Gaussian Rms, 𝜉 𝑥 , 𝜉 𝑦 , skewness, kurtosis active area Input parameters used for generating the active area of Gaussian and non-Gaussian surfaces. The input (roughness) parameters for the Gaussian consist of the Rms heights and the correlation lengths (𝜉) in x and y directions.</figDesc><table><row><cell>Surface Type</cell><cell>Input Parameters</cell><cell>Output Parameter</cell></row><row><cell>Gaussian</cell><cell>Rms,</cell><cell></cell></row></table></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">Y.-P</forename><surname>Zhao</surname></persName>
		</author>
		<title level="m">Characterization of Amorphous and Crystalline Rough Surface: Principles and Applications</title>
				<meeting><address><addrLine>San Diego, CA</addrLine></address></meeting>
		<imprint>
			<publisher>Elsevier</publisher>
			<date type="published" when="2000">2000</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Resistance of solid surfaces to wetting by water</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">N</forename><surname>Wenzel</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Industrial &amp; Engineering Chemistry</title>
		<imprint>
			<biblScope unit="volume">28</biblScope>
			<biblScope unit="issue">8</biblScope>
			<biblScope unit="page" from="988" to="994" />
			<date type="published" when="1936">1936</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Area evaluation of microscopically rough surfaces</title>
		<author>
			<persName><forename type="first">L</forename><surname>Lai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">A</forename><surname>Irene</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Vacuum Science &amp; Technology B: Microelectronics and Nanometer Structures Processing, Measurement, and Phenomena</title>
		<imprint>
			<biblScope unit="volume">17</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="33" to="39" />
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Roughness dependent wettability of sputtered copper thin films: The effect of the local surface slope</title>
		<author>
			<persName><forename type="first">F</forename><surname>Foadi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">H</forename><surname>Brink</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">R</forename><surname>Mohammadizadeh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Palasantzas</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Applied Physics</title>
		<imprint>
			<biblScope unit="volume">125</biblScope>
			<biblScope unit="issue">24</biblScope>
			<biblScope unit="page">244307</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Magnetization and structure of mn-ni and mn-co layered magnetic thin films</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">B</forename><surname>Stearns</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Applied Physics</title>
		<imprint>
			<biblScope unit="volume">53</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="2436" to="2438" />
			<date type="published" when="1982">1982</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhang</surname></persName>
		</author>
		<title level="m">The effect of surface roughness parameters on contact and wettability of solid surfaces</title>
				<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
		<respStmt>
			<orgName>Iowa State University</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">PhD thesis</note>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Artificial intelligence in nanotechnology</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">M</forename><surname>Sacha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Varona</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nanotechnology</title>
		<imprint>
			<biblScope unit="volume">24</biblScope>
			<biblScope unit="page">452002</biblScope>
			<date type="published" when="2013-10">oct 2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Big data creates new opportunities for materials research: A review on methods and applications of machine learning for materials design</title>
		<author>
			<persName><forename type="first">T</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Song</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Sundmacher</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Engineering</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="1017" to="1026" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Crowd-sourcing materials-science challenges with the nomad 2018 kaggle competition</title>
		<author>
			<persName><forename type="first">C</forename><surname>Sutton</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">npj Computational Materials</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">How to represent crystal structures for machine learning: Towards fast prediction of electronic properties</title>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">T</forename><surname>Schütt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Glawe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Brockherde</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Sanna</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">R</forename><surname>Müller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">K U</forename><surname>Gross</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Physical Review B</title>
		<imprint>
			<biblScope unit="volume">89</biblScope>
			<date type="published" when="2014-05">May 2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Machine-learning methods to predict the wetting properties of iron-based composites</title>
		<author>
			<persName><forename type="first">A</forename><surname>Kordijazi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">M</forename><surname>Roshan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Dhingra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Povolo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">K</forename><surname>Rohatgi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Nosonovsky</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Surface Innovations</title>
		<imprint>
			<biblScope unit="page" from="1" to="9" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Raissi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Perdikaris</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">E</forename><surname>Karniadakis</surname></persName>
		</author>
		<title level="m">Physics informed deep learning (part i): Data-driven solutions of nonlinear partial differential equations</title>
				<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Monte carlo calculation for electromagnetic-wave scattering from random rough surfaces</title>
		<author>
			<persName><forename type="first">N</forename><surname>Garcia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Stoll</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Phys. Rev. Lett</title>
		<imprint>
			<biblScope unit="volume">52</biblScope>
			<biblScope unit="page" from="1798" to="1801" />
			<date type="published" when="1984-05">May 1984</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Numerical simulation of 3d rough surfaces and analysis of interfacial contact characteristics</title>
		<author>
			<persName><forename type="first">G</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hong</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">CMES -Computer Modeling in Engineering and Sciences</title>
		<imprint>
			<biblScope unit="volume">103</biblScope>
			<biblScope unit="page" from="251" to="279" />
			<date type="published" when="2014">12 2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">The lack of a priori distinctions between learning algorithms</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">H</forename><surname>Wolpert</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Neural Computation</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="issue">7</biblScope>
			<biblScope unit="page" from="1341" to="1390" />
			<date type="published" when="1996">1996</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">W</forename><surname>Gareth</surname></persName>
		</author>
		<author>
			<persName><forename type="first">James</forename></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Tibshirani</surname></persName>
		</author>
		<title level="m">An Introduction to Statistical Learning</title>
				<meeting><address><addrLine>New York, NY</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="59" to="126" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Random decision forests</title>
		<author>
			<persName><forename type="first">Kam</forename><surname>Tin</surname></persName>
		</author>
		<author>
			<persName><surname>Ho</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of 3rd International Conference on Document Analysis and Recognition</title>
				<meeting>3rd International Conference on Document Analysis and Recognition</meeting>
		<imprint>
			<date type="published" when="1995">1995</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="278" to="282" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">The random subspace method for constructing decision forests</title>
		<author>
			<persName><forename type="first">Kam</forename><surname>Tin</surname></persName>
		</author>
		<author>
			<persName><surname>Ho</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Transactions on Pattern Analysis and Machine Intelligence</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="issue">8</biblScope>
			<biblScope unit="page" from="832" to="844" />
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<title level="m" type="main">Neural networks and deep learning</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Nielsen</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Optimizing feedforward artificial neural network architecture</title>
		<author>
			<persName><forename type="first">P</forename><surname>Benardos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G.-C</forename><surname>Vosniakos</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Engineering Applications of Artificial Intelligence</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="365" to="382" />
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Dropout: A simple way to prevent neural networks from overfitting</title>
		<author>
			<persName><forename type="first">N</forename><surname>Srivastava</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Hinton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Krizhevsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Sutskever</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Salakhutdinov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J. Mach. Learn. Res</title>
		<imprint>
			<biblScope unit="volume">15</biblScope>
			<biblScope unit="page" from="1929" to="1958" />
			<date type="published" when="2014-01">Jan. 2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<title level="m" type="main">Adam: A method for stochastic optimization</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">P</forename><surname>Kingma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Ba</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
