<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Comparing SVM, Gaussian Process and Random Forest Surrogate Models for the CMA-ES</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Zbyněk</forename><surname>Pitra</surname></persName>
							<email>z.pitra@gmail.com</email>
							<affiliation key="aff0">
								<orgName type="institution">National Institute of Mental Health Topolová</orgName>
								<address>
									<addrLine>748</addrLine>
									<postCode>250 67</postCode>
									<settlement>Klecany</settlement>
									<country key="CZ">Czech Republic</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="department">Faculty of Nuclear Sciences and Physical Engineering</orgName>
								<orgName type="institution">Czech Technical University in</orgName>
								<address>
									<addrLine>Prague Břehová 7</addrLine>
									<postCode>115 19</postCode>
									<settlement>Prague 1</settlement>
									<country key="CZ">Czech Republic</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Lukáš</forename><surname>Bajer</surname></persName>
							<email>bajer@cs.cas.cz</email>
							<affiliation key="aff2">
								<orgName type="department">Institute of Computer Science</orgName>
								<orgName type="institution">Academy of Sciences of the Czech Republic Pod Vodárenskou věží</orgName>
								<address>
									<postCode>182 07</postCode>
									<settlement>Prague 8</settlement>
									<country key="CZ">Czech Republic</country>
								</address>
							</affiliation>
							<affiliation key="aff3">
								<orgName type="department">Faculty of Mathematics and Physics</orgName>
								<orgName type="institution">Charles University in</orgName>
								<address>
									<addrLine>Prague ; Malostranské nám. 25</addrLine>
									<postCode>118 00</postCode>
									<settlement>Prague 1</settlement>
									<country key="CZ">Czech Republic</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Martin</forename><surname>Holeňa</surname></persName>
							<email>holena@cs.cas.cz</email>
							<affiliation key="aff2">
								<orgName type="department">Institute of Computer Science</orgName>
								<orgName type="institution">Academy of Sciences of the Czech Republic Pod Vodárenskou věží</orgName>
								<address>
									<postCode>182 07</postCode>
									<settlement>Prague 8</settlement>
									<country key="CZ">Czech Republic</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Comparing SVM, Gaussian Process and Random Forest Surrogate Models for the CMA-ES</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">3C5AEB96EF73149389248A28C6061D3D</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T16:10+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In practical optimization tasks, it is more and more frequent that the objective function is black-box which means that it cannot be described mathematically. Such functions can be evaluated only empirically, usually through some costly or time-consuming measurement, numerical simulation or experimental testing. Therefore, an important direction of research is the approximation of these objective functions with a suitable regression model, also called surrogate model of the objective functions. This paper evaluates two different approaches to the continuous black-box optimization which both integrates surrogate models with the state-of-the-art optimizer CMA-ES. The first Ranking SVM surrogate model estimates the ordering of the sampled points as the CMA-ES utilizes only the ranking of the fitness values. However, we show that continuous Gaussian processes model provides in the early states of the optimization comparable results.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Optimization of an expensive objective or fitness function plays an important role in many engineering and research tasks. For such functions, it is sometimes difficult to find an exact analytical formula, or to obtain any derivatives or information about smoothness. Instead, values for a given input are possible to be obtained only through expensive and time-consuming measurements and experiments. Those functions are called black-box, and because of the evaluation costs, the primary criterion for assessment of the black-box optimizers is the number of fitness function evaluations necessary to achieve the optimal value.</p><p>The Covariance Matrix Adaptation Evolution Strategy (CMA-ES) <ref type="bibr" target="#b4">[5]</ref> is considered to be the state-of-the-art of the black-box continuous optimization. The important property of the CMA-ES is that it advances through the search space only according to the ordering of the function values in current population. Hence, the search of the algorithm is rather local which predisposes it to premature convergence in local optima if not used with sufficiently large population size. This issue resulted in development of several restart strategies <ref type="bibr" target="#b11">[12]</ref>, such as IPOP-CMA-ES <ref type="bibr" target="#b0">[1]</ref> and BIPOP-CMA-ES <ref type="bibr" target="#b5">[6]</ref> performing restarts with population size successively increased, or aCMA-ES <ref type="bibr" target="#b8">[9]</ref> using also unsuccessful individuals for covariance matrix adaptation.</p><p>Furthermore, the CMA-ES often requires more fitness function evaluations to find the optimum than many realworld experiments can offer. In order to decrease the number of evaluations in evolutionary algorithms, it is convenient to periodically train a surrogate model of the fitness function and use it for evaluation of new points instead of the original function. The second option is to use the model for selection of the most promising points to be evaluated by the original fitness.</p><p>Loshchilov's surrogate-model-based algorithm s * ACM-ES <ref type="bibr" target="#b12">[13]</ref> utilizes the former approach: it estimates the ordering of the fitness values required by the CMA-ES using Ranking Support Vector Machines (SVM) as an ordinal regression model. Moreover, it has been shown <ref type="bibr" target="#b12">[13]</ref> that model parameters (hyperparametres) used to construct Ranking SVM model can be optimized during the search by the pure CMA-ES algorithm. Later proposed s * ACM-ES extensions, referred to as s * ACM-ES-k <ref type="bibr" target="#b14">[15]</ref> and BIPOP-s * ACM-ES-k <ref type="bibr" target="#b13">[14]</ref>, use a more intensive exploitation of the surrogate model by increasing population size in generations evaluated by the model.</p><p>More recently, a similar algorithm based on regression surrogate model called S-CMA-ES <ref type="bibr" target="#b2">[3]</ref> has been presented. As opposed to the former algorithm, S-CMA-ES is performing continuous regression by Gaussian processes (GP) <ref type="bibr" target="#b16">[17]</ref> and random forests (RF) <ref type="bibr" target="#b3">[4]</ref>.</p><p>This paper compares the two mentioned surrogate CMA-ES algorithms, s * ACM-ES-k and S-CMA-ES, and the original CMA-ES itself. We benchmark these algorithms on the BBOB/COCO testing set <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b7">8]</ref> not only in their one population IPOP-CMA-ES version, but also in combination with the two-population-size BIPOP-CMA-ES.</p><p>The remainder of the paper is organized as follows. The next chapter briefly describes tested algorithms: the CMA-ES, the BIPOP-CMA-ES, the s * ACM-ES-k, and the S-CMA-ES. Section 3 contains experimental setup and results, and Section 4 concludes the paper and suggests further research directions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Algorithms</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">The CMA-ES</head><p>In each generation g, the CMA-ES <ref type="bibr" target="#b4">[5]</ref> generates λ new candidate solutions</p><formula xml:id="formula_0">x k ∈ R D , where k = 1,...,λ , from a multivariate normal distribution N(m (g) ,σ 2 (g) C (g) )</formula><p>, where m (g) is the mean interpretable as the current best estimate of the optimum, σ 2 (g) the step size, representing the overall standard deviation, and C (g) the D × D covariance matrix. The algorithm selects the µ points with the lowest function value from λ generated candidates to adjust distribution parameters for the next generation.</p><p>The CMA-ES uses restart strategies to deal with multimodal fitness landscapes and to avoid being trapped in local optima. A multi-start strategy where the population size is doubled in each restart is referred to as IPOP-CMA-ES <ref type="bibr" target="#b0">[1]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">BIPOP-CMA-ES</head><p>The BIPOP-CMA-ES <ref type="bibr" target="#b5">[6]</ref>, unlike IPOP-CMA-ES, considers two different restart strategies. In the first one, corresponding to the IPOP-CMA-ES, the population size is doubled in each restart i restart using a constant initial stepsize σ 0 large = σ 0 default :</p><formula xml:id="formula_1">λ large = 2 i restart λ default .<label>(1)</label></formula><p>In the second one, the smaller population size λ small is computed as</p><formula xml:id="formula_2">λ small = ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ λ default 1 2 λ large λ default U[0,1] 2 ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦ ,<label>(2)</label></formula><p>where U[0,1] denotes the uniform distribution in [0,1]. The initial step-size is also randomly drawn as</p><formula xml:id="formula_3">σ 0 small = σ 0 default × 10 −2U[0,1] .<label>(3)</label></formula><p>The BIPOP-CMA-ES performs the first run using the default population size λ default and the initial stepsize σ 0 default . In the following restarts, the strategy with less function evaluations summed over all algorithm runs is selected.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3">s * ACM-ES-k</head><p>Loshchilov's version of the CMA-ES using the ordinal regression by Ranking SVM as surrogate model in specific generations instead of the original function is referred to as s * ACM-ES <ref type="bibr" target="#b12">[13]</ref>, and its extension using a more intensive exploitation is called s * ACM-ES-k <ref type="bibr" target="#b14">[15]</ref>.</p><p>Before the main loop starts, the s * ACM-ES-k evaluates g start generations by the original function, then it repeats the following steps: First, the surrogate model is constructed using hyperparameters θ , and the original function-evaluated points from previous generations. Second, the surrogate model is optimized by the CMA-ES for g m generations with population size λ = k λ λ default and the number of best points µ = k µ µ default , where k λ ,k µ ≥ 1. Third, the following generation is evaluated by the original function using λ = λ default and µ = µ default . To avoid a potential divergence when g m fluctuate between 0 and 1, k λ &gt; 1 is used only in the case of g m ≥ g m λ , where g m λ denotes the number of generations suitable for effective exploitation using the model. Then the model error is calculated according to the comparison of ranking between the original and model evaluation of the last generation. After that, the g m is adjusted in accordance with the model error. As the last step, the s * ACM-ES-k searches a hyperparameter space by one generation of the CMA-ES minimizing the model error to find the most suitable hyperparameter settings θ new for the next model-evaluated generations.</p><p>The s * ACM-ES-k version using BIPOP-CMA-ES proposed in <ref type="bibr" target="#b13">[14]</ref> is called BIPOP-s * ACM-ES-k.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4">S-CMA-ES</head><p>As opposed to the former algorithms, a different approach to surrogate model usage is incorporated in the S-CMA-ES <ref type="bibr" target="#b2">[3]</ref>. The algorithm is a modification of CMA-ES where the original evaluating and sampling phases are substituted by the Algorithm 1 at the beginning of each CMA-ES generation.</p><p>In order to avoid the false convergence of the algorithm in the BBOB benchmarking toolbox, the model-predicted values are adapted to never be lower then the so far minimum of the original function (see the step 17 in the pseudocode).</p><p>The main difference between the S-CMA-ES and the </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Experimental Evaluation</head><p>The core of this paper lies in a systematic comparison of the two mentioned approaches to using surrogate models with the CMA-ES and the original CMA-ES algorithm itself. The first group of surrogate-based algorithms is formed by the S-CMA-ES algorithms using Gaussian processes and random forests models, and the other group is formed by the s * ACM-ES algorithm. These four algorithms (CMA-ES, GP-CMA-ES, RF-CMA-ES, s * ACM-ES) are tested in their IPOP version (based on IPOP-CMA-ES) <ref type="bibr" target="#b0">[1]</ref> and in the bi-population restart strategy version (based on BIPOP-CMA-ES and its derivatives) <ref type="bibr" target="#b5">[6]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Experimental Setup</head><p>The experimental evaluation is performed through the noiseless part of the COCO/BBOB framework (COmparing Continuous Optimizers / Black-Box Optimization Benchmarking) <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b7">8]</ref>. It is a collection of 24 benchmark functions with different degree of smoothness, uni-/multimodality, separability, conditionality etc. Each function is </p><formula xml:id="formula_4">y k ← f (x k ) k = 1,...,λ {fitness evaluation} 4: A = A ∪ {(x k ,y k )} λ k=1 5: (X tr ,y tr ) ← {(x,y)∈A (m−x) ⊺ σ C −1 2 (m−x) ≤ r} 6: if X tr ≥ n REQ then 7: (X tr ,y tr ) ← choose n MAX points if X tr &gt; n MAX 8:</formula><p>{transformation to the eigenvector basis:}</p><formula xml:id="formula_5">X tr ← {(σ C −1 2 ) ⊺ x tr for each x tr ∈ X tr } 9:</formula><p>f M ← trainModel(X tr ,y tr )  end if 21: end if Output: f M , A, (y k ) λ k=1 defined for any dimension D ≥ 2; the dimensions used for our tests are 2, 5, 10, and 20. The set of functions comprises, among others, well-known continuous optimization benchmarks like ellipsoid, Rosenbrock's, Rastrigin's, Schweffel's or Weierstrass' function.</p><formula xml:id="formula_6">x k ← (σ C −1 2 ) ⊺ x k k = 1,...,λ 16: y k ← f M (x k ) k = 1,...,</formula><p>The framework calls the optimizers on 15 different instances for each function and dimension, meaning that 1440 optimization runs were called for each of the eight considered algorithms. The graphs at the end of the paper show detailed results in a per-function and per-groupof-function manner. The following paragraphs summarize the parameters of the algorithms. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Results</head><p>The performance of the algorithms is compared in the graphs placed in Figures <ref type="figure" target="#fig_1">1-3</ref>. The graphs in Figure <ref type="figure" target="#fig_1">1</ref> depict the expected running time (ERT), which depends on a given target function value f t = f opt + ∆ f -the true optimum f opt of the respective benchmark function raised by a small value ∆ f . The ERT is computed over all relevant  trials as the number of the original function evaluations (FEs) executed during each trial until the best function value reached f t , summed over all trials and divided by the number of trials that actually reached f t <ref type="bibr" target="#b6">[7]</ref>.</p><p>As we can see in Figure <ref type="figure" target="#fig_1">1</ref>, the 24 functions can be roughly divided into two groups according to the algorithm which performed the best (at least in 10D and 20D). The first group of functions where the CMA-ES performed best consists of functions 1, 3, 4, 6, and 20 while on functions 2, 5, 7, 10, 11, 13-16, 18, 21, 23, and 24, GP5-CMA-ES is usually better. The usage of the BIPOP versions generally leads to no improvement or even to performance decrease.</p><p>The graphs in Figures <ref type="figure">2 and 3</ref> summarize the performance over subgroups of the benchmark functions and show the proportion of algorithm runs that reached the target value f t ∈ 10 [−8..2] indeed ( f t was actually different for each respective function, see the figures captions). Roughly speaking, the higher the colored line, the better the performance of the algorithm is for the number of the original evaluations given on the horizontal axis.</p><p>Thus we can see that our GP5-CMA-ES usually outperforms the other algorithms when we consider the evaluations budget FEs ≤ 10 1.5 D, i.e. FEs ≤ 150 for 5D and FEs ≤ 600 for 20D. However, as the number of the considered original evaluations rises, the original CMA-ES or the s * ACM-ES usually performs better. This fact can be summarized that our GP5-CMA-ES is convenient especially for the applications where a very low number of function evaluations is available, such as in <ref type="bibr" target="#b1">[2]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Conclusions &amp; Future Work</head><p>In this paper, we have compared the surrogate-assisted S-CMA-ES, which uses GP and RF continuous regression models, with s * ACM-ES-k algorithm based on ordinal regression by Ranking SVM, and the original CMA-ES, all in their IPOP and BIPOP versions. The comparison shows that Gaussian process S-CMA-ES usually outperforms the ordinal-based s * ACM-ES-k in early stages of the algorithm search, especially on multimodal functions (BBOB functions 15-24). However, the algorithms and surrogate models should be further analyzed and compared since, for example, the NEWUOA <ref type="bibr" target="#b15">[16]</ref> or SMAC <ref type="bibr" target="#b9">[10,</ref><ref type="bibr" target="#b10">11]</ref> algorithms spend a considerably lower number of function evaluations than the CMA-ES in these early optimization phases. The BIPOP versions of the algorithms did not increased performances of appropriate IPOP versions except BBOB function 5.</p><p>A natural perspective of improving S-CMA-ES is to make the number of model-evaluated generations selfadaptive. We will additionally investigate different properties of continuous and ordinal regression in view of their applicability as regression models. Different cases and benchmarks where the ordinal regression is clearly superior to continuous regression will be further identified. For example, hybrid surrogate models combining both kinds of regression will be attempted.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head></head><label></label><figDesc>-k is in the manner how the CMA-ES is utilized. Considering S-CMA-ES, the model prediction or training is performed within each generation of the CMA-ES. On the contrary in the s * ACM-ES-k, individual generations of the CMA-ES are started to optimize either original fitness, surrogate fitness, or model itself.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Algorithm 1</head><label>1</label><figDesc>Surrogate CMA-ES Algorithm [3] Input: g (generation), g m (number of model generations), σ , λ , m, C (CMA-ES internal variables), r (maximal distance between training points and m), n REQ (minimal number of points for model training), n MAX (maximal number of points for model training), A (archive), f M (model), f (original fitness function) 1: x k ∼ N m,σ 2 C k = 1,...,λ {CMA-ES sampling} 2: if g is original-evaluated then 3:</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head></head><label></label><figDesc>The CMA-ES. The original CMA-ES was used in its IPOP-CMA-ES version (Matlab code v. 3.61) with number of restarts = 4, IncPopSize = 2, σ start =8  3 , λ = 4 + ⌊3logD⌋. The remainder settings were left default. s * ACM-ES. We have used Loshchilov's GECCO 2013 Matlab code xacmes.m [14] in its s * ACM-ES version, setting the parameters CMAactive = 1, newRestartRules = 0 and withSurr = 1, modelType = 1, withModelEnsembles = 0, withModelOptimization = 1, hyper_lambda = 20, λ Mult = 1, µ Mult = 1 and Λ minIter = 4. S-CMA-ES: GP5-CMA-ES and RF5-CMA-ES. The number after the GP/RF in the names of the algorithms denotes the number of model-evaluated generations g m , which are evaluated by the model in row. All considered S-CMA-ES versions use the distance r = 8 (see algorithm 1). For the GP model, K ν=5 2 Matérn covariance function with starting values (σ 2 n ,l,σ 2 f ) = log(0.01,2,0.5) has been used (see [3] for the details). We have tested RF comprising 100 regression trees, each containing at least two training points in each leaf. The CMA-ES parameters (IPOP version, σ start , λ , IncPopSize etc.) were used the same as in the pure CMA-ES experiments. All S-CMA-ES parameter values were chosen according to preliminary testing on several functions from the COCO/BBOB framework. BIPOP version of the algorithms. The bi-population versions BIPOP-CMA-ES and BIPOP-s * ACM-ES use the same Loshchilov's Matlab code xacmes.m with the parameter BIPOP = 1. The BIPOP-GP5-CMA-ES and BIPOP-RF5-CMA-ES algorithms are constructed in the same manner as the S-CMA-ES was transformed from the CMA-ES -by integration of the Algorithm 1 into every generation of the BIPOP-s * ACM-ES.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 1 :Figure 2 :Figure 3 :</head><label>123</label><figDesc>Figure 1: Expected running time (ERT in number of f -evaluations as log 10 value) divided by dimension versus dimension. The target function value is chosen such that the bestGECCO2009 artificial algorithm just failed to achieve an ERT of 10 × DIM. Different symbols correspond to different algorithms given in the legend of f 1 and f 24 . Light symbols give the maximum number of function evaluations from the longest trial divided by dimension. Black stars indicate a statistically better result compared to all other algorithms with p &lt; 0.01 and Bonferroni correction number of dimensions (six). Legend: ○:BIPOP-CMAES, ▽:BIPOP-GP5, ⋆:BIPOP-RF5, ◻:BIPOP-saACMES, △:CMA-ES, ♢:GP5-CMAES, :RF5-CMAES, :saACMES</figDesc></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0">Comparing SVM, Gaussian Process and Random Forest Surrogate Models for the CMA-ES</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgements</head><p>This work was supported by the Czech Science Foundation (GA ČR) grant P103/13-17187S, by the Grant Agency of the Czech Technical University in Prague with its grant No. SGS14/205/OHK4/3T/14, and by the project "National Institute of Mental Health (NIMH-CZ)", grant number CZ.1.05/2.1.00/03.0078 (and the European Regional Development Fund.). Further, access to computing and storage facilities owned by parties and projects contributing to the National Grid Infrastructure MetaCentrum, provided under the programme "Projects of Large Infrastructure for Research, Development, and Innovations" (LM2010005), is greatly appreciated.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">A restart CMA evolution strategy with increasing population size</title>
		<author>
			<persName><forename type="first">A</forename><surname>Auger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Hansen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The 2005 IEEE Congress on Evolutionary Computation</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2005-09">Sept. 2005</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="1769" to="1776" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Baerns</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Holeňa</surname></persName>
		</author>
		<title level="m">Combinatorial development of solid catalytic materials. Design of high-throughput experiments, data analysis, data mining</title>
				<meeting><address><addrLine>London</addrLine></address></meeting>
		<imprint>
			<publisher>Imperial College Press / World Scientific</publisher>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Benchmarking gaussian processes and random forests surrogate models on the BBOB noiseless testbed</title>
		<author>
			<persName><forename type="first">L</forename><surname>Bajer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Pitra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Holeňa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 17th GECCO Conference Companion</title>
				<meeting>the 17th GECCO Conference Companion<address><addrLine>Madrid; New York</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2015-07">July 2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">Classification and regression trees</title>
		<author>
			<persName><forename type="first">L</forename><surname>Breiman</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1984">1984</date>
			<publisher>Chapman &amp; Hall/CRC</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">The CMA evolution strategy: A comparing review</title>
		<author>
			<persName><forename type="first">N</forename><surname>Hansen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Studies in Fuzziness and Soft Computing</title>
				<editor>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Lozano</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">P</forename><surname>Larranaga</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">I</forename><surname>Inza</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">E</forename><surname>Bengoetxea</surname></persName>
		</editor>
		<meeting><address><addrLine>Berlin Heidelberg</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2006-01">Jan. 2006</date>
			<biblScope unit="volume">192</biblScope>
			<biblScope unit="page" from="75" to="102" />
		</imprint>
	</monogr>
	<note>Towards a New Evolutionary Computation</note>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Benchmarking a BI-population CMA-ES on the BBOB-2009 function testbed</title>
		<author>
			<persName><forename type="first">N</forename><surname>Hansen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 11th Annual GECCO Conference Companion: Late Breaking Papers, GECCO&apos;09</title>
				<meeting>the 11th Annual GECCO Conference Companion: Late Breaking Papers, GECCO&apos;09<address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="2389" to="2396" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<author>
			<persName><forename type="first">N</forename><surname>Hansen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Auger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Finck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Ros</surname></persName>
		</author>
		<title level="m">Real-parameter black-box optimization benchmarking</title>
				<imprint>
			<date type="published" when="2012">2012. 2012</date>
		</imprint>
		<respStmt>
			<orgName>INRIA</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Technical Report</note>
	<note>Experimental setup</note>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<author>
			<persName><forename type="first">N</forename><surname>Hansen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Finck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Ros</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Auger</surname></persName>
		</author>
		<idno>RR-6829</idno>
		<title level="m">Real-parameter black-box optimization benchmarking 2009: Noiseless functions definitions</title>
				<imprint>
			<date type="published" when="2009-02">2009. February 2010</date>
		</imprint>
		<respStmt>
			<orgName>INRIA</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Technical Report</note>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Benchmarking a weighted negative covariance matrix update on the BBOB-2010 noiseless testbed</title>
		<author>
			<persName><forename type="first">N</forename><surname>Hansen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Ros</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 12th Annual Conference Companion on Genetic and Evolutionary Computation, GECCO&apos;10</title>
				<meeting>the 12th Annual Conference Companion on Genetic and Evolutionary Computation, GECCO&apos;10<address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="1673" to="1680" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Sequential modelbased optimization for general algorithm configuration</title>
		<author>
			<persName><forename type="first">F</forename><surname>Hutter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Hoos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Leyton-Brown</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Learning and Intelligent Optimization 2011</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<editor>
			<persName><forename type="first">C</forename><surname>Coello</surname></persName>
		</editor>
		<meeting><address><addrLine>Berlin Heidelberg</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2011">2011</date>
			<biblScope unit="volume">6683</biblScope>
			<biblScope unit="page" from="507" to="523" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">An evaluation of sequential model-based optimization for expensive blackbox functions</title>
		<author>
			<persName><forename type="first">F</forename><surname>Hutter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Hoos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Leyton-Brown</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 15th Annual Conference Companion on Genetic and Evolutionary Computation, GECCO&apos;13 Companion</title>
				<meeting>the 15th Annual Conference Companion on Genetic and Evolutionary Computation, GECCO&apos;13 Companion<address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="1209">1209-1216. 2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Alternative restart strategies for CMA-ES</title>
		<author>
			<persName><forename type="first">I</forename><surname>Loshchilov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Schoenauer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Sebag</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">PPSN</title>
		<editor>C. A. C. Coello, V. Cutello, K. Deb, S. Forrest, G. Nicosia, and M. Pavone</editor>
		<imprint>
			<biblScope unit="volume">7491</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="296" to="305" />
			<date type="published" when="2012">2012</date>
			<publisher>Springer</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Self-adaptive surrogate-assisted covariance matrix adaptation evolution strategy</title>
		<author>
			<persName><forename type="first">I</forename><surname>Loshchilov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Schoenauer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Sebag</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 14th GECCO, GECCO &apos;12</title>
				<meeting>the 14th GECCO, GECCO &apos;12<address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2012">2012</date>
			<biblScope unit="page" from="321" to="328" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">BI-population CMA-ES algorithms with surrogate models and line searches</title>
		<author>
			<persName><forename type="first">I</forename><surname>Loshchilov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Schoenauer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Sebag</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Genetic and Evolutionary Computation Conference (GECCO Companion)</title>
				<imprint>
			<publisher>ACM Press</publisher>
			<date type="published" when="2013-07">July 2013</date>
			<biblScope unit="page" from="1177" to="1184" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Intensive surrogate model exploitation in self-adaptive surrogate-assisted CMA-ES (saACM-ES)</title>
		<author>
			<persName><forename type="first">I</forename><surname>Loshchilov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Schoenauer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Sebag</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Genetic and Evolutionary Computation Conference (GECCO)</title>
				<imprint>
			<publisher>ACM Press</publisher>
			<date type="published" when="2013-07">July 2013</date>
			<biblScope unit="page" from="439" to="446" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">The NEWUOA software for unconstrained optimization without derivatives</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J D</forename><surname>Powell</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Large-Scale Nonlinear Optimization, number 83 in Nonconvex Optimization and Its Applications</title>
				<editor>
			<persName><forename type="first">G</forename><forename type="middle">D</forename><surname>Pillo</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Roma</surname></persName>
		</editor>
		<imprint>
			<publisher>Springer US</publisher>
			<date type="published" when="2006">2006</date>
			<biblScope unit="page" from="255" to="297" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Gaussian processes for machine learning</title>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">E</forename><surname>Rasmussen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">K I</forename><surname>Williams</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Adaptative Computation and Machine Learning Series</title>
				<imprint>
			<publisher>MIT Press</publisher>
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
