Towards an algorithm selection standard: data format and tools Lars Kotthoff1 The Algorithm Selection Problem is attracting increasing attention REFERENCES from researchers and practitioners from a variety of different back- [1] Bernd Bischl, Pascal Kerschke, Lars Kotthoff, Marius Lindauer, Yuri grounds. After decades of fruitful applications in a number of do- Malitsky, Alexandre Fréchette, Holger Hoos, Frank Hutter, Kevin mains, a lot of data has been generated and many approaches tried, Leyton-Brown, Kevin Tierney, and Joaquin Vanschoren, ‘Aslib: A but the community lacks a standard format or repository for this data. benchmark library for algorithm selection’, under review for AIJ, (2014). Furthermore, there are no standard implementation tools. This situ- [2] Lars Kotthoff, ‘LLAMA: leveraging learning to automatically manage algorithms’, Technical Report arXiv:1306.1031, arXiv, (June 2013). ation makes it hard to effectively share and compare different ap- [3] Lars Kotthoff, ‘Algorithm selection for combinatorial search problems: proaches and results on different data. It also unnecessarily increases A survey’, AI Magazine, (2014). the initial threshold for researchers new to this area. [4] Lars Kotthoff, Ian P. Gent, and Ian Miguel, ‘An evaluation of machine In this talk, I will first give a brief introduction to the Algorithm learning in algorithm selection for search problems’, AI Communica- tions, 25(3), 257–270, (2012). Selection Problem and approaches to solving it [4, 3]. Then, I will present a standardized format for representing algorithm selection scenarios and a repository that contains a growing number of data sets from the literature, Aslib [1]. The format has been designed to be able to express a wide variety of different scenarios. In addition to en- coding instance features and algorithm performances, there are facili- ties for providing feature costs, the status of algorithm execution and feature computations, cross-validation splits and meta-information. In addition to the data format itself, there is an R package that im- plements parsers and basic analysis tools. I will illustrate its usage through a series of examples. I will further present LLAMA [2], a modular and extensible toolkit implemented as an R package that facilitates the exploration of a range of different portfolio techniques on any problem domain. It im- plements the algorithm selection approaches most commonly used in the literature and leverages the extensive library of machine learning algorithms and techniques in R. I will provide an overview of the architecture of LLAMA and the current implementation. Leveraging the standard data format and the LLAMA toolkit, I will conclude this talk by presenting a set of example experiments that build and evaluate algorithm selection models. The models are created and evaluated with LLAMA on the problems in the algo- rithm selection benchmark repository. The results demonstrate the potential of algorithm selection to achieve significant performance improvements even through straightforward application of existing techniques. Together, Aslib and LLAMA provide a low-threshold starting point for researchers wanting to apply algorithm selection to their domain or prototype new approaches. Both are under active develop- ment. Joint work with Bernd Bischl, Pascal Kerschke, Marius Lindauer, Yuri Malitsky, Alexandre Fréchette, Holger Hoos, Frank Hutter, Kevin Leyton-Brown, Kevin Tierney, and Joaquin Vanschoren. 1 University College Cork, larsko@4c.ucc.ie