=Paper=
{{Paper
|id=None
|storemode=property
|title=Creating driving behavior for artificial agents in a social augmented micro-world
|pdfUrl=https://ceur-ws.org/Vol-928/0336.pdf
|volume=Vol-928
|dblpUrl=https://dblp.org/rec/conf/csp/SchneiderB12
}}
==Creating driving behavior for artificial agents in a social augmented micro-world==
Creating driving behavior for artificial agents in a social augmented micro-world M.C. Schneider and H-D. Burkhard Humboldt-Universität zu Berlin, Germany mschneid@informatik.hu-berlin.de, hdb@informatik.hu-berlin.de Abstract. This paper presents first results on the implementation of driving agents in the socially augmented micro-world (SAM). Time se- ries analysis are used for comparing the agent behavior with human behavior. We will discuss the results from time series analysis/clustering and compare them with classifications done by hand. Further approaches for future research are presented and discussed. Keywords: artificial agents, driving behavior, micro-world, time series 1 Introduction This paper presents the current work on the ongoing interdisciplinary project of Psychology and Informatics in the field of human factors. The central goal of the work in this paper is to construct artificial agents which can replace humans in a socially augmented micro-world. The socially augmented micro-world (SAM), was developed by the group of H. Wandke at the Institute of Psychology at Humboldt-Universität zu Berlin [1]. The aim is to study design problems for complex human operated systems. Fig. 1 shows the setup of the ATEO lab system (ALS), consisting of two humans which interact with SAM through a joystick, as seen in the left part of the figure. The humans are called micro-world inhabitants (MWI).The human operator sits on the right in the figure, hidden from both MWI. He supervises both MWI through the ATEO Master. ATEO refers to the German title of the umbrella project for developing the ALS: ArbeitsTeilung Entwickler-Operateur. So far, determinism has been the main deficit of simulations. To avoid deter- minism there are two humans included, to cooperatively performing a pursuit tracking task. Each human partially influences a vehicle, which drives a long a street. Both MWI are instructed with different orders, to drive fast or to drive accurately. The operator’s task is to improve the overall performance of the MWI’s by visual or acoustical hints. For a broader range of experiments, it shall be possible to replace human MWI by artificial agents. In the next chapter we will review previous work. Then we will show the final architecture of the agent. The next two parts focus on the two approaches which were made to analyze the human MWI behavior. We thank the colleages from Department of Mathematics and Informatics, University of Novi Sad, for 2 M.C. Schneider et al. Fig. 1: setup of ATEO lab system providing the data of their series analysis. The last chapter presents the results of the comparison of both approaches and the first results of artificial agents and outlines future work. 2 Earlier Work 2.1 Agent architecture In the last year we finished the proposed architecture [2] [3]. The architecture focuses on usability, efficiency, maintainability and portability. Usability can be understood as clarity of the used concepts and the effort needed to use the agents. Efficiency concerns the real time behavior of each component. The next point is the maintainability, which refers to the cost of finding and correcting errors. Modifiability is also part of maintainability and is the cost of changing parts of the agent architecture. The last part is portability and describes the difficulty to port the agent from one environment to an other. One important aspect of the usability is the effort which is needed to implement new agent behavior. To simplify the architecture we use a design pattern called strategy [4]. This design pattern gives a lot of possibilities, new agents can easily be added and the behavior can be changed during run time. 2.2 Classification by hand As mentioned in [2], the behavior of human MWI need to be analyzed, as a prerequisite for the development of artificial agents. There are 26 data sets from former studies with human probands. One data set contains 11 tracks. For each Title Suppressed Due to Excessive Length 3 track there exists one file in the csv data format, which contains information about the state of SAM, taken every 39ms. The first approach focused on only one track, because this track exhibits charac- teristics like forks or obstacles for the first time and has no intervention from the operator. We only analysed the critical situations (forks, the avoiding of obsta- cles). Data gained in the situations provides more variance and hence supports the identification of selective types of interaction behavior. Six different steering behavior were found: 1. Adapted navigator: MWI adjusts his steering to characteristics of the track. 2. Extreme steering navigator: steers with maximum deflection alternating ex- tremely from one direction to the other within a short time. 3. No steering navigator: this MWI hardly shows own steering, the partner steers. 4. Parallel navigator: produces comparable navigation lane to an adapted driver. The parallel navigator exists only in combination with a second parallel nav- igator. 5. Indecisive navigator: makes no decision or decides too late. 6. Residue: remaining data sets which did not fit in any other classification or does not show any characteristic steering behavior. Fig. 2 shows an example for the visualization that support the classification by hand. The driver types were classified by similarities of the curves. 2.3 Classification with time series Instead of analysis by hand as described above, the analysis of time series could lead to automatized classification. Such an approach was done with the help of FAP [5]. FAP is a system for time-series anlysis, developed at Department of Mathematics and Informatics, University of Novi Sad [6]. Three time series have been investigated which described the path, the acceleration and the deviation from the ideal path. The data have been derived from the CVS file. Three dif- ferent similarity measures were chosen: Dynamic Time Warping (DTW), Edit Distance on Real sequence (EDR) and Edit Distance with Real penalty (ERP). For each of the three time series a distance matrix was calculated. A hierarchical clustering algorithm was used for clustering and creating dendrograms. This can be seen in Fig. 3. As each of the three similarity measures follows a different methodology, they lead to different clusters. Two distinct clusters were to be identified distinguish- ing navigators focusing on speed from those focusing on accuracy. To do this, the appropriatness of the three similarity measures was investigated. The different instruction should result in different navigation styles. This difference should be observable by related clusters derived from the time series. As result the ERP measure is the most suitable to group the MWI. The ERP measure is defined as: 4 M.C. Schneider et al. Fig. 2: The acceleration-, speed- and steering curves of a data set at a fork (left). The road segment with fork (right). The classification sytem of parameters (below). P n 1 |si − g| if m = 0 Pm |r − g| if n=0 1 i ERP (R, S) = min{ERP (Rest(R), Rest(S)) + distE RP (r1 , s1 ), ERP (Rest(R), S) + distERP (r1 , gap), otherwise ERP (R, Rest(S)) + distERP (s1 , gap)} |ri − si | if ri , si not gaps distE RP (ri , si ) = |ri − g| if si is a gap |si − g| if ri is a gap This measure borrows ideas from the domain of strings. Two strings are aligned so that they become identical with the smallest number of added, deleted or changed symbols. Title Suppressed Due to Excessive Length 5 Fig. 3: A dendrogram created from hierarchical clustering, with ERP as similarity measure for the path series on ”learning track” 3 Results and Discussion 3.1 Comparison between classification by hand and classification with time series As described in the introduction, the goal is to create artifical agents which can be substituted human MWI. Those artifical agents should show characteristic human driving behavior. The ERP similarity measure is used to compare the human and agent behav- ior. We looked to confirm the hypothesis that MWI in the same classification, which was done by hand, are more similar to each other by the ERP similarity measure. In the first approach, the MWIs were compared using ERP by tracks were they have interacted separatledly with SAM (track 1-4). The classification by hand was done by another track (track 7) where steering were done coopera- tively. The tracks 1-4 were used since there was no interdependence between the drivers just that their characteristic behavior could be observed more clearly. As a negativ result, the MWIs in one of the classes found by hand did not show more ERP similarity than the others. The question stands, if other results could be achieved by comparing the same tracks. 3.2 Artifical agents first results We decided to use P-controller for our agents because of its simplicity. We want to start from a simple and easy to understand agent and add more complex concepts step by step. Ehlert et. al. [7] shows that with a reactive agent and simple rules it is possible to create human-like driving behavior. P-controller, or proportional controller, is well know as a control system and can mathematically described.[8] 6 M.C. Schneider et al. at = Kp (yt − xt ) • at is the resulting controller signal • Kp is the propotional gain • yt is the desired position of the vehicle at time t • xt is the actual position at time t In general, the agent behavior can be called seeking behavior, since it tries to reach a desired point, in this case, the middle of the street. We take the dif- ference from the actual position of the vehicle to the desired position as input for a p-controller. So we get the following equation: steering = Kps ∗ (positiondesired − positioncurrent ) acceleration = Kpa ∗ (1024 − 2 ∗ |steering|) Both functions have a value range from -1024 till 1024. We tested two different value pairs. Fig. 4: steering P-controller (left) and acceleration P-controller (right) Fig. 4 shows the P-controller function for both Typs, Typ1 (Kps = −1, Kpa = 1) and Typ2 (Kps = −1.28, Kpa = 1). For the steering P-controller deviation (x-axes) was printed against the steering (y-axes) and for the acceleration P- controller deviation (x-axes) was printed against acceleration (y-axes). We calculated the ERP distances for those two artificial agents compared to the human MWI. As result we got that the ERP distance of the artificial agents to any of the human MWI is greater then 18000. For example the greatest distance between two human MWI is 10237 (between Experiment 10 Rider 2 and Experiment 25 Rider 2). This means that the human MWI are far away from P-controller. Title Suppressed Due to Excessive Length 7 3.3 Future work In this paper, we presented the first attempts to create artifical agents which show human-like driving behavior. To compare the performances, we used the ERP similarity measure. In the end, we showed a first result for the artificial agents compared to human MWI. To improve the performance of the artificial agents foresight will be used. In this way, we come from a pure reactive agent to more deliberative agent. A further point is the step from P-controller to PID- controller. We will add more parameters which have to be tuned, but in the end we should get a more human like steering behavior and more parameters to create different navigation behavior. With human-like we mean a lower ERP similarity value. Further work is necessary for the usage of time series for analysis. References [1] Wandke, H. ; Nachtwei, J.: The different human factor in automation: the devel- oper behind vs. the operator in action. In: Waard, D. de (Hrsg.) ; Flemisch, F. O. (Hrsg.) ; Lorenz, B. (Hrsg.) ; Oberheid, H. (Hrsg.) ; Brookhuis, K. A. (Hrsg.): Human factors for assistance and automation. Maastricht, the Netherlands: Shaker Publishing, 2008, S. 493–502 [2] Burkhard, H-D. ; Jahn, L. ; Kain, S. ; Meyer, C. ; Muetterlein, J. ; Nachtwei, J. ; Niestroj, N. ; Rougk, S. ; Schneider, M.: Artificial Subjects in the Psychological Experiment Socially Augmented Microworld (SAM). In: Pro- ceedings of International Workshop CS&P 2011, 2011, S. 55–66 [3] Jahn, L. ; Schneider, M.: Dokumentation der Entwicklung von Agenten zur Erset- zung von Mikroweltbewohner on der Socially Augmented Microworld. Studienarbeit an der Humboldt University Berlin, 2012 [4] Gamma, Erich ; Riehle, Dirk ; Helm, Richard ; Johnson, Ralph ; Vlis- sides, John: Entwurfsmuster: Elemente wiederverwendbarer objektorientierter Soft- ware. [6. Aufl.]. München and Boston and Mass. [u.a.] : Addison-Wesley, 2011 http://www.worldcat.org/oclc/693880985 [5] Kurbalija, Vladimir ; Burkhard, Hans-Dieter ; Ivanović, Mirijana ; Meyer, Charlotte ; Nachtwei, Jens ; Fodor, Lidija: Time Series Mining in a Prschological Domain. 2012 [6] Kurbalija, V. ; Radovanović, M. ; Geler, Z. ; Ivanović, M.: A framework for time-series analysis. In: Proceedings of the 14th international conference on Artificial intelligence: methodology, systems and applications, 2010, S. 42–51 [7] Patrick A.M. Ehlert and Leon J.M. Rothkrantz ; Patrick A.M. Ehlert (Hrsg.): A REACTIVE DRIVING AGENT FOR MICROSCOPIC TRAFFIC SIM- ULATION. http://www.kbs.twi.tudelft.nl/docs/conference/2001/Ehlert.P.A.M- ESM2001.pdf. Version: 2001 [8] Russell, Stuart ; Canny, John F.: Künstliche Intelligenz: Ein moderner Ansatz. 1. München and and Boston [u.a.] : Pearson Studium, 2004. – ISBN 978–3–8273– 7089–1