=Paper=
{{Paper
|id=Vol-2542/MOD-KI3
|storemode=property
|title=Position Paper: Explainable Search of Multimodal Itineraries
|pdfUrl=https://ceur-ws.org/Vol-2542/MOD-KI3.pdf
|volume=Vol-2542
|authors=Jan Fabian Ehmke,Thomas Horstmannshoff
|dblpUrl=https://dblp.org/rec/conf/modellierung/EhmkeH20
}}
==Position Paper: Explainable Search of Multimodal Itineraries==
Joint Proceedings of Modellierung 2020 Short, Workshop and Tools & Demo Papers Workshop on Models in AI 150 Position Paper: Explainable Search of Multimodal Itineraries Jan Fabian Ehmke1, Thomas Horstmannshoff12 Abstract. Travel websites like Google Maps promise the seamless creation of multimodal itin- eraries according to individual traveler preferences. The engines of these websites are based on efficient techniques of mathematical optimization and can search multimodal itineraries in negli- gible runtime as long as the number of traveler preferences is small. However, due to the large number and variety of mobility services, the solution space of multimodal itineraries is often complex. Hence, the search process is not transparent to individual traveler, and it is hard for them to assess the impact of their preferences on the set of created multimodal itineraries. In this position paper, we propose to make the search of individualized multimodal itineraries more ex- plainable. Demonstrating the idea of solution sampling and using a large data set of multimodal mobility services, we derive information on the solution space and demonstrate how these can support travelers in the adaptation of their preferences and in the creation of a set of individualized itineraries. Keywords: Explainable OR, Multimodal Mobility, Sampling, Customer Empowerment. 1 Introduction In recent years, the number and variety of mobility services has risen considerably. Especially innovative mobility services such as bike, car and ride sharing services have emerged and complement traditional mobility services such as scheduled trains and buses. Following recent mobility surveys, travelers expect a reasonable choice of itin- eraries created from their individual preferences such as travel time, price, number of transfers, mode, waiting time and walking distance [1-5]. Since travelers require door- to-door travel information, individual mobility services need to be combined to form a complete itinerary, considering the preferences of individual travelers. Travel websites such as Google Maps promise the seamless creation of a set of indi- vidualized itineraries, but usually remain very simple in discovering and considering individual preferences. The reason is that the search of multimodal itineraries requires quantifications of traveler preferences. Then, these can be considered through con- straints of established mathematical models and be solved with efficient solution tech- niques of multimodal network search. However, state-of-the-art solution techniques are limited in the number of preferences they can consider efficiently. Delling et al. [6] 1 Management Science Group, Otto von Guericke University Magdeburg, Universitätsplatz 2, 39016 Mag- deburg, Germany Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). Position Paper: Explainable Search of Multimodal Itineraries 151 highlight that taking more preferences into account at the same time has an enormous impact on the runtime of multimodal path-finding approaches. With regard to the consideration of multiple traveler preferences, from a mathemat- ical-optimization point of view, hierarchical objective functions are common. These functions optimize for one objective first (e.g. minimize travel times) and focus on sec- ondary objectives subsequently (e.g. minimize the number of transfers). However, as it is well-known for hierarchical objective functions, depending on how tight the con- straints are, there may be either no feasible solution at all or a very large number of solutions [7]. Hence, travelers could face either an empty or an excessively large set of itineraries. Since the search is based on an “invisible” mathematical model, travelers cannot interact with the search engine in a reasonable way and often perceive the crea- tion of itineraries as a “black-box” process. However, if travelers knew more – in a simplified and aggregated way – about the characteristics of the solution space, they could interact with the search engine to define their individual preferences and adapt them if necessary. In information systems literature, this extended interaction of cus- tomer and search engine is called “customer empowerment” [8]. In this position paper, we propose the idea of the individualized creation of multi- modal travel itineraries. Based on efficient mathematical modeling and solution tech- niques and using a large data set of German mobility services, we demonstrate the po- tential and limitations of “black-box” mathematical optimization and discuss how to include more complex traveler preferences. With solution sampling, we present a sim- ple idea to identify characteristics of the solution space and empower travelers to re- strict or enhance their preferences interactively. 2 Multimodal Itinerary Creation Fig. 1. Common way of creating multimodal travel itineraries Figure 1 illustrates the common way of searching multimodal itineraries. Given for- malized traveler preferences, a multimodal routing algorithm such as REGL-CSPP, in- troduced by [9], is run to determine a probably extensive number of multimodal itiner- aries (each feasible itinerary is indicated by a cross). It is then the traveler’s burden to dynamically reduce or extend the retrieved set of itineraries by using filtering and sort- ing techniques without having a clear idea about the characteristics of the solution space. As these are hardly transparent for the traveler, it is usually quite difficult for them to restrict or extend the set of itineraries systematically. We propose a data-driven decision support system that uses solution sampling in conjunction with searching multimodal travel itineraries. Figure 2 visualizes our idea: By systematically re-running a multimodal routing algorithm, we retrieve valuable 152 Ehmke and Horstmannshoff meta-information about the solution space characteristics, which can be used to explain the solution space to travelers as well as empower them to reduce/extend the retrieved set of itineraries. This implies identifying promising solutions on the one hand, but also partially infeasible solutions on the other hand, allowing the traveler to reflect on the importance and impact of their preferences. For instance, in case the traveler sets the upper bound for travel time to 4 hours, a favored solution taking 4 hours and 5 minutes might still be relevant. In addition, solution space characteristics can be used to derive typical relationships between preferences. For instance, for some types of itineraries, there might be a linear relationship between travel time and price, which can help the traveler to adjust their preferences systematically. Fig. 2. Sampling of multimodal travel itineraries In the end, the proposed decision support system could be integrated in a multimodal routing app to enable an explainable search of multimodal itineraries for the traveler. In the following, based on real-world multimodal mobility service data from Germany, we will detail our idea of a more explainable search through solution sampling. 3 Sampling Framework To guide travelers in their search of multimodal travel itineraries, we focus on retrieving solution space characteristics through solution sampling. Solution sampling builds on the repeated execution of efficient solution techniques for the multimodal search of itineraries (in our case: REGL-CSPP). Our framework consists of four steps: 1. Two-dimensional analysis of each traveler preference 2. Merge of the created sets of itineraries 3. Removal of dominated itineraries 4. Dynamic reduction/extension of the set of itineraries For step 1, given a request for a specific source 𝑠 and a target 𝑡 of an itinerary, for all parameters 𝑖 ∈ 𝐼 with 𝐼 being the set of quantifiable traveler preferences, we identify “reasonable” parameter intervals [𝑙 , 𝑢 ]. The lower bound 𝑙 of the parameter 𝑖 can be easily determined by running REGL-CSPP without any constraints. Algorithmic details are provided in [10]. The upper bound 𝑢 is derived from the maximum value for the respective parameter after applying REGL-CSPP for a search based on all other param- eters under investigation. Then, we search all two-dimensional relationships between all parameter combinations 𝑖, 𝑗 ∈ 𝐼 | 𝑖 ≠ 𝑗 within the identified intervals by systemati- cally re-running REGL-CSPP according to a predefined sampling density. For instance, Position Paper: Explainable Search of Multimodal Itineraries 153 to identify the two-dimensional solution set 𝑆 , , we set the parameter “travel time” as the objective and rerun REGL-CSPP with different upper bounds for the parameter “price” starting with the identified upper bound 𝑢 . By iteratively softening the restriction (e.g., by dividing the interval of price into 𝑘 equally sized buck- ets) and re-running REGL-CSPP with that specific restriction, we can identify itinerar- ies with different prices that are all optimal with regard to travel time. For step 2, we merge all itineraries searched based on two-dimensional sets 𝑆 , ∀𝑖, 𝑗 ∈ 𝐼 | 𝑖 ≠ 𝑗 into one comprehensive set 𝑆 . Then, in step 3, dominated itiner- aries are removed. An itinerary 𝑠 dominates an itinerary 𝑠 if 𝑠 is no worse in any parameter 𝑖 ∈ 𝐼 than 𝑠 [6]. All remaining itineraries with optimal parameter combina- tions create the Pareto Front. For step 4, the set of non-dominated itineraries 𝑆 is reduced if needed. In case the number of created itineraries in 𝑆 is sufficient, i.e. it is in a predefined interval [𝑐 , 𝑐 ], e.g. [1, 10], the set of itineraries is presented to the traveler in conjunction with detailed information on solution space characteristics. The set of created itineraries can be reduced by applying fuzzy logic approaches to multimodal route planning as demonstrated by [6]. The main idea of fuzzy logic in this case is to identify itineraries from the set of Pareto-optimal solutions that show negligible differences, which are likely not relevant for the traveler. Therefore, undesirable and likely irrelevant itinerar- ies can be dismissed, e.g. an itinerary where a traveler would have to pay significantly more to save only a few seconds of travel time. 4 Exemplary Results To demonstrate our approach of systematic sampling, we create information about the search space for multimodal itineraries between several major cities in Germany. The experimental setting considers a large amount of data on current mobility services and different modes of transportation as well as innovative ridesharing services such as “BlaBlaCar”. Modelled as a time-expanded network, the entire network for one specific day of operation consists of approximately 6 million nodes and 200 million arcs. As exemplary traveler preferences, we consider travel time, price and number of transfers. We estimate price information for public transit systems, as these are not given in their respective published GTFS data. We analyze long-distance trips between large cities in Germany assuming a start time for the itinerary at 9 am. As result of solution sampling, Figure 3 shows information on the relationship be- tween the preferences “price” and “travel time” as well as between “price” and “number of transfers” for long-distance trips between Berlin and Munich, Nuremberg and Ham- burg as well as Dortmund and Berlin. For price versus travel time, to ensure the com- parability, we normalized the parameters between 0 and 1. A travel time of “0” indi- cates the itinerary with the smallest occurrence for travel time, whereas a travel time of “1” represents the itinerary with the highest occurrence for travel time. 154 Ehmke and Horstmannshoff Price Travel Time Number of Transfers Fig. 3. Exemplary results for three different itineraries Significant differences in the relationship between the parameters travel time and price can be discovered. For itineraries between Berlin and Munich (indicated by trian- gles), a traveler has to invest quite a significant amount of time until large cost savings can be realized. In comparison, prices are much lower when investing only slightly more travel time for traveling between Dortmund and Berlin (indicated by circles). Be- tween Nuremberg and Hamburg (indicated by rectangles), a roughly linear relationship between travel time and costs can be discovered. Minor differences can be seen for the relationship between the parameters “number of transfers” and “price”. For itineraries between Berlin and Munich as well as between Dortmund and Berlin, a traveler has to be willing to transfer at least once in order to achieve substantial price savings. Further transfers do not result in significant savings. For itineraries between Nuremberg to Hamburg, travelers realize the highest savings in case of transferring twice. The identification of these aggregated relationships between relevant traveler pref- erences can be communicated to travelers and help guiding them in the solution space. Thereby, we empower travelers to interactively individualize the set of itineraries. For instance, it can be of high relevance for a price-sensitive traveler to have an idea which of their individual preferences has a high impact on the price for the respective itinerary and thereby help assessing which of the preferences should be relaxed/restricted to re- duce/extend the set of itineraries under investigation. Position Paper: Explainable Search of Multimodal Itineraries 155 Table 1: Results of Solution Sampling Sampling density Mode Choice ∅ Dominated solutions removed [%] All 23.5 𝑘=5 Without car 19.8 All 89.1 𝑘 = 20 Without car 87.8 Table 1 compares the impact of different sampling densities (5 and 20) as well as different mode choice configurations. For the latter, either the travelers are willing to use all available modes, or the traveler does not want to drive a car. It can be seen that with a small sampling density of 𝑘 = 5 about 20% of the retrieved solutions can be removed as they are being dominated by another solution. With a larger sampling den- sity, up to 90% of the retrieved solutions are of no relevance for the traveler and hence are removed before showing the results to them. 5 Conclusion and Outlook In recent years, there has been tremendous progress in the field of searching multimodal travel itineraries. While the search of individualized itineraries for a given and limited set of preferences is possible, this often comes as a black-box search, and it is hard for travelers to interact with the search engine in a way that can help finding more individ- ualized itineraries. We propose the idea of solution sampling to benefit from the state- of-the-art algorithms in network search while deriving characteristics of the solution space that can be helpful to make the search process more explainable to travelers. Although search engines and mathematical optimization form an important part in finding individualized multimodal itineraries, a more comprehensive framework is needed to provide travelers and customers in general with more “white-box” solutions. Alt et al. [8] present a framework from the information-systems perspective not only suited for the search of itineraries, but also for services from finance, health and educa- tion. Embedding mathematical optimization and methods from data mining in this framework can help creating individualized and especially transparent options, com- bining the power of model-based optimization and data-driven decision making. References 1. Grotenhuis, Jan-Willem; Wiegmans, Bart W.; Rietveld, Piet (2007): The Desired Quality of Integrated Multimodal Travel Information in Public Transport. Customer needs for time and effort savings. In: Transport Policy 14 (1), 27–38. 2. Ehmke, Jan Fabian; Mattfeld, Dirk C.; Albrecht, Linda (2016): Position Paper: Combining Mobility Services by Customer-Induced Orchestration. In: Proceedings of Rec- Tour@RecSys 2016. 156 Ehmke and Horstmannshoff 3. Esztergár-Kiss, Domokos; Csiszár, Csaba (2015): Evaluation of Multimodal Journey Plan- ners and Definition of Service Levels. In: International Journal of Intelligent Transportation Systems Research 13 (3), 154–165. 4. Stopka, Ulrike (2014): Identification of User Requirements for Mobile Applications to Sup- port Door-to-Door Mobility in Public Transport. In: David Hutchison, Takeo Kanade und Josef Kittler (Eds.): Human-Computer Interaction. Applications and Services: 16th Interna- tional Conference, HCI International 2014, Part 3, 8512, 513–524. 5. Willing, Christoph; Brandt, Tobias; Neumann, Dirk (2017): Intermodal Mobility. In: Busi- ness & Information Systems Engineering 59 (3), 173–179. 6. Delling, Daniel; Dibbelt, Julian; Pajor, Thomas; Wagner, Dorothea; Werneck, Renato F. (2013): Computing Multimodal Journeys in Practice. In: David Hutchison, Takeo Kanade und Josef Kittler (Hg.): Experimental Algorithms. 12th International Symposium, SEA 2013, Proceedings, 7933, 260–271 7. Ehrgott, Matthias (2005): Multicriteria Optimization. 2nd edition. Springer. 8. Alt, Rainer; Ehmke, Jan Fabian; Haux, Reinhold; Henke, Tino; Mattfeld, Dirk Christian; Oberweis, Andreas et al. (2019): Towards customer-induced service orchestration - re- quirements for the next step of customer orientation. In: Electron Markets 29 (1), 79–91. 9. Barrett C., Bisset K., Holzer M., Konjevod G., Marathe M., Wagner D. (2008) Engineer- ing Label-Constrained Shortest-Path Algorithms. In: Fleischer R., Xu J. (eds) Algorithmic Aspects in Information and Management. AAIM 2008. Lecture Notes in Computer Sci- ence, vol 5034. Springer, Berlin, Heidelberg. 10. Horstmannshoff, Thomas; Ehmke, Jan Fabian (2020): Creation of Individualized Sets of Multimodal Travel Itineraries. In: Proceedings of 22nd EURO Working Group on Trans- portation Meeting (EWGT), to appear.