<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Understanding Customer Choices to Improve Recommendations in the Air Travel Industry</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Alejandro Mottini Amadeus SAS Sophia Antipolis</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>France</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rodrigo Acuna-Agost Amadeus SAS Sophia Antipolis</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>France</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maria A. Zuluaga Amadeus SAS Sophia Antipolis</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>France</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Alix Lhéritier Amadeus SAS Sophia Antipolis</institution>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <fpage>28</fpage>
      <lpage>32</lpage>
      <abstract>
        <p>Recommender systems aim at suggesting relevant items to users to support them in various decision-making processes, on the basis of available information on items or users. In the latter, the customer's interests and tastes can be learnt and expressed using historical browsing data, purchase histories, and even other nontraditional data sources such as social networks. Despite its proven success in the on-line retailing industry, in electronic commerce and, even tourism, recommender systems have been less popular in flight itinerary selection processes. This could be partially explained by the fact that customers' interests are only expressed as a lfight search request. As a result, this problem has been historically tackled using classical Discrete Choice Modelling techniques and, more recently, through the use of data-driven approaches such as Machine and Deep Learning techniques. At Amadeus, we are interested in the use of choice models with recommender systems for the problem of airline itinerary selection. This work presents a benchmark on three family of methods to identify which is the most suitable for the problem we tackle.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        In the recent years, recommender systems (RecSys) have proven
invaluable for solving problems in the on-line retail industry and
e-commerce[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. While tourism has not been the exception to this
success [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], with applications covering almost every area of the
travel and hospitality industry [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], RecSys have been less
popular on the airline itinerary decision-making process. This can be
explained by two factors. On one hand, the available information
about users and items is not as rich as for most RecSys in tourism. In
the traveller’s flight itinerary choice problem, i.e. the task of
selecting a flight given a proposed list of itinerary recommendations, the
user’s interests are only expressed as a flight search request, user
sessions are usually anonymous and there is no user history in the
travel provider’s databases. Therefore, classical RecSys algorithms
cannot be applied directly.
      </p>
      <p>
        On the other hand, RecSys techniques sufer from a lack of
theoretical understanding of the underlying behavioural process that
led to a particular choice [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] by seeing the decision-making process
as a black box [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Collaborative and content-based methods
recommend items based on similarities among users or items but, cannot
provide further insight. In the flight industry, it is key to
understanding passenger behaviour and their flight itinerary preferences.
Players in the sector use this knowledge to adapt their ofers to
market conditions and customer needs, thus having an impact on
airline’s revenue management and price optimisation systems [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        To tackle the flight itinerary choice problem and overcome these
limitations, the airline industry has historically resorted to Discrete
Choice Modeling (CM). Due to its good performance, eficiency and
ease of interpretation, the Multinomial Logit model (MNL) [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], a
specific CM technique is the most popular approach for the flight
itinerary choice problem. In spite of its numerous advantages, CM
also presents some weaknesses. For instance, MNL only considers
linear combinations of the input features, limiting its predictive
capability and requiring expert knowledge to perform feature
engineering. Also, they lack the flexibility to handle collinear attributes
and correlations between options and it is dificult to model
individual’s heterogeneities. These shortcomings might be overly
restrictive or afect performance [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. As an example, industrial
applications require to develop diferent models for distinct markets.
In the case of the flight itinerary choice prediction problem, this
involves estimating models at a city-pair level [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and/or customer
demographic segments [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ].
      </p>
      <p>In an efort to cope with CM limitations, recently machine
learning and deep learning techniques have been proposed. These
algorithms can more easily model non-linear relationships and handle
correlated features, and have more modelling power which allows
to predict choices on an individual level, thus improving the
prediction performance.</p>
      <p>
        Inspired by the work from Chaptini [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], at Amadeus we are
working towards the use of CM with recommender systems for
the problem of airline itinerary selection. Combining the two
approaches should leverage the strengths of both, leading to robust
and scalable, but more interpretable models. In this first work, we
seek to explore, evaluate and compare three diferent CM models
which can be used as the predictive back-bone of a choice-based
RecSys framework. In the remainder of this paper, first we present
the theoretical background of CM and demonstrate why CM can
be seen as a RecSys problem. Then, we present our
experimental setup by describing the data, the evaluated algorithms and the
performance measures.
      </p>
    </sec>
    <sec id="sec-2">
      <title>BACKGROUND</title>
      <p>In this section, first we provide a brief background on classical
discrete choice modelling theory and then show how it is equivalent
to the recommendation problem.
2.1</p>
    </sec>
    <sec id="sec-3">
      <title>Discrete Choice Models.</title>
      <p>
        CM defines four basic components: 1) the decision-maker, 2) the
alternatives, 3) the attributes, and 4) the decision rules [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
Formally stated, a decision-maker i ∈ I chooses from a choice set Ai
composed of Ji alternatives, with with j ∈ {1, . . . , Ji } the index
of the jth alternative. For the sake of simplicity and without loss
of generality, we will refer to the number of alternatives simply
as J , although decision-makers might not be faced with the same
set and/or number of alternatives. The decision-maker i obtains an
utility Ui j from each j and chooses alternative jˆ if and only if:
(1)
(2)
(3)
(4)
(5)
      </p>
      <p>Vi j = V (Xi j ),
where Vi j is referred to as the representative utility and Xi j =
h(xi j , Si ), a simplified representation of xi j and Si through the use
of any appropriate vector valued function h. Vi j is generally a linear
combination of the features. For example, if an airline is trying to
predict which itinerary a user will choose, a very simple model
could be:</p>
      <p>Vi j = a ∗ pricei j + b ∗ tripDurationi j
with a, b parameters of the model to be estimated, and which are
commonly refered to as β .</p>
      <p>Since there are aspects of the utility function that cannot be
observed, Vi j , Ui j . To reflect uncertainty, the utility can be modelled
as a random variable,</p>
      <p>Ui j = Vi j + εi j ,
where εi j is a random variable that captures the unknown
factors that afect Ui j . As Ui j is now a random variable, the decision
rule needs to be expressed as the probability that decision-maker i
chooses the kth alternative:</p>
      <p>P (k |Ai ) = P (Uik ≥ Ui j ; ∀j ∈ Ai ).</p>
      <p>By replacing Ui j accordingly:</p>
      <p>P (k |Ai ) = P (Vik − Vi j ≥ εi j − εik ; ∀j ∈ Ai ).</p>
      <p>Diferent assumptions about the random term εi j and the
deterministic term Vi j produce specific models.
2.2</p>
    </sec>
    <sec id="sec-4">
      <title>Choice-based Recommender Systems.</title>
      <p>Given a set Ai of J available items presented to a user i, the
recommender problem can be seen as an optimisation task that first
estimates the utility of each item j ∈ Ai , and then chooses the item</p>
      <p>Ui , jˆ ≥ Ui j ; ∀ j ∈ Ai .</p>
      <p>
        The utility function is unknown and not observable. However, as
it is possible to determine the attributes xi j perceived by
decisionmaker i for each j, as well as Si the vector of characteristics of i,
there exists a function V (·) which relates the observed features to
the decision-maker’s utility:
jˆ that maximizes an utility function U (i, j), representing the user’s
utility on any item j [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]:
(6)
(7)
jˆ = arg max U (i, j).
      </p>
      <p>j ∈ Ai</p>
      <p>
        Conceptually this is the same optimisation problem as that one
formulated by choice theory [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], and described previously in this
section. Equation (6) is equivalent to choosing the alternative with
the highest utility for a decision-maker, in choice modelling theory.
More formally:
jˆ = arg max U (i, j) ⇔ Ui jˆ ≥ Ui j ; ∀ j ∈ Ai ,
      </p>
      <p>j ∈ Ai
which implies that the recommendation problem can be seen as a
choice prediction problem. Therefore, the models and techniques
developed in CM can be applied to RecSys.
Experiments were conducted on real datasets of flight search logs
and bookings from MIDT, an Amadeus database containing
bookings from over 93000 travel agencies.</p>
      <p>Bookings are stored using Personal Name Records (PNR), which
are created at reservation time by airlines or other air travel providers,
and are then stored in the airline’s or Global Distribution System
(GDS) data centers. PNRs contain the travel itinerary of the
passenger, personal and payment information, and/or additional ancillary
services sold with the ticket. As these only contain information
about the purchased ticket (final choice), and not about the
alternatives considered before the purchase, we must also consider flight
search logs. These contain both itinerary requests (origin,
destination and dates), and the diferent alternatives presented to the
passenger.</p>
      <p>Both data sources are combined into a final dataset containing the
alternatives presented to each user and their final choice (Figure 1).
The matching process is in itself a challenging problem due to
the high volume of data (i.e., around 100 GB of daily search logs)
and to the diference in data sources and formats. Moreover, the
process cannot be perfectly accurate since there is not a direct link
between the two data sources and booking/search times difer. An
approximate matching is performed using data fields which are
shared between booking and logs (i.e. origin, destination, time and
booking agency).</p>
      <p>The choice set presented to a user, which we denote a session,
contains up to 50 itineraries. The features used for each alternative
are summarized in Table 1. The considered dataset contains 33951
sessions split into training/tests sets.</p>
      <p>
        3.2.1 Classical CM. Two classical CM approaches are
considered: The Multinomial Logit (MNL) model [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], perhaps the most
common CM model, and Latent class choice models (LCM) [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
McFadden [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] demonstrated that if εi j is an i.i.d. Gumbel
random variable, the probability that a decision-maker i chooses the
where Q is the number of latent classes, βq are the choice model
parameters specific to class q, Aq is the choice set specific to class
q, θ is an unknown parameter vector, and Xi j the simplified vector
representation of attributes of alternatives and characteristics of
decision-maker i.
      </p>
      <p>Finally, both MNL and LCM models are optimized using
maximum likelihood estimation as they can not be solved in a closed
form.</p>
      <p>
        3.2.2 ML. Lheritier et al. a have proposed machine-learning
based CM (ML) [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] technique which formulates the choice
modelling problem as a supervised learning one through the use of
Random Forests (RF), a learning algorithm based on an ensemble
of decision trees. The training data consists of the set of sample
pairs T = {(Xi j , yi j )} 1, with yi j the binary indicator of whether
1In the context of RF, Xi j referred to as the feature vector of a sample
(8)
(9)
decision maker i chooses the j-th alternative. As RF assumes
independence of the samples, at training stage, every Xi j is assumed
i.i.d., even if they belong to the same decision-maker. At
prediction, each unseen alternative Xi j is propagated through the trained
forest to obtain the posterior probability of being chosen:
P (yi j |Xi j ) = T t =1
1 ÕT
      </p>
      <p>Plt (yi j (Xi j ) = 1)
(10)
where T denotes the number of trees and Plt (·) denotes the
posterior probability function of a leaf node l in tree t . However, the
alternatives associated to an individual’s session cannot be treated
as independent. There is an inherent dependence among them: only
one alternative per session can be selected. To cope with this, the
predicted probabilities are considered scores used to rank the
alternatives. More formally, the index jˆ of the selected alternative ajˆ by
decision-maker i is:
jˆ = arg max P (yi j |Xi j )</p>
      <p>
        1≤j ≤ J
3.2.3 DL. The assessed Deep learning choice modeling (DL)
method [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] is based on an encoder-decoder network architecture
using a modified pointer-network mechanism [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. As with ML,
the model is trained to predict the chosen alternative using a
supervised learning approach. However, DL does not break the i.i.d.
assumption among samples, as ML-based CM does. Given the
sequential nature of pointer networks, sessions are represented as
sequences of itineraries, Z = {Xi1, ..., Xi J }, which are fed
sequentially to the model. The encoder network "encodes" the input into a
hidden (encoder) state e. The decoder network will use the encoded
information to output a vector u. Finally, a softmax function use
the decoder’s output to estimate the posterior probability of being
chosen for the kth element in the input sequence Z :
(11)
P (yk = 1|Z ) = ÍJ
j=1 exp(uj )
exp(uk )
with uk = dT W1ek , the pointer vector to the kth element of Z , ek
the kth encoder state, d = tanh(W2e J ) the decoder, W1, W2 learnable
parameters and yk the binary indicator of whether k was chosen
(yk = 1) or not. P (yk = 1|Z ) can be interpreted as an estimate of
P (k |Ai ).
      </p>
    </sec>
    <sec id="sec-5">
      <title>3.3 Performance measurement</title>
      <p>
        We used Top-N accuracy to asses and compare the models. Top-N
accuracy evaluates if the user’s choice is among the top-N predicted
alternatives. It is equivalent to the commonly used top-N error in
image classification [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], as it can be formulated in terms of the
latter as:
      </p>
      <p>accuracy = 1 − error</p>
    </sec>
    <sec id="sec-6">
      <title>4 RESULTS</title>
      <p>Figure 2 presents the Top-N accuracy for MNL, ML and DL methods.
Overall, DL presents the highest accuracies across all values of N.
These results are confirmed, in more detail, in Table 2 where Top-1,
5 and 15 accuracies are detailed. Top-15 accuracy has a particular
importance for ranking flight search recommendations since most
websites show approximately 15 results per page.</p>
      <p>To simulate data segmentation, a second experiment was
performed in a simplified subset containing a single origin-destination
(O&amp;D) pair chosen at random. This resulted in 1617 decision-makers
(users) with an associated booking to the O&amp;D. The Top-N accuracy
curve (Figure 2 dashed lines) shows how the diference in
performance between the methods is less significant w.r.t. that one using
the full data set. Despite MNL being the simplest method, results
show that, on simpler datasets, it is able to perform as well as more
complex methods.</p>
      <p>This behaviour explains the motivation behind dataset
pre-segmentation often used in classical CM. This is further confirmed by
investigating the performance of LCM, as a function of the number
of latent classes Q. Figure 3 reports top-1 accuracy of LCM, ML and
DL, and demonstrates how it is possible to increase classic CM
accuracy in complex data through a good estimation of Q. While MNL
reported accuracies lower than ML and DM, LCM can outperform
them when Q is estimated correctly. This improvements comes,
however, at some cost: LCM requires additional hand engineered
features to achieve the segmentation and a good choice of Q.</p>
      <p>Although ML and MNL are not as accurate as DL, they have
the advantage of having less hyper-parameters to tune. Moreover,
they are more interpretable than DL. ML methods based on RF are
known for their capacity to provide information on feature
importance (Figure 4). This type of information can help to understand
the rationale behind the decision-maker’s choices, which can be
important for some applications in the air travel industry.
5</p>
    </sec>
    <sec id="sec-7">
      <title>FINAL REMARKS</title>
      <p>
        RecSys research has so far predominantly focused on optimizing the
algorithms used for generating recommendations to increase
precision [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Precision measures how well the suggested alternatives
match a decision-maker’s profile based on previous data. While this
is an important criterion, its limited assessment of a recommender
quality has been criticized for not taking the decision-makers’
situational needs into account [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Due to their well-known readability,
Discrete Choice Modelling appears as a natural alternative to
overcome this current limitation of RecSys. However, despite CM being
a well-studied problem in various fields of research, literature on
its use with recommender systems is very scarce. Existing works
have adopted classical CM in combination RecSys [
        <xref ref-type="bibr" rid="ref17 ref6">6, 17</xref>
        ], while
suggesting CM as a promising paradigm in the field of RecSys.
      </p>
      <p>
        However, classical choice models tend to sufer from scalability
issues as expert knowledge is usually required for model
optimisation. ML- and DL-based [
        <xref ref-type="bibr" rid="ref10 ref13">10, 13</xref>
        ] choice models are non-parametric
approaches that overcome this limitation, easing the deployment of
choice-based RecSys at large scale. On the down side, model
readability can diminish. Although this might not be relevant for some
applications, understanding the reasons behind a decision-maker’s
choice is of high relevance in the air travel industry. ML-based
methods appear to be a suitable compromise into readability but,
they make strong assumptions on the independence of data that is
arguable. Overall, it is possible to say that there is no ideal method
and that the selection of one might depend on the specific
recommendation application that they target. As a guideline, Table 3
summarises the strengths and pitfalls of the diferent methods here
evaluated when considering choice-based RecSys.
      </p>
      <p>At Amadeus, we work towards the development of informative,
readable and interpretable RecSys that suit the needs of the air
travel industry. Our hypothesis is that the combination of discrete
choice modeling with RecSys can provide improvements to current
systems in the air travel industry by keeping readability while
improving performance. In that sense, an ML method like the random
• Simple and interpretable
• Accurate on simple cases
• Interpretable
• Accurate
• Suitable for big data
• Handles non-linear and latent relationships
• No assumptions on data
• Highly accurate
• Suitable for big data
• Handles non-linear and latent relationships
forests evaluated here represents a good compromise and a
promising path to pursue in what we are looking for. On one hand, the
method provides information on the relevance of features. On the
other one it avoids the limitations of classical CM models. In that
sense, although DL approaches have higher accuracy, they are not
as advantageous given their limited interpretability.</p>
      <p>This work represents an initial benchmark that evaluates three
families of CM methods in the context of flight itinerary
selection/recommmendation. Our future work will focus in the
development of a unified framework that can leverage the strengths of the
explored CM methods.</p>
      <p>Disadvantages
• Feature engineering is required
• Limited in handling big data
• Assumes independence of samples
• Feature engineering might be required
• Non-interpretable
• Many hyper-parameters
• Computationally expensive</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Gediminas</given-names>
            <surname>Adomavicius</surname>
          </string-name>
          and
          <string-name>
            <given-names>Alexander</given-names>
            <surname>Tuzhilin</surname>
          </string-name>
          .
          <year>2005</year>
          .
          <article-title>Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          <volume>17</volume>
          ,
          <issue>6</issue>
          (
          <year>2005</year>
          ),
          <fpage>734</fpage>
          -
          <lpage>749</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Moshe</given-names>
            <surname>Ben-Akiva</surname>
          </string-name>
          and
          <string-name>
            <given-names>Michel</given-names>
            <surname>Bierlaire</surname>
          </string-name>
          .
          <year>1985</year>
          .
          <article-title>Discrete choice analysis: theory and application to travel demand</article-title>
          . MIT press, Cambridge, MA.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Joan</given-names>
            <surname>Borràs</surname>
          </string-name>
          , Antonio Moreno, and
          <string-name>
            <given-names>Aida</given-names>
            <surname>Valls</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Intelligent tourism recommender systems: A survey</article-title>
          .
          <source>Expert Systems with Applications</source>
          <volume>41</volume>
          ,
          <issue>16</issue>
          (
          <year>2014</year>
          ),
          <fpage>7370</fpage>
          -
          <lpage>7389</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>J.</given-names>
            <surname>Broder</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Rusmevichientong</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Dynamic pricing under a general parametric choice model</article-title>
          .
          <source>Operations Research</source>
          <volume>60</volume>
          ,
          <issue>4</issue>
          (
          <year>2012</year>
          ),
          <fpage>965</fpage>
          -
          <lpage>980</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Judit</surname>
            <given-names>G Busquets</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Antony D Evans</surname>
            ,
            <given-names>and Eduardo</given-names>
          </string-name>
          <string-name>
            <surname>Alonso</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Predicting Aggregate Air Itinerary Shares Using Discrete Choice Modeling</article-title>
          .
          <source>In 16th AIAA Aviation Technology, Integration, and Operations Conference</source>
          .
          <volume>4076</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Bassam</surname>
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Chaptini</surname>
          </string-name>
          .
          <year>2005</year>
          .
          <article-title>Use of discrete choice models with recommender systems</article-title>
          .
          <source>Ph.D. Dissertation</source>
          . Massachusetts Institute of Technology.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Li</given-names>
            <surname>Chen</surname>
          </string-name>
          , Marco de Gemmis, Alexander Felfernig, Pasquale Lops, Francesco Ricci, and
          <string-name>
            <given-names>Giovanni</given-names>
            <surname>Semeraro</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Human Decision Making and Recommender Systems</article-title>
          .
          <source>ACM Transactions on Interactive Intelligent Systems</source>
          <volume>3</volume>
          ,
          <issue>3</issue>
          (
          <year>2013</year>
          ),
          <fpage>1</fpage>
          -
          <lpage>7</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>William</surname>
            <given-names>H</given-names>
          </string-name>
          <string-name>
            <surname>Greene and David A Hensher</surname>
          </string-name>
          .
          <year>2003</year>
          .
          <article-title>A latent class model for discrete choice analysis: contrasts with mixed logit</article-title>
          .
          <source>Transportation Research Part B: Methodological</source>
          <volume>37</volume>
          ,
          <issue>8</issue>
          (
          <year>2003</year>
          ),
          <fpage>681</fpage>
          -
          <lpage>698</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Joseph</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Konstan</surname>
          </string-name>
          and John Riedl.
          <year>2012</year>
          .
          <article-title>Recommender systems: from algorithms to user experience</article-title>
          .
          <source>User Modeling and User-Adapted Interaction 22</source>
          ,
          <fpage>1</fpage>
          -
          <lpage>2</lpage>
          (
          <year>2012</year>
          ),
          <fpage>101</fpage>
          -
          <lpage>123</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Alix</surname>
            <given-names>Lhéritier</given-names>
          </string-name>
          , Michael Bocamazo, Thierry Delahaye, and
          <string-name>
            <surname>Rodrigo</surname>
          </string-name>
          Acuna-Agost.
          <year>2018</year>
          .
          <article-title>Airline Itinerary Choice Modeling using Machine Learning</article-title>
          .
          <source>International Journal of Choice Modeling</source>
          (
          <year>2018</year>
          ), https://doi.org/10.1016/j.jocm.
          <year>2018</year>
          .
          <volume>02</volume>
          .002.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Daniel</given-names>
            <surname>McFadden</surname>
          </string-name>
          .
          <year>1973</year>
          .
          <article-title>Conditional Logit Analysis of Qualitative Choice Behaviour</article-title>
          . In Frontiers in Econometrics, P. Zarembka (Ed.). Academic Press New York, New York, NY, USA,
          <fpage>105</fpage>
          -
          <lpage>142</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>D.</given-names>
            <surname>McFadden</surname>
          </string-name>
          .
          <year>2001</year>
          .
          <article-title>Economic choices</article-title>
          .
          <source>The American Economic Review</source>
          <volume>91</volume>
          ,
          <issue>3</issue>
          (
          <year>2001</year>
          ),
          <fpage>351</fpage>
          -
          <lpage>378</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>Alejandro</given-names>
            <surname>Mottini</surname>
          </string-name>
          and
          <string-name>
            <given-names>Rodrigo</given-names>
            <surname>Acuna-Agost</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Deep Choice Model Using Pointer Networks for Airline Itinerary Predictions</article-title>
          .
          <source>In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM</source>
          , New York, NY, USA,
          <fpage>1575</fpage>
          -
          <lpage>1583</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Julia</surname>
            <given-names>Neidhardt</given-names>
          </string-name>
          , Tsvi Kuflik, and Wolfgang WÃűrndl.
          <year>2018</year>
          .
          <article-title>Special section on recommender systems in tourism</article-title>
          .
          <source>Information Technology &amp; Tourism</source>
          <volume>19</volume>
          ,
          <fpage>1</fpage>
          -
          <lpage>4</lpage>
          (
          <year>2018</year>
          ),
          <fpage>83</fpage>
          -
          <lpage>85</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Francesco</surname>
            <given-names>Ricci</given-names>
          </string-name>
          , Lior Rokach, Bracha Shapira, and Paul B.
          <source>Kantor (Eds.)</source>
          .
          <year>2011</year>
          . Recommender Systems Handbook. Springer Nature, New York.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Olga</surname>
            <given-names>Russakovsky</given-names>
          </string-name>
          , Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein,
          <string-name>
            <surname>Alexander C. Berg</surname>
          </string-name>
          , and
          <string-name>
            <surname>Li</surname>
          </string-name>
          Fei-Fei.
          <year>2015</year>
          .
          <article-title>ImageNet Large Scale Visual Recognition Challenge</article-title>
          .
          <source>International Journal of Computer Vision</source>
          <volume>115</volume>
          ,
          <issue>3</issue>
          (
          <year>2015</year>
          ),
          <fpage>211</fpage>
          -
          <lpage>252</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Paula</surname>
            <given-names>Saavedra</given-names>
          </string-name>
          , Pablo Barreiro, Roi Duran, Rosa Crujeiras, María Loureiro, and Eduardo Sánchez Vila.
          <year>2016</year>
          .
          <article-title>Choice-Based Recommender Systems</article-title>
          . In RecTour@ RecSys - Workshop on Recommenders in
          <article-title>Tourism held in conjunction with the 10th ACM Conference on Recommender Systems (RecSys)</article-title>
          .
          <source>CEUR Workshop Proceedings</source>
          , Boston, MA, USA,
          <fpage>38</fpage>
          -
          <lpage>46</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>O.</given-names>
            <surname>Vinyals</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fortunato</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Jaitly</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Pointer networks</article-title>
          .
          <source>In Advances in Neural Information Processing Systems (NIPS</source>
          <year>2015</year>
          ). Curran Associates, Inc., Montreal, Canada,
          <fpage>2692</fpage>
          -
          <lpage>2700</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>V.</given-names>
            <surname>Warburg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bhat</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Adler</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>Modeling demographic and unobserved heterogeneity in air passengersâĂŹ sensitivity to service attributes in itinerary choice</article-title>
          .
          <source>Journal of the Transportation Research Board</source>
          <year>1951</year>
          (
          <year>2006</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>