<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Ski Resorts Recommendation using Deep Neural Networks</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>YASMINE SERDOUK</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>TIMOTHÉE COUBLE</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>ERIC COUBLE</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>CÉDRIC MARCONE</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>iso - e-Commerce Web Agency</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>France</string-name>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <abstract>
        <p>In this work, we present a recommender system based on deep neural networks for a novel application that predicts suitable ski resorts for a web agency customers. A special focus is paid to the features construction where they are created from historical heterogeneous data according to a user-friendly questionnaire. The evaluation is performed through the accuracy metric and the obtained results show a promising performance with a TOP-10 accuracy that reaches 98.95% and 99.30% for French and non-French customers, respectively. CCS Concepts: • Computing methodologies → Artificial intelligence ; Machine learning approaches. Additional Key Words and Phrases: neural networks, deep learning, recommender systems, ski resorts Traditionally, Recommender Systems (RS) are designed to predict missing ratings and provide accurate top-N recommendation for queries. However, traditional techniques only use user ratings for the diferent items and can not exploit feature vectors that can better capture the complex relations between the user and items. Thus, deep learning has been changing dramatically recommendation architectures lately [6, 8] with MultiLayer Perceptron (MLP) [2, 7], AutoEncoder (AE) [1, 13], Convolutional Neural Networks (CNN) [9] and other hybrid methods [11, 14]. This work proposes a novel RS deep learning based for a web business company in order to provide their customers with the best online experience. The web company, called Valraiso makes commercial websites for the French Ski School (ESF: École de Ski Françaises) and this work aims at recommending suitable Ski resorts for the customers visiting these websites. In other words, we want to develop an online interface that communicates with clients to determine their needs and collect data, such as the budget, the customer's location, etc. Then, on the basis of these data and the history of other previously collected orders, we build a model through deep learning techniques in order to recommend suitable ski resorts. In the literature, some works related to tourism have been explored like in [4] where authors analyse ski-lift transportations data to shape the decision making process in order to improve safety in ski resorts. Another interesting work treats about RS for the travel and hospitality industries with a skimatcher application [3]. Also, a case study was carried out in [5] about a constraint-based RS that was integrated into a travel advisory system for an Austrian spa resort. The rest of this paper is organized as follows. Section 2 presents the dataset that we created and the process we went through for collecting and cleaning it. Section 3 is for to the experimental results. Finally, main conclusions and future works are drawn in Section 4.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 INTRODUCTION</title>
    </sec>
    <sec id="sec-2">
      <title>DATASET CONSTRUCTION</title>
      <p>The used dataset is named SRESP for "Ski RESorts Prediction". Initially, the Valraiso company has archived various
heterogeneous data since its creation. In order to know which data to select, we first define the questionnaire.</p>
    </sec>
    <sec id="sec-3">
      <title>Define the questions</title>
      <p>In a real time context, a questionnaire interacts with the customer and asks few questions in order to recover his needs
and desires. From the customer’s responses, we reconstruct a feature vector that will be introduced into the prediction</p>
      <p>
        (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) What kind of skier(s) are you ? (In the case of a group or a family, please specify the average level)
      </p>
      <p>
        Beginner, Intermediate, Confirmed, or Expert
(
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) What type of ski area suits you best ? Small, Medium, Large, or Very Large resorts,
(
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) How high would you like to ski ? Low, Average, High, or Very high altitude,
(
        <xref ref-type="bibr" rid="ref4">4</xref>
        ) What week do you plan to go ?
(
        <xref ref-type="bibr" rid="ref5">5</xref>
        ) What is your budget level for this trip ? Low, Average, or No limit cost
(
        <xref ref-type="bibr" rid="ref6">6</xref>
        ) Where do you live ?
(
        <xref ref-type="bibr" rid="ref7">7</xref>
        ) How old are your children ? Under 6 years old, Between 6 and 12 years old, More than 12 years old, or No children
2.2
      </p>
    </sec>
    <sec id="sec-4">
      <title>Data Selection and Creation</title>
      <p>From the archived data of Valraiso, we select the most relevant features that can contribute to the choice of a ski resort
and which can be used to reconstruct the responses of the defined questionnaire. The dataset is organized, such that
each row is a client order for a specific ski resort while the columns represent the features. The created features are:
2.2.1</p>
      <p>Group. This feature regroup four information about the nature of the group with diferent numeric values.
• The existence of at least one child under 6 years old: numeric value 3
• The existence of at least one child aged between 6 and 12: numeric value 5
• The existence of children in both age categories, which means aged under 12: numeric value 8
• Or no children within the order, which implies only teenagers or adults in the order: numeric value 0
2.2.2 Budget. As most orders are for at least one week, we include the weekly pass fees and the total cost in euros
within this feature. Also, as the daily_pass information is for one person (an adult), it is multiplied by the number of
people included in the order.  = (5 × _ × (_)) +</p>
      <p>Since the questionnaire allows only few categories of answers, feature values are categorized.</p>
      <p>• Low cost category ∈]0, 300]€ having a numeric value 0, Average cost category ∈]300, 400]€ having a numeric
value 1, and a No limit cost category ∈]400, +∞[€ having numeric value 2.</p>
      <p>The range of each category of the selected features is chosen such that the data are fairly distributed over each group.
Each category is converted into a unique numeric value.
2.2.3 Delta_week. The delta between the stay week and the validation week (i.e. the week the order was placed) is
important and can influence greatly the choice of the ski resort. _ = _ − _
2.2.4 stay_week. In addition to the delta order, the stay week has an important influence on the choice of the resort
2.2.5 altitude_max. Some customers like to ski at very high altitude while others (that have children for instance)
would choose resorts with a low or an average altitude. Four categories representing each of:
• Low altitudes ∈]0, 2100] meters having a numeric value 0, Average altitudes ∈]2100, 2550] meters having
a numeric value 1, High altitudes ∈]2550, 2800] meters having a numeric value 2, Very high altitudes ∈
]2800, +∞[ meters having a numeric value 3
2.2.6 slopes_dificulty . This feature regroup the number of green (nb_g), blue (nb_b), red (nb_r) and black (nb_black)
ski slopes to create a new feature that gives a notion of the resort’s dificulty and though, infer the average level of
skiers in the order. Thus, it is calculated so as to slightly weight nb_g and nb_b and to heavily weight nb_r and nb_black
as follows. _ = (0.1 × _) + (0.2 × _) + (0.6 × _ ) + _</p>
      <p>Then, four categories are created.</p>
      <p>• Beginner level having a slopes_d between ]0, 6.2] with a numeric value 0, Intermediate level having a slopes_d
between ]6.2, 14.7] with a numeric value 1, Confirmed level having a slopes_d between ]14.7, 22.4] with a
numeric value 2, Expert level having a slopes_d between ]22.4, +∞[ with a numeric value 3,
2.2.7</p>
      <p>km_slopes. This feature represents the size of the ski area of the resort in kilometers.
• Small ski resorts having km_slopes between ]0, 70] kilometers with a numeric value 0, Medium ski resorts
having km_slopes between ]70, 146] kilometers with a numeric value 1, Large ski resorts having km_slopes
between ]146, 160] kilometers with a numeric value 2, Very large ski resorts having km_slopes between
]160, 250] kilometers with a numeric value 3
2.2.8 country_id. The country from which the order has been made is converted into a numeric value. The dataset
contains 174 diferent countries, the numeric values vary from 0 to 173.
2.2.9 dep. This feature is only dedicated to customers who ordered from France. Indeed, as more than 85% of the
orders in the dataset are made from French customers and according to the market experts, the French and non French
customer’s behaviours are diferent, we decided to build two models. One for the French customers and the other for
the remainder countries. Therefore, dep depicts for the department of French customers.
2.3</p>
    </sec>
    <sec id="sec-5">
      <title>SRESP construction</title>
      <p>Note that two diferent inputs are presented into the machine learning. The first is for French and the second for
non-French customers.</p>
      <p>[, _,  _, ,  _, _   , _, ],
[, _,  _, ,  _, _   , _, _],</p>
      <p>On the other hand, Table 1 details the composition of the dataset. It can be downloaded from [12]. We split SRESP
into two subsets: training and test. As our dataset has unbalanced classes, the train-test split is made according to the
number of samples on each class, i.e. 70% of the samples of each class are used for training while the rest are for the
evaluation. Finally, as the feature values have widely diferent ranges, a min-max scaling is done in the range of [0-1]
for a faster convergence [10].</p>
    </sec>
    <sec id="sec-6">
      <title>3 EXPERIMENTAL RESULTS</title>
      <p>In this work, the parameters are selected through a 5-fold cross validation (5 diferent train-test splits). On the other
hand, note that reducing the feature inputs, reduces the information variability and complexity to be introduced into the
model. Also, categorizing the feature values, further reduces the variability within the values of our inputs. Therefore,
the challenge is to have a strong architecture that can deal with such inputs. Finally, our problem can be summed up
into a supervised classification problem that aims to predict resorts for new customers. The evaluation is carried out by
computing the accuracy metric that assesses overall, how often our model is correct. It is presented for both French and
non-French models.</p>
      <p>After many experiments, the selected architecture has four hidden layers with 550 nodes each. A ReLU function is
placed after each layer, except for the last layer where a softmax function is employed. Then, a batch normalization is
employed to normalize each layer’s inputs. Lastly, a dropout layer is added to prevent over-fitting by ignoring 25% of
neurons during training. The training is performed through 1000 epochs for the French data and only 100 epochs for
the non-French data. The Adam Optimizer is used with a batch size of 6360 and an initial learning rate of 0.003 that is
decreased using the exponential decay:  = 0 ∗  (/) , where 0 is the initial learning rate,  is the current step,
and ,  are two parameters set to  = 0.96 and  = 90000.</p>
      <p>The DNN results are presented in Table 2 for French and non-French models. Furthermore, since we want to
recommend more than one ski resort, we present the accuracy results from TOP-1 to TOP-10. Note that the TOP-n
accuracy corresponds to the accuracy where the true class matches with any one of the n most probable classes
predicted by the model. Obviously, the accuracy increases through the TOP results. The results show a promising
TOP-10 performance with 98.95% and 99.30%, respectively. Another thing to note is that the non-French accuracies are
always higher than those with French data. This can be explained by the fact that in the non-French input vector we
have the country_id feature instead of the department feature in the French input. Indeed, since we have 171 countries
and only 99 diferent French departments, we think that the countries variability helps more in discriminating between
the diferent classes (i.e. resorts). Also, one can note that the TOP-1 accuracy remains rather low, which can be explained
by the constraint of the questionnaire which allows little variability in the data.</p>
    </sec>
    <sec id="sec-7">
      <title>4 CONCLUSION</title>
      <p>In this paper, the proposed system learns various feature vectors that describe important information specific to both
customers and target resorts. The obtained results showed promising results using the proposed architecture of DNN
with a TOP-10 accuracy of 98.95% and 99.30% for French and non-French data, respectively. As further improvements,
we plan to make the customer online experience better by recommending through a chat bot a full stay in the resort.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Bing</given-names>
            <surname>Bai</surname>
          </string-name>
          , Yushun Fan, Wei Tan,
          <string-name>
            <given-names>and Jia</given-names>
            <surname>Zhang</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>DLTSR: A Deep Learning Framework for Recommendations of Long-Tail Web Services</article-title>
          .
          <source>IEEE Transactions on Services Computing</source>
          <volume>13</volume>
          ,
          <issue>1</issue>
          (Feb.
          <year>2020</year>
          ),
          <fpage>73</fpage>
          -
          <lpage>85</lpage>
          . https://doi.org/10.1109/TSC.
          <year>2017</year>
          .2681666
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Paul</given-names>
            <surname>Covington</surname>
          </string-name>
          , Jay Adams, and
          <string-name>
            <given-names>Emre</given-names>
            <surname>Sargin</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Deep Neural Networks for YouTube Recommendations</article-title>
          .
          <source>In Proceedings of the 10th ACM Conference on Recommender Systems (RecSys '16)</source>
          .
          <article-title>Association for Computing Machinery</article-title>
          , Boston, USA,
          <fpage>191</fpage>
          -
          <lpage>198</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Joaquin</given-names>
            <surname>Delgado</surname>
          </string-name>
          and Richard Davidson.
          <year>2002</year>
          .
          <article-title>Knowledge Bases and User Profiling in Travel and Hospitality Recommender Systems</article-title>
          .
          <source>In Proceedings of the 9th International Conference on Information Technologies in Tourism (ENTER</source>
          <year>2002</year>
          ). springer, Innsbruck-Austria,
          <fpage>1</fpage>
          -
          <lpage>16</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Boris</given-names>
            <surname>Delibasic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Radovanović</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jovanovic</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Milija</given-names>
            <surname>Suknovic</surname>
          </string-name>
          .
          <year>2020</year>
          .
          <article-title>Improving Decision-Making in Ski Resorts by Analysing Ski Lift Transportation-A Review</article-title>
          .
          <source>In Advances in Operational Research in the Balkans</source>
          . Springer,
          <fpage>265</fpage>
          -
          <lpage>273</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Jannach</given-names>
            <surname>Dietmar</surname>
          </string-name>
          , Zanker Markus, and
          <string-name>
            <given-names>Fuchs</given-names>
            <surname>Matthias</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Constraint-Based Recommendation in Tourism: A Multiperspective Case Study</article-title>
          .
          <source>Information Technology and Tourism</source>
          <volume>11</volume>
          (
          <year>2009</year>
          ),
          <fpage>139</fpage>
          -
          <lpage>155</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Travis</given-names>
            <surname>Ebesu</surname>
          </string-name>
          and
          <string-name>
            <given-names>Yi</given-names>
            <surname>Fang</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Neural Citation Network for Context-Aware Citation Recommendation</article-title>
          .
          <source>In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '17)</source>
          .
          <article-title>Association for Computing Machinery</article-title>
          , Tokyo, Japan,
          <fpage>1093</fpage>
          -
          <lpage>1096</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Jianfeng</given-names>
            <surname>Gao</surname>
          </string-name>
          , Patrick Pantel, Michael Gamon,
          <source>Xiaodong He, and li Deng</source>
          .
          <year>2014</year>
          .
          <article-title>Modeling Interestingness with Deep Neural Networks</article-title>
          .
          <source>In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP '14)</source>
          .
          <article-title>Association for Computational Linguistics</article-title>
          , Doha, Qatar,
          <fpage>2</fpage>
          -
          <lpage>13</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Xiangnan</given-names>
            <surname>He</surname>
          </string-name>
          and
          <string-name>
            <surname>Tat-Seng Chua</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Neural Factorization Machines for Sparse Predictive Analytics</article-title>
          .
          <source>In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '17)</source>
          .
          <article-title>Association for Computing Machinery</article-title>
          , Tokyo, Japan,
          <fpage>355</fpage>
          -
          <lpage>364</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Donghyun</given-names>
            <surname>Kim</surname>
          </string-name>
          , Chanyoung Park, Jinoh Oh,
          <string-name>
            <given-names>Sungyoung</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and Hwanjo</given-names>
            <surname>Yu</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Convolutional Matrix Factorization for Document Context-Aware Recommendation</article-title>
          .
          <source>In Proceedings of the 10th ACM Conference on Recommender Systems (RecSys '16)</source>
          .
          <article-title>Association for Computing Machinery</article-title>
          , Boston, USA,
          <fpage>233</fpage>
          -
          <lpage>240</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <source>[10] GOPAL KRISHNA PATRO and Kishore Kumar Sahu</source>
          .
          <year>2015</year>
          .
          <article-title>Normalization: A Preprocessing Stage</article-title>
          .
          <source>International Advanced Research Journal in Science, Engineering and Technology</source>
          <volume>2</volume>
          ,
          <issue>3</issue>
          (March
          <year>2015</year>
          ),
          <fpage>20</fpage>
          -
          <lpage>22</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Maria</given-names>
            <surname>Nadia Postorino and Giuseppe M. L. Sarnè</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>A Neural Network Hybrid Recommender System</article-title>
          .
          <source>In The 20th Italian Workshop on Neural Nets</source>
          ,.
          <volume>180</volume>
          -
          <fpage>187</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <article-title>skiResortData 2021</article-title>
          .
          <article-title>Ski Resort dataset</article-title>
          .
          <source>Retrieved September 14</source>
          ,
          <year>2021</year>
          from https://www.kaggle.com/serdoukyasmine/data-frenchskiresorts
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Florian</surname>
            <given-names>Strub</given-names>
          </string-name>
          , Romaric Gaudel, and
          <string-name>
            <given-names>Jérémie</given-names>
            <surname>Mary</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Hybrid recommender system based on autoencoders</article-title>
          .
          <source>In Proceedings of the 1st Workshop on Deep Learning for Recommender System (DLRS '16)</source>
          .
          <article-title>Association for Computing Machinery</article-title>
          , Boston, USA,
          <fpage>11</fpage>
          -
          <lpage>16</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Muhammet</surname>
            <given-names>Çakır</given-names>
          </string-name>
          , Şule Gündüz Öğüdücü, and
          <string-name>
            <given-names>Resul</given-names>
            <surname>Tugay</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>A Deep Hybrid Model for Recommendation Systems</article-title>
          .
          <source>In International Conference of the Italian Association for Artificial Intelligence</source>
          . Springer,
          <fpage>321</fpage>
          -
          <lpage>335</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>