<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Amit Livne</string-name>
          <email>amit.livne@booking.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Eran Fainman</string-name>
          <email>eran.fainman@booking.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Tourism recommenders</institution>
          ,
          <addr-line>Ranking, Reviews</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>on Recommender Systems</institution>
          ,
          <addr-line>Bari</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <abstract>
        <p>The ACM RecSys RecTour 2024 Challenge focuses on ranking reviews, which is an important aspect that influences users' decision-making. The goal of this challenge is to match given accommodations and users to their respective review IDs. The concept is that when a new user interacts with the booking system, we can analyze the accommodation they are viewing along with available user features (e.g., couple, country, etc.). This enables us to display reviews in an order that considers the review content with respect to the user and accommodation characteristics. To do so, Booking.com provides a unique training dataset containing 1.6 million reviews based on real anonymized bookings. Thirty teams signed up for the challenge. The top-performing teams are invited to present short papers describing their solution approaches.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Problem</title>
    </sec>
    <sec id="sec-2">
      <title>Description</title>
      <p>Our objective is to create a model that predicts the helpfulness of every review tailored to
individual users. In essence, we aim to construct a personalized helpfulness function, denoted as
 (  |  ), which evaluates the relevance of review   to user  given their context   . This function
assigns a score indicating the degree to which review  is beneficial for user  . These scores
enable us to rank reviews, ensuring that those with the highest  values are deemed most helpful
within context   .</p>
      <p>Using the number of helpful votes as the target signal inherits multiple issues. First, it
introduces a presentation bias towards the previous review ranking algorithm (usually sorted
by votes). Additionally, the signal of votes is sparse as most of the reviews are not presented
and therefore not voted. Moreover, there might be a cold-start problem where new reviews
don’t have as many votes as older reviews which might be less relevant over time. Finally, in
many cases, only the final number of votes is stored and therefore it’s not feasible to use this
signal for developing personalized review ranking models.</p>
      <p>Thus, we introduce a more feasible and novel approach for modeling personalized helpfulness
measure. We model the personalized helpfulness of a review as the likelihood that it is written
by its reviewer given the reviewer’s context. Notably, we define  such that given a user context
  , it estimates the likelihood that review   was written by the user. Formally, we optimize 
Workshop on Recommenders in Tourism (RecTour 2024), October 18th, 2024, co-located with the 18th ACM Conference
such that given that review   was written by a user with context   , it holds:
1 if  = 
 (  |  ) = { 0 if  ≠ 
(1)</p>
      <p>In this challenge the task is to match given accommodations and users to their respective
review IDs. Therefore, we provide three sets of data as follows:</p>
      <p>Users – hold information regarding users and accommodation features. Review – hold
information regarding reviews. Matches – a true label between given  _ ,   _
and    _ (only positive examples).</p>
    </sec>
    <sec id="sec-3">
      <title>2. Dataset</title>
      <p>
        The dataset we publish contains authentic user-generated reviews from 50,000 accommodations.
This includes information on the user reservation, the review and the accommodation. The
dataset consists of 2,031,914 anonymized reviews, along with guest and accommodation context.
It is based on real data, and available via the following repo1. Table 1 describes the dataset fields.
Further information regarding the creation of the dataset is presented in [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ].
      </p>
    </sec>
    <sec id="sec-4">
      <title>3. Challenge Timeline</title>
      <p>Key dates of the challenge are listed in Table 2.</p>
    </sec>
    <sec id="sec-5">
      <title>4. Submission Guidelines</title>
      <p>The submission file should include 12 columns: Accommodation ID, User ID, and the top 10
review IDs ranked according to your model’s predictions on the test set.</p>
      <p>An example of a ranked review for Accommodation ID 1 and User ID 1. The algorithm
predicted the following ranking of reviews (158, 32, … 97). Consequently, the submission file
will display it as demonstrated by Table 3.</p>
      <p>The top 10 teams will be invited to submit short papers (up to 4 pages + references). The
papers will include the team and the authors names, an abstract, sections describing the method
and results, a link to their code repository and a reference to the Booking.com challenge in the
following format: Amit Livne, and Eran Fainman. 2024. Booking.com RecSys RecTour 2024
Challenge. http://www.bookingchallenge.com , In Workshop on Recommenders in Tourism
(RecTour 2024), October 18th, 2024, co-located with the 18th ACM Conference on Recommender
Systems, Bari, Italy.</p>
      <p>Selected papers are expected to present their work in the workshop. The submitted papers
will be evaluated based on their clarity, novelty, and results presentation.</p>
      <p>Please contact rectour2024challenge@booking.com for any questions.</p>
      <p>1https://huggingface.co/datasets/Booking-com/accommodation-reviews</p>
    </sec>
    <sec id="sec-6">
      <title>5. Evaluation Criteria and Leaderboard</title>
      <p>The goal of the challenge is to predict (and recommend) the review ID of each Accommodation
and User IDs pair. The quality of the predictions is evaluated based on the MRR@10. The online
results are available via the leaderboard2</p>
      <p>2https://huggingface.co/spaces/Booking-com/rectour24-review-ranking-leaderboard-test</p>
    </sec>
    <sec id="sec-7">
      <title>6. Results</title>
      <p>60 participants have signed up for the challenge. After two months of a contest, 10 of them
applied a final submission. Top 4 performing teams are listed in Table 4.</p>
      <p>The best performing team achieved MRR@10 of 0.1662. The teams have submitted short
papers and code repositories with a detailed description of their solution methodology.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>R.</given-names>
            <surname>Igebaria</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Fainman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mizrachi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Beladev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Enhancing travel decision-making: A contrastive learning approach for personalized review rankings in accommodations</article-title>
          ,
          <source>arXiv preprint arXiv:2407.00787</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>F.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Beladev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Kleinfeld</surname>
          </string-name>
          , E. Frayerman,
          <string-name>
            <given-names>T.</given-names>
            <surname>Shachar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Fainman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. L.</given-names>
            <surname>Assaraf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mizrachi</surname>
          </string-name>
          ,
          <string-name>
            <surname>B. Wang,</surname>
          </string-name>
          <article-title>Text2topic: Multi-label text classification system for eficient topic detection in user generated content with zero-shot capabilities</article-title>
          ,
          <source>in: 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP</source>
          <year>2023</year>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>