<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>An Automatic Procedure for Generating Datasets for Conversational Recommender Systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Alessandro Suglia</string-name>
          <email>alessandro.suglia@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Claudio Greco</string-name>
          <email>claudiogaetanogreco@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pierpaolo Basile</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giovanni Semeraro</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Annalina Caputo</string-name>
          <email>annalina.caputo@adaptcentre.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ADAPT Centre, Trinity College Dublin</institution>
          ,
          <addr-line>Dublin</addr-line>
          ,
          <country country="IE">Ireland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Computer Science, University of Bari Aldo Moro</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Conversational Recommender Systems assist online users in their information-seeking and decision making tasks by supporting an interactive process with the aim of nding the most appealing items according to the user preferences. Unfortunately, collecting dialogues data to train these systems can be labour-intensive, especially for data-hungry Deep Learning models. Therefore, we propose an automatic procedure able to generate plausible dialogues from recommender systems datasets.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>the knowledge base contains the triple (item, predicate, object), 0 otherwise. The
considered predicates are wdt:P57 (director), wdt:P161 (cast member), wdt:P136
(genre) 4. The dialogue generation procedure is an iterative algorithm which is
executed until all user preferences have been used. At each step of the dialog
generation procedure, a top-n list of items composed by positive and negative
items is generated by randomly choosing from positive and negative preferences
of the given user u. Then, paths from the root of the decision tree to the
consistently classi ed examples are exploited to generate a sequence of questions,
randomly chosen according to a binomial distribution over the item features,
to elicit user preferences. Depending from the percentage of positive items in
the top-n, a \re ne" step is triggered which extends the dialog with additional
questions that lead to a list of suggestions which contains only positive items.</p>
      <p>Table 1 shows a conversation generated by applying the designed procedure
to the well-known MovieLens 1M recommender systems dataset. In the rst
part of the conversation, utterances with the aim of introducing the user are
generated by exploiting the contextual information included in the dataset.</p>
      <p>In this work we have proposed an automatic procedure able to generate
synthetic dialogue datasets starting from well-known datasets in the recommender
system eld. The presented procedure is completely generic and can be applied
on any dataset containing binary user preferences and whose items have a
corresponding identi er in the Linked Open Data Cloud.
4 The pre x wdt stands for http://www.wikidata.org/prop/direct/</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Mahmood</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ricci</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Improving recommender systems with adaptive conversational strategies</article-title>
          .
          <source>In: Proceedings of the 20th ACM conference on Hypertext and hypermedia</source>
          . pp.
          <volume>73</volume>
          {
          <fpage>82</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>