<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Revisiting Retrieval-based Approaches for Conversational Recommender Systems</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Ahtsham</forename><surname>Manzoor</surname></persName>
						</author>
						<author role="corresp">
							<persName><forename type="first">Dietmar</forename><surname>Jannach</surname></persName>
							<email>dietmar.jannach@aau.at</email>
						</author>
						<author>
							<affiliation key="aff0">
								<orgName type="institution">University of Klagenfurt</orgName>
								<address>
									<addrLine>Universitätsstraße 65-67, am Wörthersee</addrLine>
									<postCode>9020</postCode>
									<settlement>Klagenfurt</settlement>
									<country key="AT">Austria</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff1">
								<orgName type="department" key="dep1">12th</orgName>
								<orgName type="department" key="dep2">Italian Information Retrieval Workshop</orgName>
								<address>
									<addrLine>June 29 -June 30th</addrLine>
									<postCode>2022</postCode>
									<settlement>Milan</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Revisiting Retrieval-based Approaches for Conversational Recommender Systems</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">C527DAFE37C53F2E67F10AF025EA3AED</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T20:10+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Conversational recommendation</term>
					<term>retrieval and ranking</term>
					<term>language generation</term>
					<term>evaluation</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Conversational recommender systems (CRS) interact with users in natural language to support them in their decision-making process. Recently, an increased interest in developing novel approaches to CRS can be observed. Mainly, current research efforts rely on neural models trained on recorded recommendation dialogs between humans, thereby implementing an end-to-end learning process. Given a user utterance in an ongoing dialog, the trained models generate suitable responses. An alternative to generation-based approaches is to retrieve responses from the recorded dialog data and adapt them to the given dialog context. Such retrieval approaches have proven to be effective in various NLP tasks, but have received limited attention for CRS so far. In our ongoing research, we re-assess the potential value of retrievalbased approaches and compare their performance with recent generation-based approaches. Our results point to various limitations of current neural models and indicate that retrieval-based approaches can be an effective complement to today's generation-based techniques.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Conversational recommender systems (CRS) are software agents that converse with humans in natural language. The main goal of such agents is to recommend items of interest to the users and help them in making their decisions. In recent years, we observe that CRS obtained increased attention, mainly due to the spread of voice assistants like Alexa and Siri, and due to advancements in the area of natural language processing (NLP) and machine learning (ML) in general; see <ref type="bibr" target="#b0">[1]</ref> for a recent survey on conversational recommenders.</p><p>Current approaches to building CRS <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b2">3,</ref><ref type="bibr" target="#b3">4,</ref><ref type="bibr" target="#b4">5,</ref><ref type="bibr" target="#b5">6]</ref> mainly adapt an end-to-end learning paradigm. One promise of such end-to-end learning approaches is to avoid the knowledge engineering bottlenecks of traditional critiquing-based or constraint-based CRS <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b7">8,</ref><ref type="bibr" target="#b8">9]</ref>. Specifically, these systems rely on ML models that are trained on large corpora of recommendation dialogs recorded with the help of paired humans, e.g., the ReDial dataset <ref type="bibr" target="#b1">[2]</ref>. Various algorithmic approaches were proposed in recent years in which the system relies on models that were trained on such datasets to generate a response given a user utterance in an ongoing dialog. While today's systems are often capable of returning meaningful responses, they may still fail in many cases. In the online material<ref type="foot" target="#foot_0">1</ref> provided by the authors of the first neural model built on the ReDial dataset <ref type="bibr" target="#b1">[2]</ref>, for instance, the system once responds to a user as follows: "what kind of movies do you like ? what kind of movies do you like ?". A further inspection of more examples revealed that there are various situations where the proposed system had difficulties to generate suitable responses. Therefore, the question arises to what extent current systems may actually be usable in practice. Moreover, since the performance evaluation of recent neural models is mostly based on offline experiments and computational metrics it remains unclear how users would perceive the quality of the responses of such systems.</p><p>While generation-based approaches can have different advantages, e.g., that they should be capable to respond to previously unseen dialog situations, we find that today's systems often encounter difficulties to correctly interpret the user's current intent and that they sometimes also return responses that are too short or too general <ref type="bibr" target="#b9">[10]</ref>. An alternative to recent NLP-enabled generation-based systems are retrieval-based approaches. In such approaches, the idea is to retrieve appropriate responses from the underlying dataset and adapt them to the context of the dialog if needed. Retrieval-based approaches have been applied in various NLP tasks such as question-answering (Q&amp;A) systems, machine translation, and open-domain dialog systems <ref type="bibr" target="#b10">[11,</ref><ref type="bibr" target="#b11">12,</ref><ref type="bibr" target="#b12">13,</ref><ref type="bibr" target="#b13">14,</ref><ref type="bibr" target="#b14">15]</ref>. One main advantage of retrieval-based systems comes from the fact that such approaches do not require long and expensive training of models. Moreover, the returned utterances are mostly complete and semantically correct as they were originally made by humans, see also <ref type="bibr" target="#b15">[16]</ref> for a comparison of generation and retrieval-based methods.</p><p>In this paper, we summarize our recent findings on the state of the art in conversational recommendation published in <ref type="bibr" target="#b16">[17,</ref><ref type="bibr" target="#b17">18,</ref><ref type="bibr" target="#b18">19,</ref><ref type="bibr" target="#b19">20]</ref>. Our main observations reported in these works include (a) that today's generation-based models may have difficulties to respond in a meaningful way in a substantial number of cases (30-40 %); (b) that some of these systems almost exclusively "generate" sentences that appear in the training data in the exact same form; and (c) that retrievalbased approaches may represent a promising alternative or complement to generation-based approaches. Differently from most current works, our analyses and experiments are mainly based on user-centric evaluation approaches, given the known limitations of offline experiments and certain linguistic metrics when evaluating dialog systems <ref type="bibr" target="#b20">[21,</ref><ref type="bibr" target="#b21">22]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Assessing Generation-based and Retrieval-based CRS</head><p>Analysis of generation-based CRS. We started our journey with the analysis of the system proposed by Li et al. <ref type="bibr" target="#b1">[2]</ref>, which we refer to as DeepCRS from here on. DeepCRS which was built on the ReDial dataset was also released in the same paper. This dataset contains over 10,000 recommendation dialogs that were obtained with the help of crowdworkers. Later on, Chen et al. <ref type="bibr" target="#b2">[3]</ref> proposed another system based on the ReDial dataset, named KBRD, and they reported improved performance over DeepCRS in different dimensions.</p><p>As a first step in our research, we used the code of these two systems (DeepCRS and KBRD) to generate system responses for 70 dialog situations. We then relied on human judges who assessed the returned system responses in terms of (1) the quality of the generated responses on a binary scale, (2) the quality of the specific recommendations made in the dialogs. Moreover, we automatically measured the originality of the returned responses, i.e., if they can also be found in the training data in identical or similar form.</p><p>The results of our analysis exhibited that 31 % of the responses by DeepCRS and 42 % of the responses by KBRD were not considered meaningful by the human judges, see <ref type="bibr" target="#b16">[17,</ref><ref type="bibr" target="#b17">18]</ref> for details of these analyses. Also, about 40 % (DeepCRS) and 45 % (KBRD) of the recommendations of the two systems were not found to be suitable. Typical problems include responses that do not match the dialog situation, broken sentences, or duplicated responses or recommendations in the same dialog. Moreover, it turned out that in almost all cases both neural approaches actually did not generate new sentences, but returned sentences that appeared in the exact same or very similar form in the training data. Interestingly, the manual analysis with human judges indicated that DeepCRS was actually the better system when using our specific evaluation procedure, which stands in contrast to the findings reported in <ref type="bibr" target="#b2">[3]</ref>.</p><p>What about rule-based approaches? Many of today's online chatbots, e.g., ones that use Google's DialogFlow<ref type="foot" target="#foot_1">2</ref> , are based on pre-defined text templates and an inference mechanism that determines the user's current intent and then selects a suitable template as a response. Our next research question was how far we can get with a simple, manually engineered rule-based systems and how such a system would fare when compared to DeepCRS and KBRD. To that purpose, we analyzed the ReDial dataset to identify the most frequent user intents. We then developed a simple rule-based pattern-matching CRS-in the spirit of the famous ELIZA <ref type="bibr" target="#b22">[23]</ref> system-which guesses the intent of a given user utterance and then selects one of several suitable pre-curated sentences as a response. We compared our rule-based system with DeepCRS and KBRD through a user study, where study participants (N=58) had to assess the quality of the responses, using a 5-point scale, by the different systems for a given dialog situation, see <ref type="bibr" target="#b17">[18]</ref> for details. Specifically, each participant was tasked to asses 10 dialog situations, one of them was considered as an attention check, thereby in total 522 situations were assessed. The study showed that our simple rule-based system, which mainly consists of a few dozen if-statements, on average led to better quality perceptions than the recent neural models. However, there were still many situations in which all three compared systems failed.</p><p>Towards retrieval-based conversational recommendation. Since we found that the neural models mainly returned existing utterances from the underlying dataset, and we do not consider engineered rule-based systems to be the future of CRS, we designed a retrieval-based approach. The first version of this approach and its evaluation are discussed in <ref type="bibr" target="#b18">[19]</ref>. Given the last user utterance as an input, the system first determines similar user utterances in the dataset. The responses to these similar user utterances in the dataset are considered as candidates to return to the user, and we designed a small set of heuristics to select one of these candidates. In case the chosen response actually includes a recommendation, i.e., it is not some other form of utterance like a greeting, a specific recommendation module is used to determine a suitable item recommendation, which is then integrated into the selected response text. Again, we evaluated our system through a user study, with (N=60) subjects, like the one used to evaluate our rule-based system. The study, which again included DeepCRS and KBRD as baselines, showed that the retrieval-based system on average led to the best quality perception among the compared systems. As a side result, we also found that KBRD performed better than DeepCRS in the user study, which supports the findings reported in <ref type="bibr" target="#b2">[3]</ref>.</p><p>Inspired by these promising findings, we further improved our retrieval-based systems, see <ref type="bibr" target="#b19">[20]</ref>. Specifically, we found that our system has difficulties to respond to very short user utterances. Therefore, we adapted the approach in a way that it considers not only the last user utterance as a context but several of the most recent utterances in the dialog. Technically, we retrieve several sets of candidates, and we then use an outlier pruning (or: clustering) technique to identify the most plausible candidates, which in our case are candidates that are similar to each other. In a final step, the remaining candidates undergo a final selection process, where perplexity is used as a linguistic metric to rank the candidates. Before the response is returned to the user, again suitable item recommendations are injected using the recommender module.</p><p>The evaluation was done through a user study with (N=90) subjects. This time we included KBRD and the more recent KGSF system <ref type="bibr" target="#b3">[4]</ref> as baselines in the experiment. While KGSF outperformed KBRD, our improved retrieval-based system led to even better results than KGSF. In <ref type="bibr" target="#b19">[20]</ref>, we also report the outcomes of a number of additional analyses. We for example compare the performance of the systems at different dialog stages, we discuss difficult situations (intents) and typical failure points, and we specifically analyze in which ways users express their preferences in the ReDial dataset. Furthermore, we explore the relationships between the perceived meaningfulness of a response and certain (linguistic) characteristics of the response.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Discussion</head><p>Substantial progress was made recently for CRS that support natural-language interactions. Still, our work shows that building effective CRS remains a "grand AI challenge" <ref type="bibr" target="#b23">[24]</ref>. Our research indicates that retrieval-based approaches can be an alternative or complement to today's dominating approaches based on language generation. Given that both types of approaches have their advantages, future works may therefore more often consider hybrid systems that combine sentence retrieval and sentence generation in one system. Positive experiences with hybrid solutions were observed for related problem settings, e.g., for the AliMe chat, a single-turn Q&amp;A system <ref type="bibr" target="#b24">[25]</ref> or Microsoft's XiaoIce, a popular social chatbot system <ref type="bibr" target="#b25">[26]</ref>.</p><p>Our research also highlights that today's efforts to build CRS is hampered by the datasets that are available for learning. Dialogs in the ReDial dataset, for example, in many cases mention individual movies, e.g., when users indicate their preferences, but not preferences in terms of genres. In addition, there are almost no situations where the recommendation seeker asks for an explanation, which makes it almost impossible for the CRS to learn how to explain. Better datasets are therefore required, also ones that allow for more social interactions, e.g., <ref type="bibr" target="#b26">[27]</ref>.</p><p>Finally, our works shed light on potential issues of today's predominant evaluation practices. Researchers often seem to overly rely on computational metrics. While some reported humanevaluations, which are typically not described in detail. With our research works we lay out one possible way of how to evaluate CRS with humans in the loop. More work however remains to be done to ensure consistent progress in this challenging research area.</p></div>			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://proceedings.neurips.cc/paper/2018/hash/800de15c79c8d840f4e78d3af937d4d4-Abstract.html</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">https://dialogflow.cloud.google.com/</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">A survey on conversational recommender systems</title>
		<author>
			<persName><forename type="first">D</forename><surname>Jannach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Manzoor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Cai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Chen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Computing Surveys</title>
		<imprint>
			<biblScope unit="volume">54</biblScope>
			<biblScope unit="page" from="1" to="36" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Towards deep conversational recommendations</title>
		<author>
			<persName><forename type="first">R</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">E</forename><surname>Kahou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Schulz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Michalski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Charlin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">NIPS &apos;18</title>
				<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="9725" to="9735" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Towards knowledge-based recommender dialog system</title>
		<author>
			<persName><forename type="first">Q</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Ding</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Cen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Tang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">EMNLP-IJCNLP &apos;19</title>
				<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="1803" to="1813" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Improving conversational recommender systems via knowledge graph based semantic fusion</title>
		<author>
			<persName><forename type="first">K</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">X</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Bian</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>-R. Wen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Yu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">KDD &apos;20</title>
				<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="1006" to="1014" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Towards topic-guided conversational recommender system</title>
		<author>
			<persName><forename type="first">K</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">X</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J.-R</forename><surname>Wen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ICCL &apos;20</title>
				<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="4128" to="4139" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">CRFR: Improving conversational recommender systems via flexible fragments reasoning on knowledge graphs</title>
		<author>
			<persName><forename type="first">J</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>He</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Hou</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">EMNLP &apos;21</title>
				<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="4324" to="4334" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Preference-based organization interfaces: Aiding user critiques in recommender systems</title>
		<author>
			<persName><forename type="first">L</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Pu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">UM &apos;</title>
		<imprint>
			<biblScope unit="volume">07</biblScope>
			<biblScope unit="page" from="77" to="86" />
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Constraint-based recommender systems</title>
		<author>
			<persName><forename type="first">A</forename><surname>Felfernig</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Friedrich</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Jannach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Zanker</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Recommender Systems Handbook</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="161" to="190" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">ADVISOR SUITE -A knowledge-based sales advisory system</title>
		<author>
			<persName><forename type="first">D</forename><surname>Jannach</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ECAI &apos;04</title>
				<imprint>
			<date type="published" when="2004">2004</date>
			<biblScope unit="page" from="720" to="724" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">An ensemble of retrieval-based and generation-based human-computer conversation systems</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Song</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C.-T</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J.-Y</forename><surname>Nie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Yan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IJCAI &apos;18</title>
				<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="4382" to="4388" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Statistical machine translation for query expansion in answer retrieval</title>
		<author>
			<persName><forename type="first">S</forename><surname>Riezler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Vasserman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Tsochantaridis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">O</forename><surname>Mittal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Liu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ACL &apos;07</title>
				<imprint>
			<date type="published" when="2007">2007</date>
			<biblScope unit="page" from="464" to="471" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">FAQ retrieval using query-question similarity and BERT-based query-answer relevance</title>
		<author>
			<persName><forename type="first">W</forename><surname>Sakata</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Shibata</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Tanaka</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Kurohashi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">SIGIR &apos;19</title>
				<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="1113" to="1116" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Retrieval-augmented Transformer-XL for close-domain dialog generation</title>
		<author>
			<persName><forename type="first">G</forename><surname>Bonetta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Cancelliere</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Vozila</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">FLAIRS &apos;21</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">A retrieval-based dialogue system utilizing utterance and context embeddings</title>
		<author>
			<persName><forename type="first">A</forename><surname>Bartl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Spanakis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ICMLA &apos;17</title>
				<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="1120" to="1125" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Open-domain utterance generation for conversational dialogue systems using web-scale dependency structures</title>
		<author>
			<persName><forename type="first">H</forename><surname>Sugiyama</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Meguro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Higashinaka</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Minami</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">SIGDIAL &apos;</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="page" from="334" to="338" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">A hybrid retrieval-generation neural conversation model</title>
		<author>
			<persName><forename type="first">L</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Qiu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Qu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Gao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">B</forename><surname>Croft</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Shen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Liu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">CIKM &apos;19</title>
				<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="1341" to="1350" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">End-to-end learning for conversational recommendation: A long way to go?</title>
		<author>
			<persName><forename type="first">D</forename><surname>Jannach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Manzoor</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IntRS Workshop at RecSys &apos;20</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Conversational recommendation based on end-to-end learning: How far are we?</title>
		<author>
			<persName><forename type="first">A</forename><surname>Manzoor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Jannach</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computers in Human Behavior Reports</title>
		<imprint>
			<biblScope unit="page">100139</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Generation-based vs. retrieval-based conversational recommendation: A user-centric comparison</title>
		<author>
			<persName><forename type="first">A</forename><surname>Manzoor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Jannach</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">RecSys &apos;21</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Towards retrieval-based conversational recommendation</title>
		<author>
			<persName><forename type="first">A</forename><surname>Manzoor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Jannach</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Information Systems</title>
		<imprint>
			<biblScope unit="page">102083</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">How NOT to evaluate your dialogue system: An empirical study of unsupervised evaluation metrics for dialogue response generation</title>
		<author>
			<persName><forename type="first">C.-W</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Lowe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Serban</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Noseworthy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Charlin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Pineau</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">EMNLP &apos;16</title>
				<imprint>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="2122" to="2132" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Escaping the mcnamara fallacy: Towards more impactful recommender systems research</title>
		<author>
			<persName><forename type="first">D</forename><surname>Jannach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Bauer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">AI Magazine</title>
		<imprint>
			<biblScope unit="volume">41</biblScope>
			<biblScope unit="page" from="79" to="95" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">ELIZA -Computer program for the study of natural language communication between man and machine</title>
		<author>
			<persName><forename type="first">J</forename><surname>Weizenbaum</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Communications. ACM</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="page" from="36" to="45" />
			<date type="published" when="1966">1966</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Conversational Recommendation: A Grand AI Challenge</title>
		<author>
			<persName><forename type="first">D</forename><surname>Jannach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Chen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">AI Magazine</title>
		<imprint>
			<biblScope unit="volume">43</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">AliMe chat: A sequence to sequence and rerank based chatbot engine</title>
		<author>
			<persName><forename type="first">M</forename><surname>Qiu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F.-L</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Gao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Chu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACL&apos;</title>
		<imprint>
			<biblScope unit="volume">17</biblScope>
			<biblScope unit="page" from="498" to="503" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">The design and implementation of XiaoIce, an empathetic social chatbot</title>
		<author>
			<persName><forename type="first">L</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Gao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H.-Y</forename><surname>Shum</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">46</biblScope>
			<biblScope unit="page" from="53" to="93" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">INSPIRED: Toward sociable recommendation dialog systems</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">A</forename><surname>Hayati</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Kang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Zhu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Shi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Yu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">EMNLP &apos;20</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
