<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>IIR</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>A Spatial Approach to Predict Performance of Conversational Search Systems⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Discussion Paper</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Guglielmo Faggioli</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nicola Ferro</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cristina Muntean</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rafaele Perego</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nicola Tonellotto</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ISTI-CNR</institution>
          ,
          <addr-line>Pisa</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Padova</institution>
          ,
          <addr-line>Padova</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Pisa</institution>
          ,
          <addr-line>Pisa</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>13</volume>
      <abstract>
        <p>Recent advancements in Information Retrieval and Natural Language Processing have led to significant developments in the way users interact with search engines, with traditional one-shot textual queries being replaced by multi-turn conversations. As a highly interactive search scenario, Conversational Search (CS) can significantly benefit from Query Performance Prediction (QPP) techniques. However, the application of QPP in the CS domain is a relatively new field and requires proper framing. This study proposes a set of spatial-based QPP models, designed to work efectively in the conversational search domain, where dense neural retrieval models are the most common approach and query cutofs are small. The proposed QPP approaches are shown to improve the predictive performance over the state-of-the-art in diferent scenarios and collections, highlighting the utility of QPP in the CS domain.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        precision-oriented metrics and small cutof retrieval, while traditional QPP techniques have been
often devised and tested to predict Average Precision (AP) at large cutofs; ii) retrieving passages
or short documents, while classical QPP techniques are often designed for long documents; iii)
heavy usage of Neural Information Retrieval (NIR) techniques which have not been yet explored
extensively in the QPP domain; iv) in the CS domain, utterances are correlated and grouped
into conversations, making the evaluation of CS systems intrinsically diferent from classical
ad-hoc IR [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. This characteristic should be taken into consideration, at least when evaluating
the QPP models. While a good share of efort has been devoted to both the CS and QPP tasks
alone, at the current time only a few works studied the application of QPP techniques to CS.
Most of these works rely on the use of well-established classical QPP methods to choose how
the system should interact with the user [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] or to determine if the answer provided to the user
contains the relevant information [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], without taking into consideration all the peculiarities of
the CS domain described above. In this work, we aim at address this gap by proposing a set of
predictors explicitly designed to synergize the best with CS models. We start by considering that
most of the modern CS approaches rely on NIR techniques. Thus, we focus on CS models that
exploit documents’ and queries’ dense representations and propose QPP methodologies relying
on measuring how close retrieved documents’ representations are to the query. We devise
two predictors that measure the volume of the hypercube encompassing the top  retrieved
documents in response to a given query and show that such quantity efectively correlates with
the actual performance achieved.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Hyper-Volume Based Predictors</title>
      <p>In this work, we focus on dense representations of the documents and the queries. In particular,
we consider two well-known dense representation approaches STAR [11] and ConvDR [12].
Concerning the notation, we call  the -dimensional vector representation of the utterance,
and  the vector representation of the -th document retrieved in response to the information
need. As commonly done for most post-retrieval predictors, only the top- documents retrieved
are considered in computing the prediction – we call the top-k retrieved documents @.</p>
      <p>Given a query , we consider the top- documents retrieved to answer it. Given the
multidimension representation of the query  and documents 1, ..., , we consider the volume
that encompasses the query and all the top-k retrieved documents. If such a volume is small, we
can expect a high semantic correlation between the query and the documents. Contrarily, a large
volume might indicate documents poorly coherent with the query. In particular, we compute the
volume of the hyper-cube containing all the documents. To do this, we consider each dimension
ℎ of the learned representation and determine the length of the hyper-cube’s edge laying on ℎ
as: ℎ = | max({(ℎ), ∀ ∈ [1, ]} ∪ {(ℎ)}) − min{{(ℎ), ∀ ∈ [1, ]} ∪ {(ℎ)}}|, where
 is the ranked list cutof, (ℎ) and (ℎ) are respectively the values of the ℎ-th dimension
for the query and -th document. Finally, the volume   of the hyper-cube constructed around
the top-k documents for query  is computed as:   = ∏︀ℎ=1 ℎ. Notice that, while no specific
bound is present on ℎ, it is likely that such values are small, thus it is numerically more stable
to compute the log sum of such value. We define the first predictor, Reciprocal Volume (
1
() = − ∑︀ℎ=1 (ℎ) .</p>
      <p>Assuming that each dimension represents a latent aspect of the query, having a smaller
hypercube on a certain dimension suggests that all the retrieved documents are closely related to that
query’s latent aspect. Vice versa, if the cube is particularly big on that dimension, it is likely
that the retrieved documents treat the latent aspect in a very diferent way from the query.</p>
      <p>The reference measure used most often in conversational search [13, 14] is normalize
Discounted Cumulative Gain (nDCG) [15]. nDCG is based on the model of a user browsing the
ranked list of retrieved documents and accruing utility proportional to the relevance of the
document and inversely proportional to its position [16]. Inspired by it, we propose a second
predictor, dubbed Discounted Matryoshka (DM), defined as follows:
() = ∑︁  ()
=1 log( + 1)
.</p>
      <p>Starting from the first document retrieved, we construct the hyper-cube containing the
document(s) and the query and determine its volume. Each hyper-cube constructed by adding a new
document contains (or is equal to) the previous one  () ≤ +1() – they can be seen
as Matryoshka dolls. If moving from document to document such volume remains limited – all
Matryoshkas are similar and small – we assume that all top retrieved documents are consistent
with the query in all its dimensions and therefore we could assume a successful retrieval. Notice
that, the hyper-volume of each hyper-cube used to compute () is discounted by a discount
factor proportional to the number of points in the space used to construct it.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Experimental Evaluation</title>
      <p>Our experiments are based on 2019, 2020, and 2021 TREC Conversational Assistant Track
(CAsT)1 datasets2. In our experiments, we try to predict nDCG@3, the most commonly used
measure in CS [13, 14]. In our experiments, we use dense representations of original,
automatically rewritten, and manually rewritten queries, where missing keywords or references to
previous topics are resolved by human assessors. Original and manually rewritten queries are
encoded using the STAR model, while the automatically rewritten ones are obtained by using
the ConvDR model. For ConvDR we used publicly available weights3. For all models employing
it, the cutof hyperparameter  has been selected from the set {3, 5, 10, 50, 100, 500}. QPP models
have been fine-tuned using two-fold repeated sampling [17, 18, 19, 20] with 30 repetitions.</p>
      <p>
        Table 1 reports our experimental findings. First of all, it is interesting to notice that, when
using STAR vectors the DM predictor is either the best predictor or not statistically significantly
diferent from the best. This holds for all correlation measures considered. The high RBO
1Conversational Assistant Track, https://www.treccast.ai/
2for space reasons, we report results only on CAsT 2019. Full results are available in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
3https://github.com/thunlp/ConvDR
for the utterance labeling baseline is due to the fact that, by ranking higher self-explanatory
utterances, it is more likely to put in first positions utterances that are in fact “easier” – being
top-heavy, RBO awards this behaviour. To predict the performance of ConvDR, we consider
two alternatives, the first consists in using the same utterances that ConvDR uses, namely the
original ones (indicated with CDR-o), or the manually rewritten utterances (indicated with
CDR), to make results more comparable to those observed in STAR. It is important to notice
that while traditional predictors are influenced by the usage of either original or rewritten
queries, this is not the case for the proposed RV and DM predictors – they rely on the dense
representation of the utterance, regardless of its textual content. In terms of retrieval, both
ConvDR and ConvDR-o are exactly the same: the diference is the type of utterances used to
predict ConvDR performance for the traditional lexical QPP baselines. Notice that, in this sense,
the usage of ConvDR and rewritten utterances represent a non-realistic scenario, since they
would not be available to a real CS agent. Considering ConvDR with rewritten utterances, we
notice that the proposed predictors tend to fail compared to the baselines in the majority of the
cases with the exception of Rank-Biased Overlap (RBO) as correlation, where the performance is
statistically not diverse from the best method (WIG). If we consider the most conversational and
realistic scenario, ConvDR with original utterances and predictors based on original utterances,
we notice that DM is always the best method – with the only exception of the RBO measure,
where it ranks second, behind RV but statistically they are equivalent.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion and Future Work</title>
      <p>In this study, we explore the potential of a geometric framework for performance prediction in
the CS domain. We propose two geometric post-retrieval coherency predictors, which measure
the proximity of retrieved documents to the query by encapsulating them within a hypercube.
The predictors are applied to two conversational dense retrieval models, ConvDR and STAR,
on three established conversational collections, using the evaluation procedures defined. The
results demonstrate that our proposed methodology outperforms QPP baselines on CAsT 2019
and CAsT 2021. In conclusion, the significance of QPP in the CS domain is emphasized, and our
proposed models show promising results in improving QPP for conversational search. In future
research, we plan to investigate how to incorporate in the predictors signals from previous
utterances and their linguistic content.</p>
      <p>URL: https://doi.org/10.1145/3341981.3344219. doi:10.1145/3341981.3344219.
[11] J. Zhan, J. Mao, Y. Liu, J. Guo, M. Zhang, S. Ma, Optimizing dense retrieval model training
with hard negatives, in: Proceedings of the 44th International ACM SIGIR Conference on
Research and Development in Information Retrieval, SIGIR ’21, Association for Computing
Machinery, New York, NY, USA, 2021, p. 1503–1512. URL: https://doi.org/10.1145/3404835.
3462880. doi:10.1145/3404835.3462880.
[12] S. Yu, Z. Liu, C. Xiong, T. Feng, Z. Liu, Few-shot conversational dense retrieval, in:
Proceedings of the 44th International ACM SIGIR Conference on Research and
Development in Information Retrieval, SIGIR ’21, Association for Computing Machinery,
New York, NY, USA, 2021, p. 829–838. URL: https://doi.org/10.1145/3404835.3462856.
doi:10.1145/3404835.3462856.
[13] J. Dalton, C. Xiong, J. Callan, TREC cast 2019: The conversational assistance track
overview, CoRR abs/2003.13624 (2020) 1–10. URL: https://arxiv.org/abs/2003.13624.
arXiv:2003.13624.
[14] J. Dalton, C. Xiong, J. Callan, Cast 2020: The conversational assistance track overview, in:
E. M. Voorhees, A. Ellis (Eds.), Proceedings of the Twenty-Ninth Text REtrieval Conference,
TREC 2020, Virtual Event [Gaithersburg, Maryland, USA], November 16-20, 2020, volume
1266 of NIST Special Publication, National Institute of Standards and Technology (NIST),
2020, pp. 1–10. URL: https://trec.nist.gov/pubs/trec29/papers/OVERVIEW.C.pdf.
[15] K. Järvelin, J. Kekäläinen, Cumulated gain-based evaluation of IR techniques, ACM Trans.</p>
      <p>Inf. Syst. 20 (2002) 422–446. URL: http://doi.acm.org/10.1145/582415.582418. doi:10.1145/
582415.582418.
[16] B. Carterette, System efectiveness, user models, and user utility: a conceptual framework
for investigation, in: W. Ma, J. Nie, R. Baeza-Yates, T. Chua, W. B. Croft (Eds.), Proceeding
of the 34th International ACM SIGIR Conference on Research and Development in
Information Retrieval, SIGIR 2011, Beijing, China, July 25-29, 2011, ACM, 2011, pp. 903–912.</p>
      <p>URL: https://doi.org/10.1145/2009916.2010037. doi:10.1145/2009916.2010037.
[17] A. Shtok, O. Kurland, D. Carmel, F. Raiber, G. Markovits, Predicting query performance by
query-drift estimation, ACM Trans. Inf. Syst. 30 (2012) 11:1–11:35. URL: https://doi.org/10.
1145/2180868.2180873. doi:10.1145/2180868.2180873.
[18] S. Datta, D. Ganguly, M. Mitra, D. Greene, A Relative Information Gain-Based Query
Performance Prediction Framework with Generated Query Variants, ACM Transactions
on Information Systems 41 (2022) 1–31.
[19] O. Zendel, A. Shtok, F. Raiber, O. Kurland, J. S. Culpepper, Information needs, queries, and
query performance prediction, in: B. Piwowarski, M. Chevalier, É. Gaussier, Y. Maarek,
J. Nie, F. Scholer (Eds.), Proceedings of the 42nd International ACM SIGIR Conference
on Research and Development in Information Retrieval, SIGIR 2019, Paris, France, July
21-25, 2019, ACM, 2019, pp. 395–404. URL: https://doi.org/10.1145/3331184.3331253. doi:10.
1145/3331184.3331253.
[20] H. Zamani, W. B. Croft, J. S. Culpepper, Neural Query Performance Prediction using
Weak Supervision from Multiple Signals, in: K. Collins-Thompson, Q. Mei, B. D. Davison,
Y. Liu, E. Yilmaz (Eds.), The 41st International ACM SIGIR Conference on Research &amp;
Development in Information Retrieval, SIGIR 2018, Ann Arbor, MI, USA, July 08-12, 2018,
ACM, 2018, pp. 105–114. doi:10.1145/3209978.3210041.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>G.</given-names>
            <surname>Faggioli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Muntean</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Perego</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Tonellotto</surname>
          </string-name>
          ,
          <article-title>A Geometric Framework for Query Performance Prediction in Conversational Search</article-title>
          ,
          <source>in: Proceedings of 46th international ACM SIGIR conference research development in information retrieval</source>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>D.</given-names>
            <surname>Carmel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Yom-Tov</surname>
          </string-name>
          ,
          <article-title>Estimating the Query Dificulty for Information Retrieval</article-title>
          ,
          <source>Synthesis Lectures on Information Concepts</source>
          , Retrieval, and Services, Morgan &amp; Claypool Publishers,
          <year>2010</year>
          . doi:
          <volume>10</volume>
          .2200/S00235ED1V01Y201004ICR015.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>P.</given-names>
            <surname>Thomas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Scholer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bailey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mofat</surname>
          </string-name>
          , Tasks, queries, and
          <article-title>rankers in pre-retrieval performance prediction</article-title>
          , in: B.
          <string-name>
            <surname>Koopman</surname>
            , G. Zuccon,
            <given-names>M. J.</given-names>
          </string-name>
          <string-name>
            <surname>Carman</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the 22nd Australasian Document Computing Symposium, ADCS</source>
          <year>2017</year>
          ,
          <article-title>Brisbane</article-title>
          ,
          <string-name>
            <surname>QLD</surname>
          </string-name>
          , Australia, December 7-
          <issue>8</issue>
          ,
          <year>2017</year>
          , ACM,
          <year>2017</year>
          , pp.
          <volume>11</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>11</lpage>
          :4. doi:
          <volume>10</volume>
          .1145/3166072.3166079.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>H.</given-names>
            <surname>Scells</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Azzopardi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Zuccon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Koopman</surname>
          </string-name>
          ,
          <article-title>Query variation performance prediction for systematic reviews</article-title>
          , in: K.
          <string-name>
            <surname>Collins-Thompson</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          <string-name>
            <surname>Mei</surname>
            ,
            <given-names>B. D.</given-names>
          </string-name>
          <string-name>
            <surname>Davison</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Liu</surname>
          </string-name>
          , E. Yilmaz (Eds.),
          <source>The 41st International ACM SIGIR Conference on Research &amp; Development in Information Retrieval</source>
          ,
          <string-name>
            <surname>SIGIR</surname>
          </string-name>
          <year>2018</year>
          , Ann Arbor, MI, USA, July
          <volume>08</volume>
          -
          <issue>12</issue>
          ,
          <year>2018</year>
          , ACM,
          <year>2018</year>
          , pp.
          <fpage>1089</fpage>
          -
          <lpage>1092</lpage>
          . doi:
          <volume>10</volume>
          .1145/3209978.3210078.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>H.</given-names>
            <surname>Roitman</surname>
          </string-name>
          ,
          <article-title>Enhanced Performance Prediction of Fusion-based Retrieval</article-title>
          , in: D.
          <string-name>
            <surname>Song</surname>
            , T. Liu,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Bruza</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Melucci</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Sebastiani</surname>
            ,
            <given-names>G. H.</given-names>
          </string-name>
          <string-name>
            <surname>Yang</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval</source>
          ,
          <string-name>
            <surname>ICTIR</surname>
          </string-name>
          <year>2018</year>
          , Tianjin, China,
          <source>September 14-17</source>
          ,
          <year>2018</year>
          , ACM,
          <year>2018</year>
          , pp.
          <fpage>195</fpage>
          -
          <lpage>198</lpage>
          . URL: https://doi.org/ 10.1145/3234944.3234950. doi:
          <volume>10</volume>
          .1145/3234944.3234950.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>N.</given-names>
            <surname>Tonellotto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Macdonald</surname>
          </string-name>
          ,
          <article-title>Using an inverted index synopsis for query latency and performance prediction</article-title>
          ,
          <source>ACM Trans. Inf. Syst</source>
          .
          <volume>38</volume>
          (
          <year>2020</year>
          ). doi:
          <volume>10</volume>
          .1145/3389795.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>H.</given-names>
            <surname>Hashemi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Zamani</surname>
          </string-name>
          , W. B.
          <string-name>
            <surname>Croft</surname>
          </string-name>
          ,
          <article-title>Performance prediction for non-factoid question answering</article-title>
          , in: Y. Fang,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Allan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Balog</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Carterette</surname>
          </string-name>
          , J. Guo (Eds.),
          <source>Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval</source>
          ,
          <string-name>
            <surname>ICTIR</surname>
          </string-name>
          <year>2019</year>
          , Santa Clara, CA, USA, October 2-
          <issue>5</issue>
          ,
          <year>2019</year>
          , ACM,
          <year>2019</year>
          , pp.
          <fpage>55</fpage>
          -
          <lpage>58</lpage>
          . doi:
          <volume>10</volume>
          .1145/ 3341981.3344249.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>G.</given-names>
            <surname>Faggioli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ferrante</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Perego</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Tonellotto</surname>
          </string-name>
          ,
          <article-title>Hierarchical dependence-aware evaluation measures for conversational search</article-title>
          , in: F. Diaz,
          <string-name>
            <given-names>C.</given-names>
            <surname>Shah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Suel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Castells</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Jones</surname>
          </string-name>
          , T. Sakai (Eds.),
          <source>SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          , Virtual Event, Canada,
          <source>July 11-15</source>
          ,
          <year>2021</year>
          , ACM,
          <year>2021</year>
          , pp.
          <fpage>1935</fpage>
          -
          <lpage>1939</lpage>
          . doi:
          <volume>10</volume>
          .1145/3404835.3463090.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Aliannejadi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Zamani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Crestani</surname>
          </string-name>
          , W. B.
          <string-name>
            <surname>Croft</surname>
          </string-name>
          ,
          <article-title>Asking Clarifying Questions in OpenDomain Information-Seeking Conversations</article-title>
          , in: B.
          <string-name>
            <surname>Piwowarski</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Chevalier</surname>
            , É. Gaussier,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Maarek</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Nie</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Scholer</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          ,
          <string-name>
            <surname>SIGIR</surname>
          </string-name>
          <year>2019</year>
          , Paris, France,
          <source>July 21-25</source>
          ,
          <year>2019</year>
          , ACM,
          <year>2019</year>
          , pp.
          <fpage>475</fpage>
          -
          <lpage>484</lpage>
          . URL: https://doi.org/10.1145/3331184. 3331265. doi:
          <volume>10</volume>
          .1145/3331184.3331265.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>H.</given-names>
            <surname>Roitman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Erera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Feigenblat</surname>
          </string-name>
          ,
          <article-title>A Study of Query Performance Prediction for Answer Quality Determination</article-title>
          , in: Y. Fang,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Allan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Balog</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Carterette</surname>
          </string-name>
          , J. Guo (Eds.),
          <source>Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval</source>
          ,
          <string-name>
            <surname>ICTIR</surname>
          </string-name>
          <year>2019</year>
          , Santa Clara, CA, USA, October 2-
          <issue>5</issue>
          ,
          <year>2019</year>
          , ACM,
          <year>2019</year>
          , pp.
          <fpage>43</fpage>
          -
          <lpage>46</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>