<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Conventional Metrics: Assessing User Simulators in Information Retrieval</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Saber Zerhoudi</string-name>
          <email>saber.zerhoudi@uni-passau.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michael Granitzer</string-name>
          <email>michael.granitzer@uni-passau.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Evaluation Metrics, Simulation Evaluation, Information Retrieval</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Figure 1: Bootstrap analysis of simulation approaches</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Passau</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Traditional evaluation methods for user simulators in Information Retrieval systems have limitations in assessing their reliability for comparative analysis. To address this, we apply the Fréchet Distance (FD) to measure similarity between real and simulated user search session distributions. Using the TREC Session 2014 Track dataset, we compare FD's performance against established metrics like session nDCG and Expected Global Utility. Our study explores FD's efectiveness in various scenarios, including those with minimal and extensive interaction data, and examines its sensitivity to diferent feature extraction methods. Results show that FD correlates strongly with existing metrics while ofering unique insights into session similarity, particularly for complex, multi-query sessions. FD demonstrates robustness across feature extraction techniques and versatility in various evaluation scenarios. This research contributes to the field of Interactive Information Retrieval (IIR) by providing a more comprehensive framework for evaluating simulated search sessions.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR
ceur-ws.org</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>The evolution of Interactive Information Retrieval (IIR) systems has introduced new challenges
in performance evaluation, particularly for simulated search sessions. Traditional metrics often
fail to capture the complex, dynamic nature of user interactions in modern search environments,
which involve query sequences, diverse user actions, and temporal elements. Conventional
methods typically require extensive real user interaction data, which is costly and dificult
to obtain, and may not adequately assess simulation fidelity for comparative analyses across
diferent IIR systems.</p>
      <p>To address these limitations, we propose the application of Fréchet Distance (FD) as a novel
metric for evaluating the similarity between real and simulated user search sessions in IIR. This
approach extends FD’s successful application from fields like computer vision to information
retrieval. FD ofers a quantitative measure of how well simulated data replicates complex
patterns of real user behavior, potentially serving as a standard for assessing user simulator
performance and reliability in IIR systems. Our investigation into FD’s eficacy for IIR systems is
guided by four research questions: (1) How efectively does FD measure the quality of simulated
search sessions with minimal interaction data? (2) Can FD accurately evaluate the quality of
CEUR
Workshop
Proceedings
simulated search sessions with extensive interaction data? (3) How correlated is the performance
of IR systems in simulated search sessions, as measured by FD, compared to other metrics used
to assess the similarity of simulated user sessions? (4) How sensitive is FD to diferent feature
extraction methods when assessing the similarity of simulated search sessions?</p>
      <p>
        Our experimental study utilizes the TREC Session 2014 Track dataset[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], which provides
detailed logs of user interactions across multiple search sessions. This dataset is particularly
suitable for our research due to its structured representation of query sequences and user
actions over time. To compute FD, we employ various feature extraction methods to create
vector representations of both simulated and real sessions, ranging from simple query-based
embeddings to more sophisticated approaches like BERT-based Session Embedding and
Timeaware Session Embedding.
      </p>
      <p>
        We investigate the correlation between FD and established session similarity metrics such as
session nDCG (sDCG) [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and Expected Global Utility (EGU) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], examine FD’s sensitivity to
diferent feature extraction methods, and explore its performance across various session lengths
and complexities. Additionally, we assess FD’s performance with both minimal and extensive
interaction data, providing insights into its versatility as an evaluation metric. By integrating
this sophisticated approach into IIR system evaluation, our research aims to demonstrate FD’s
potential as a metric that not only compares the quality of simulated and real user sessions but
also ofers insights into the reliability and accuracy of diferent simulation methodologies. This
study could significantly enhance the development and assessment of user simulators, leading
to more efective and user-centric interactive information retrieval systems.
      </p>
    </sec>
    <sec id="sec-3">
      <title>2. Related Work</title>
      <p>
        The evaluation of interactive information retrieval (IIR) systems has evolved significantly,
moving from single query-based metrics to more comprehensive session-based measures. This
shift reflects the recognition of complex, multi-query search behaviors and the limitations of
traditional evaluation methods. Early eforts to address this led to the development of
sessionbased extensions of traditional metrics, such as Session nDCG (sDCG) by Järvelin et al. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ],
which applies a discount to results from later queries in a session.
      </p>
      <p>
        Building on this concept, Yang and Lad [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] proposed a framework for modeling user browsing
behavior and computing Expected Global Utility (EGU) over a session, while Kanoulas et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]
introduced the concept of modeling a user’s browsing behavior as a ”path.” Although these
session-based metrics represented a significant advancement, they often did not model detailed
browsing behaviors, such as clicking decisions. The development of click models [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] addressed
this gap, providing insights into user interaction patterns. Fuhr [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] proposed the Interactive
IR Probability Ranking Principle (IIR-PRP), which theoretically integrates a user’s clicking
decision with a measure of overall utility of a ranked list. Zerhoudi et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] used the
twosample Kolmogorov-Smirnov (KS-2) goodness-of-fit test and a classification-based evaluation to
evaluate simulated user interactions in the context of a search session. However, most existing
session-based evaluation measures and click models assume sequential browsing, which may
not hold in modern search interfaces with complex layouts and interaction possibilities.
      </p>
      <p>
        The Fréchet metric, a natural measure of similarity between two curves, has gained
prominence in various applications [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. This metric can be intuitively understood by imagining a
dog and its handler walking on separate curves. Both can control their speed but must move
forward, with the Fréchet distance representing the minimal leash length required for them to
traverse their respective paths from start to finish. Due to its efectiveness in comparing curve
similarities, the Fréchet distance and its variants have found widespread use across diverse fields.
These applications include dynamic time-warping [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], speech recognition [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], and matching of
time series in databases [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. The versatility of the Fréchet metric in these domains underscores
its significance in analyzing and comparing complex, non-linear data patterns.
      </p>
      <p>
        Inspired by the success of distribution-based metrics in other fields, such as the Fréchet
Inception Distance (FID) in computer vision [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], our work introduces the Fréchet Distance
(FD) as a novel metric for evaluating simulated search sessions in IIR. This approach addresses
several limitations of existing methods by handling complex interactions, requiring minimal
data, providing distribution-based comparisons, ofering flexibility in feature representation,
and investigating correlations with established metrics.
      </p>
      <p>By adapting FD to the evaluation of simulated search sessions, our work bridges the gap
between advanced evaluation techniques in other fields and the specific needs of IIR evaluation.
This approach ofers a promising direction for developing more accurate and comprehensive
evaluation methods for modern IIR systems, particularly in scenarios where traditional metrics
may fall short due to data sparsity or complex user interaction patterns.</p>
      <p>
        Arabzadeh and Clarke [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]’s proposal to use the Fréchet Distance to measure the distance
between the distributions of relevant judged items and retrieved results aligns with our approach,
further validating its potential as a robust and flexible metric. Their research provides additional
evidence for the efectiveness of distribution-based comparisons in scenarios with sparse data,
complementing our exploration of FD for simulated search sessions.
3. Fréchet Distance for Evaluating Simulated Search Sessions
      </p>
      <sec id="sec-3-1">
        <title>3.1. Fréchet Distance</title>
        <p>
          The Fréchet distance is a measure of dissimilarity between two curves or trajectories. It can be
conceptualized as the minimum leash length required for a dog walking along one path while
its owner walks along another, with both potentially moving at diferent speeds [
          <xref ref-type="bibr" rid="ref14 ref15">14, 15</xref>
          ].
        </p>
        <p>Formally, given two curves  and  represented as sequences of points in a metric space, the
Fréchet distance  (, ) is computed as:
 (, ) = inf max ((()), (()))</p>
        <p>
          , ∈[
          <xref ref-type="bibr" rid="ref1">0,1</xref>
          ]
where  and  are continuous maps from [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ] to a metric space, and  and  are continuous,
non-decreasing surjection functions representing reparameterizations of [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ]. This formulation
ensures that neither the dog nor its owner can backtrack along their respective curves.
        </p>
        <p>
          The Fréchet distance can also be applied to assess the disparity between probability
distributions [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. For two normal univariate distributions  and  , the Fréchet Distance is given by:
 ( ,  ) = (  −   )2 + (  −   )2, where  and  represent the mean and standard deviation
of the distributions, respectively. This versatility makes the Fréchet distance a powerful tool for
comparing both geometric curves and statistical distributions in various fields of study.
3.2. Fréchet Distance for Evaluating Simulated Search Sessions
The evaluation of simulated search sessions using Fréchet Distance provides a robust method
for assessing the quality of simulation models in Information Retrieval (IR). This approach
considers both the semantic content and sequential nature of user actions within search sessions.
Let  represent a set of  simulated search sessions, where each session   consists of a sequence
of user actions
        </p>
        <p>. These actions may include queries, clicks, scrolls, or other interactions with
the search engine.   denotes the set of ideal or expected actions for the sessions in  . The
function  (</p>
        <p>) generates a sequence of actions for a given simulated session   , producing  
To apply Fréchet Distance, we map the actions to a suitable embedding space using function

.
 , which transforms any action into a  -dimensional vector. This embedding captures both
semantic and behavioral aspects of each action. The Fréchet Distance for Simulated Search
Sessions (   ) is then calculated as:
  
=  ((
 ), ( ()))
Here, FD measures the distance between the distribution of the set embeddings of the ideal
actions  (
 ) and those of the simulated actions  ( ())
. A lower  
similarity between simulated and ideal actions, suggesting better performance of the simulation
model 
on the session set  . To account for the sequential nature of search sessions, we extend

 indicates higher
this measure to consider the order of actions within each session:</p>
        <p>=
1
||   ∈
∑  ((
 ), ( (

 )))
(5)
(6)
This sequential Fréchet Distance (</p>
        <p>) calculates the average Fréchet Distance between ideal
and simulated action sequences for each session. This provides a more nuanced evaluation of the
simulation model’s ability to capture the temporal dynamics of user behavior in search sessions.
By incorporating both semantic content and sequential information, this approach ofers a
comprehensive evaluation framework for simulated search sessions, enabling researchers to
assess and improve the fidelity of their simulation models in IR experiments.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Experimental Setup</title>
      <p>In this section, we describe the general settings of our experiments, including the dataset,
traditional evaluation metrics, click models, simulation framework, and the embeddings used to
represent user search sessions.</p>
      <sec id="sec-4-1">
        <title>4.1. Dataset</title>
        <p>
          This study employs the TREC Session 2014 Track dataset [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], which is designed for evaluating
multi-query search behavior. The dataset comprises 1,257 sessions with 4,680 queries and 1,685
clicks, averaging 4.33 queries per session (median: 2). It includes real user queries, interaction
logs, and ranked document lists with snippets, making it ideal for simulated search session
evaluation. The dataset’s diverse composition allows for a comprehensive analysis of user
interactions and search strategies in multi-query scenarios, enabling robust conclusions about
the efectiveness of various information retrieval techniques.
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Click Models and Simulation Framework</title>
        <p>
          This study employs a comprehensive approach to simulate user search sessions, utilizing both
traditional probabilistic graphical models and a neural click model. The traditional models
include the Position-Based Model (PBM) [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ], User Browsing Model (UBM) [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], Dependent
Click Model (DCM) [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ], and Dynamic Bayesian Network Model (DBN) [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ], implemented
using the PyClick library. Additionally, we incorporate the neural click model NCM [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ] to
enhance simulation diversity. To create complete user search sessions, we use the SimIIR 2.0
framework [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ], which simulates complex user behaviors including query formulation, result
list examination, and click decisions. This integrated approach allows for a comprehensive
assessment of FD’s ability to quantify the quality of simulated search sessions across a broad
spectrum of user behaviors.
        </p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Embeddings for Search Session Representation</title>
        <p>
          This study explores two approaches for embedding search sessions to apply the Fréchet
Distance metric. The first approach uses action-based embeddings, employing a fine-tuned BERT
model [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ] to embed individual user actions and aggregating them through mean pooling or
sequence modeling. The second approach adapts Doc2Vec [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ] to create Session2Vec [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ],
learning fixed-length vector representations of entire search sessions. Both methods aim to
capture semantic and behavioral aspects of user search sessions. For query and document
representations, pre-trained word embeddings are used, with query embeddings computed
as the average of query word embeddings and document embeddings derived from title and
snippet words. These approaches provide insights into efectively capturing search behavior
nuances and improving simulated search session evaluation using the Fréchet Distance metric.
        </p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4. Evaluation Process</title>
        <p>Our evaluation process involves generating simulated search sessions using various click models
and SimIIR 2.0, then embedding these sessions along with ground truth sessions from the TREC
Session 2014 Track dataset. We compare simulation approaches by calculating the Fréchet
Distance between simulated and ground truth session embedding distributions. To account
for temporal dynamics, we also compute the sequential Fréchet Distance (    ) as defined
in equation (6). The efectiveness of Fréchet Distance as an evaluation metric is assessed
by comparing FD scores with traditional metrics and analyzing results across diferent click
models and simulation configurations. This approach provides insights into the strengths and
limitations of various simulation methods in replicating realistic user search behavior.
5. Evaluating Search Sessions with Minimal Interaction Data
This section explores the efectiveness of the Fréchet Distance (FD) in assessing the quality of
simulated search sessions with limited interaction data. We analyze the performance of various
click models and the SimIIR 2.0 framework on the TREC Session 2014 Track dataset, comparing
the FD metric with traditional session-based metrics.</p>
        <p>Our study utilizes a subset of 200 sessions from the TREC Session 2014 Track, each containing
an average of 4-5 queries. We generate simulated interactions using click models and the
SimIIR 2.0 framework, then compute the FD between simulated and ground truth sessions using
action-based embeddings and Session2Vec representations.</p>
        <p>Table 1 presents the performance of diferent simulation approaches using traditional metrics
(nDCG@10 and ERR@10) and Fréchet Distance metrics (FD@1 and FD@10). The results show
that FD efectively quantifies the quality of simulated search sessions. The PBM model, being
the simplest, shows the highest FD values, indicating the largest discrepancy from ground
truth sessions. Conversely, SimIIR 2.0, incorporating more complex user behaviors, achieves
the lowest FD values, suggesting simulations closest to the ground truth. FD aligns well with
traditional evaluation metrics, consistently ranking simulation approaches. This indicates that
FD can efectively capture simulation quality even with minimal interaction data. Bootstrap
analysis (Figure 1) confirms these patterns across diferent samples, with narrow confidence
intervals indicating stability. The FD metric demonstrates sensitivity to simulation model
complexity, with more sophisticated models like NCM and SimIIR 2.0 achieving lower FD
scores. However, its discriminative power decreases when comparing closely performing
models. Overall, Fréchet Distance proves to be a promising metric for evaluating simulated
search sessions, especially with minimal interaction data, complementing traditional evaluation
metrics in interactive information retrieval.
6. Evaluating Search Sessions with Extensive Interaction Data
This section examines the efectiveness of the Fréchet Distance (FD) in assessing simulated
search sessions with extensive interaction data. Using a subset of the TREC Session 2014
Track dataset, we focus on longer and more complex sessions to address whether FD can
accurately evaluate the quality of simulated search sessions with extensive interaction data.
Our experimental setup involved selecting 100 sessions from the dataset, each containing at
least 10 queries and a rich set of user interactions. We generated simulated interactions using
various click models and the SimIIR 2.0 framework, computing the FD between simulated and
ground truth sessions using both action-based embeddings and Session2Vec representations.
To investigate the impact of interaction data quantity, we evaluated the simulations using full
sessions, first 5 queries, and first 10 queries.</p>
        <p>Results in Table 2 demonstrate that Fréchet Distance (FD) efectively quantifies the quality of
simulated search sessions across varying amounts of interaction data. FD scores for full sessions
are generally lower than those for partial sessions, indicating improved simulation accuracy
with more interaction data. Figure 2 shows a strong negative correlation between nDCG@10
and FD scores for full sessions suggests alignment with traditional evaluation metrics.</p>
        <p>Bootstrap analysis, using 1000 subsets of 50 sessions each, revealed narrow confidence
intervals for both ERR@10 and FD scores, indicating stability across diferent session subsets.
High correlations between FD scores for full and partial sessions (5Q and 10Q) suggest that
FD maintains discriminative power even when evaluating partial sessions (i.e., Table 3). FD
demonstrates several advantages, including consistency across diferent amounts of interaction
data, robustness in assessments, sensitivity to simulation complexity, and alignment with
traditional metrics. However, computational complexity may increase with larger datasets,
warranting future exploration of eficient approximation methods.</p>
        <p>In conclusion, Fréchet Distance proves to be an efective and robust metric for evaluating
simulated search sessions, particularly with extensive interaction data. Its ability to capture
distributional similarities between simulated and ground truth sessions makes it valuable for
assessing and improving search session simulation models.
7. Correlation Analysis with Session Similarity Metrics
This section examines the correlation between the Fréchet Distance (FD) and other established
metrics used to evaluate the similarity of simulated user sessions in Information Retrieval (IR)
systems. The study aims to understand how FD’s performance in measuring simulated search
sessions correlates with other session similarity metrics.</p>
        <p>We employ the TREC Session 2014 dataset, selecting 200 sessions of varying lengths and
complexities. Simulated sessions are generated using click models and the SimIIR 2.0 framework.
FD is compared with Session nDCG (sDCG), Expected Global Utility (EGU), Path-based Session
Evaluation (PSE), and Interactive IR Probability Ranking Principle (IIR-PRP) based metric.</p>
        <p>Results show strong negative correlations between FD and all other metrics (i.e., Table 4),
with the strongest correlation observed with PSE. As FD decreases, indicating better simulation
quality, other metrics tend to increase.</p>
        <p>Correlation analysis for diferent session lengths reveals that the relationship between FD
and other metrics strengthens as session length increases as shown in Table 5, suggesting FD’s
efectiveness in capturing complex interaction patterns in longer sessions. The study supports
FD as a valid and efective measure for evaluating simulated search sessions, demonstrating
consistency with established metrics and sensitivity to session complexity.</p>
        <p>However, limitations include dataset specificity, simulation model dependency, and the need
for further investigation into metric assumptions and computational eficiency. Future work
should validate findings on other datasets, explore a wider range of simulation approaches, and
correlate metric-based assessments with human judgments of session similarity. In conclusion,
this study supports the use of FD as a valuable tool in evaluating IR systems, particularly for
complex, multi-query sessions, potentially ofering complementary insights to existing metrics
and enhancing the assessment and improvement of IR systems in interactive, session-based
search contexts.
8. Sensitivity to Feature Extraction in Simulated Sessions
This section examines the impact of various feature extraction methods on the Fréchet Distance
when evaluating simulated search sessions. The study utilizes the TREC Session 2014 dataset,
selecting 300 sessions of varying lengths and complexities. Five feature extraction methods are
investigated: Query-based Embedding (QBE), Action Sequence Embedding (ASE), Session2Vec
(S2V), BERT-based Session Embedding (BSE), and Time-aware Session Embedding (TSE).</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>9. Conclusion and Future Work</title>
      <p>This paper explores the application of Fréchet Distance (FD) as a novel metric for evaluating
simulated search sessions in interactive information retrieval systems. Our experiments
demonstrate FD’s efectiveness and robustness in assessing the quality of simulated user interactions
across various scenarios.</p>
      <p>FD shows strong correlations with established metrics like session nDCG and Expected Global
Utility, capturing similar aspects of session quality while ofering additional insights due to
its distributional nature. It proves particularly efective in evaluating longer, more complex
sessions and demonstrates efectiveness even with minimal interaction data. These findings
have significant implications for interactive information retrieval, potentially leading to more
accurate and realistic simulation models. Future research directions include investigating FD’s
performance with multi-modal session representations, extending this work to larger datasets,
correlating FD assessments with human judgments, and exploring FD in evaluating generative
models for search session simulation.</p>
      <p>While our study ofers valuable insights, it’s important to acknowledge certain limitations.
FD requires sets of sessions for evaluation and assumes multivariate normal distributions, which
may not always hold for all types of session data. Additionally, as an unbounded metric, the
interpretation of FD scores may vary depending on dataset characteristics and sample sizes.</p>
      <p>Despite these limitations, we believe that FD ofers a powerful and flexible approach to
evaluating simulated search sessions, complementing existing metrics and potentially driving
improvements in both simulation models and real-world information retrieval systems.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>B.</given-names>
            <surname>Carterette</surname>
          </string-name>
          , E. Kanoulas,
          <string-name>
            <surname>M. M. Hall</surname>
            ,
            <given-names>P. D.</given-names>
          </string-name>
          <string-name>
            <surname>Clough</surname>
          </string-name>
          ,
          <article-title>Overview of the TREC 2014 session track</article-title>
          , in: E. M.
          <string-name>
            <surname>Voorhees</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          . Ellis (Eds.),
          <source>Proceedings of The Twenty-Third Text REtrieval Conference</source>
          , TREC 2014, Gaithersburg, Maryland, USA, November
          <volume>19</volume>
          -
          <issue>21</issue>
          ,
          <year>2014</year>
          , volume
          <volume>500</volume>
          -308 of NIST Special Publication,
          <source>National Institute of Standards and Technology (NIST)</source>
          ,
          <year>2014</year>
          . URL: http://trec.nist.gov/pubs/trec23/papers/overview-session.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>K.</given-names>
            <surname>Järvelin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. L.</given-names>
            <surname>Price</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. M. L.</given-names>
            <surname>Delcambre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Nielsen</surname>
          </string-name>
          ,
          <article-title>Discounted cumulated gain based evaluation of multiple-query IR sessions</article-title>
          , in: C.
          <string-name>
            <surname>Macdonald</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <string-name>
            <surname>Ounis</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Plachouras</surname>
            ,
            <given-names>I. Ruthven</given-names>
          </string-name>
          ,
          <string-name>
            <surname>R. W.</surname>
          </string-name>
          White (Eds.),
          <source>Advances in Information Retrieval , 30th European Conference on IR Research</source>
          , ECIR
          <year>2008</year>
          , Glasgow, UK, March 30-April 3,
          <year>2008</year>
          . Proceedings, volume
          <volume>4956</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2008</year>
          , pp.
          <fpage>4</fpage>
          -
          <lpage>15</lpage>
          . URL: https://doi.org/10.1007/978-3-
          <fpage>540</fpage>
          -78646-
          <issue>7</issue>
          _4. doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>540</fpage>
          - 78646- 7\_4.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lad</surname>
          </string-name>
          ,
          <article-title>Modeling expected utility of multi-session information distillation</article-title>
          , in: L.
          <string-name>
            <surname>Azzopardi</surname>
            , G. Kazai,
            <given-names>S. E.</given-names>
          </string-name>
          <string-name>
            <surname>Robertson</surname>
            ,
            <given-names>S. M.</given-names>
          </string-name>
          <string-name>
            <surname>Rüger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Shokouhi</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Song</surname>
          </string-name>
          , E. Yilmaz (Eds.),
          <source>Advances in Information Retrieval Theory</source>
          ,
          <source>Second International Conference on the Theory of Information Retrieval</source>
          ,
          <string-name>
            <surname>ICTIR</surname>
          </string-name>
          <year>2009</year>
          , Cambridge, UK,
          <source>September 10-12</source>
          ,
          <year>2009</year>
          , Proceedings, volume
          <volume>5766</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2009</year>
          , pp.
          <fpage>164</fpage>
          -
          <lpage>175</lpage>
          . URL: https://doi.org/10.1007/978-3-
          <fpage>642</fpage>
          -04417-5_
          <fpage>15</fpage>
          . doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>642</fpage>
          - 04417- 5\_
          <fpage>15</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>E.</given-names>
            <surname>Kanoulas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Carterette</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. D.</given-names>
            <surname>Clough</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Sanderson</surname>
          </string-name>
          ,
          <article-title>Evaluating multi-query sessions</article-title>
          , in: W. Ma, J. Nie,
          <string-name>
            <given-names>R.</given-names>
            <surname>Baeza-Yates</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Chua</surname>
          </string-name>
          , W. B.
          <string-name>
            <surname>Croft</surname>
          </string-name>
          (Eds.),
          <source>Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          ,
          <string-name>
            <surname>SIGIR</surname>
          </string-name>
          <year>2011</year>
          , Beijing, China,
          <source>July 25-29</source>
          ,
          <year>2011</year>
          , ACM,
          <year>2011</year>
          , pp.
          <fpage>1053</fpage>
          -
          <lpage>1062</lpage>
          . URL: https://doi.org/10. 1145/2009916.2010056. doi:
          <volume>10</volume>
          .1145/2009916.2010056.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Chuklin</surname>
          </string-name>
          , I. Markov, M. de Rijke,
          <article-title>Click Models for Web Search</article-title>
          ,
          <source>Synthesis Lectures on Information Concepts</source>
          , Retrieval, and Services, Morgan &amp; Claypool Publishers,
          <year>2015</year>
          . URL: https://doi.org/10.2200/S00654ED1V01Y201507ICR043. doi:
          <volume>10</volume>
          .2200/ S00654ED1V01Y201507ICR043.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>N.</given-names>
            <surname>Fuhr</surname>
          </string-name>
          ,
          <article-title>A probability ranking principle for interactive information retrieval</article-title>
          ,
          <source>Inf. Retr</source>
          .
          <volume>11</volume>
          (
          <year>2008</year>
          )
          <fpage>251</fpage>
          -
          <lpage>265</lpage>
          . URL: https://doi.org/10.1007/s10791-008-9045-0. doi:
          <volume>10</volume>
          .1007/ S10791- 008- 9045- 0.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Zerhoudi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Granitzer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Seifert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Schloetterer</surname>
          </string-name>
          ,
          <article-title>Evaluating simulated user interaction and search behaviour</article-title>
          , in: M.
          <string-name>
            <surname>Hagen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Verberne</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Macdonald</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Seifert</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Balog</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Nørvåg</surname>
          </string-name>
          , V. Setty (Eds.),
          <source>Advances in Information Retrieval - 44th European Conference on IR Research</source>
          , ECIR
          <year>2022</year>
          , Stavanger, Norway,
          <source>April 10-14</source>
          ,
          <year>2022</year>
          , Proceedings,
          <string-name>
            <surname>Part</surname>
            <given-names>II</given-names>
          </string-name>
          , volume
          <volume>13186</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2022</year>
          , pp.
          <fpage>240</fpage>
          -
          <lpage>247</lpage>
          . URL: https://doi.org/10.1007/978-3-
          <fpage>030</fpage>
          -99739-7_
          <fpage>28</fpage>
          . doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>030</fpage>
          - 99739- 7\_
          <fpage>28</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Efrat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. J.</given-names>
            <surname>Guibas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Har-Peled</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. S. B.</given-names>
            <surname>Mitchell</surname>
          </string-name>
          ,
          <string-name>
            <surname>T. M. Murali</surname>
          </string-name>
          ,
          <article-title>New similarity measures between polylines with applications to morphing and polygon sweeping</article-title>
          ,
          <source>Discret. Comput. Geom</source>
          .
          <volume>28</volume>
          (
          <year>2002</year>
          )
          <fpage>535</fpage>
          -
          <lpage>569</lpage>
          . URL: https://doi.org/10.1007/s00454-002-2886-1. doi:
          <volume>10</volume>
          .1007/ S00454- 002- 2886- 1.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>E. J.</given-names>
            <surname>Keogh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Pazzani</surname>
          </string-name>
          ,
          <article-title>Scaling up dynamic time warping to massive dataset</article-title>
          , in: J.
          <string-name>
            <surname>M. Zytkow</surname>
          </string-name>
          , J. Rauch (Eds.),
          <source>Principles of Data Mining and Knowledge Discovery, Third European Conference, PKDD '99</source>
          , Prague, Czech Republic,
          <source>September 15-18</source>
          ,
          <year>1999</year>
          , Proceedings, volume
          <volume>1704</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>1999</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>11</lpage>
          . URL: https://doi.org/10.1007/978-3-
          <fpage>540</fpage>
          -48247-
          <issue>5</issue>
          _1. doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>540</fpage>
          -48247-5\_1.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>S.</given-names>
            <surname>Kwong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Man</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Chau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Tang</surname>
          </string-name>
          ,
          <article-title>Parallel genetic-based hybrid pattern matching algorithm for isolated word recognition</article-title>
          ,
          <source>Int. J. Pattern Recognit. Artif. Intell</source>
          .
          <volume>12</volume>
          (
          <year>1998</year>
          )
          <fpage>573</fpage>
          -
          <lpage>594</lpage>
          . URL: https://doi.org/10.1142/S0218001498000348. doi:
          <volume>10</volume>
          .1142/ S0218001498000348.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Shin</surname>
          </string-name>
          ,
          <article-title>Optimization of subsequence matching under time warping in time-series databases</article-title>
          , in: H.
          <string-name>
            <surname>Haddad</surname>
            ,
            <given-names>L. M.</given-names>
          </string-name>
          <string-name>
            <surname>Liebrock</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Omicini</surname>
          </string-name>
          , R. L. Wainwright (Eds.),
          <source>Proceedings of the 2005 ACM Symposium on Applied Computing (SAC)</source>
          ,
          <source>Santa Fe</source>
          , New Mexico, USA, March
          <volume>13</volume>
          -17,
          <year>2005</year>
          , ACM,
          <year>2005</year>
          , pp.
          <fpage>581</fpage>
          -
          <lpage>586</lpage>
          . URL: https://doi.org/10.1145/ 1066677.1066814. doi:
          <volume>10</volume>
          .1145/1066677.1066814.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Heusel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Ramsauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Unterthiner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Nessler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hochreiter</surname>
          </string-name>
          ,
          <article-title>Gans trained by a two time-scale update rule converge to a local nash equilibrium</article-title>
          , in: I. Guyon, U. von Luxburg, S. Bengio,
          <string-name>
            <given-names>H. M.</given-names>
            <surname>Wallach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fergus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. V. N.</given-names>
            <surname>Vishwanathan</surname>
          </string-name>
          , R. Garnett (Eds.),
          <source>Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, December 4-9</source>
          ,
          <year>2017</year>
          , Long Beach, CA, USA,
          <year>2017</year>
          , pp.
          <fpage>6626</fpage>
          -
          <lpage>6637</lpage>
          . URL: https://proceedings.neurips.cc/paper/2017/hash/ 8a1d694707eb0fefe65871369074926d-Abstract.html.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>N.</given-names>
            <surname>Arabzadeh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. L. A.</given-names>
            <surname>Clarke</surname>
          </string-name>
          ,
          <article-title>Fréchet distance for ofline evaluation of information retrieval systems with sparse labels</article-title>
          , in: Y. Graham, M. Purver (Eds.),
          <source>Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2024 - Volume</source>
          <volume>1</volume>
          :
          <string-name>
            <given-names>Long</given-names>
            <surname>Papers</surname>
          </string-name>
          , St.
          <source>Julian's, Malta, March</source>
          <volume>17</volume>
          -22,
          <year>2024</year>
          , Association for Computational Linguistics,
          <year>2024</year>
          , pp.
          <fpage>420</fpage>
          -
          <lpage>431</lpage>
          . URL: https://aclanthology.org/
          <year>2024</year>
          .
          <article-title>eacl-long</article-title>
          .
          <volume>26</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>T.</given-names>
            <surname>Eiter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Mannila</surname>
          </string-name>
          ,
          <article-title>Computing discrete fréchet distance (</article-title>
          <year>1994</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>H.</given-names>
            <surname>Alt</surname>
          </string-name>
          ,
          <article-title>The computational geometry of comparing shapes</article-title>
          , in: S. Albers,
          <string-name>
            <given-names>H.</given-names>
            <surname>Alt</surname>
          </string-name>
          , S. Näher (Eds.),
          <string-name>
            <surname>Eficient</surname>
            <given-names>Algorithms</given-names>
          </string-name>
          , Essays Dedicated to Kurt
          <source>Mehlhorn on the Occasion of His 60th Birthday</source>
          , volume
          <volume>5760</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2009</year>
          , pp.
          <fpage>235</fpage>
          -
          <lpage>248</lpage>
          . URL: https://doi.org/10.1007/978-3-
          <fpage>642</fpage>
          -03456-5_
          <fpage>16</fpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>642</fpage>
          -03456-5\ _
          <fpage>16</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>N.</given-names>
            <surname>Craswell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Zoeter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Taylor</surname>
          </string-name>
          , B.
          <string-name>
            <surname>Ramsey</surname>
          </string-name>
          ,
          <article-title>An experimental comparison of click position-bias models</article-title>
          , in: M.
          <string-name>
            <surname>Najork</surname>
            ,
            <given-names>A. Z.</given-names>
          </string-name>
          <string-name>
            <surname>Broder</surname>
          </string-name>
          , S. Chakrabarti (Eds.),
          <source>Proceedings of the International Conference on Web Search and Web Data Mining, WSDM</source>
          <year>2008</year>
          , Palo Alto, California, USA, February
          <volume>11</volume>
          -
          <issue>12</issue>
          ,
          <year>2008</year>
          , ACM,
          <year>2008</year>
          , pp.
          <fpage>87</fpage>
          -
          <lpage>94</lpage>
          . URL: https: //doi.org/10.1145/1341531.1341545. doi:
          <volume>10</volume>
          .1145/1341531.1341545.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>G.</given-names>
            <surname>Dupret</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Piwowarski</surname>
          </string-name>
          ,
          <article-title>A user browsing model to predict search engine click data from past observations</article-title>
          , in: S. Myaeng,
          <string-name>
            <given-names>D. W.</given-names>
            <surname>Oard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Sebastiani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Chua</surname>
          </string-name>
          , M. Leong (Eds.),
          <source>Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          ,
          <string-name>
            <surname>SIGIR</surname>
          </string-name>
          <year>2008</year>
          , Singapore,
          <source>July 20-24</source>
          ,
          <year>2008</year>
          , ACM,
          <year>2008</year>
          , pp.
          <fpage>331</fpage>
          -
          <lpage>338</lpage>
          . URL: https://doi.org/10.1145/1390334.1390392. doi:
          <volume>10</volume>
          .1145/1390334. 1390392.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>F.</given-names>
            <surname>Guo</surname>
          </string-name>
          , C. Liu,
          <string-name>
            <given-names>Y. M.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Eficient multiple-click models in web search</article-title>
          , in: R. BaezaYates, P. Boldi,
          <string-name>
            <given-names>B. A.</given-names>
            <surname>Ribeiro-Neto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. B.</given-names>
            <surname>Cambazoglu</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the Second International Conference on Web Search and Web Data Mining, WSDM</source>
          <year>2009</year>
          , Barcelona, Spain, February 9-
          <issue>11</issue>
          ,
          <year>2009</year>
          , ACM,
          <year>2009</year>
          , pp.
          <fpage>124</fpage>
          -
          <lpage>131</lpage>
          . URL: https://doi.org/10.1145/1498759. 1498818. doi:
          <volume>10</volume>
          .1145/1498759.1498818.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>O.</given-names>
            <surname>Chapelle</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y. Zhang,</surname>
          </string-name>
          <article-title>A dynamic bayesian network click model for web search ranking</article-title>
          , in: J.
          <string-name>
            <surname>Quemada</surname>
            , G. León,
            <given-names>Y. S.</given-names>
          </string-name>
          <string-name>
            <surname>Maarek</surname>
          </string-name>
          , W. Nejdl (Eds.),
          <source>Proceedings of the 18th International Conference on World Wide Web, WWW</source>
          <year>2009</year>
          , Madrid, Spain,
          <source>April 20-24</source>
          ,
          <year>2009</year>
          , ACM,
          <year>2009</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          . URL: https://doi.org/10.1145/1526709.1526711. doi:
          <volume>10</volume>
          .1145/1526709.1526711.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>A.</given-names>
            <surname>Borisov</surname>
          </string-name>
          , I. Markov, M. de Rijke,
          <string-name>
            <given-names>P.</given-names>
            <surname>Serdyukov</surname>
          </string-name>
          ,
          <article-title>A neural click model for web search</article-title>
          , in: J.
          <string-name>
            <surname>Bourdeau</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Hendler</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Nkambou</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>B. Y.</given-names>
          </string-name>
          <string-name>
            <surname>Zhao</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the 25th International Conference on World Wide Web, WWW</source>
          <year>2016</year>
          , Montreal, Canada,
          <source>April 11 - 15</source>
          ,
          <year>2016</year>
          , ACM,
          <year>2016</year>
          , pp.
          <fpage>531</fpage>
          -
          <lpage>541</lpage>
          . URL: https://doi.org/10.1145/2872427.2883033. doi:
          <volume>10</volume>
          .1145/2872427.2883033.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>S.</given-names>
            <surname>Zerhoudi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Günther</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Plassmeier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Borst</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Seifert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hagen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Granitzer</surname>
          </string-name>
          ,
          <article-title>The simiir 2.0 framework: User types, markov model-based interaction simulation, and advanced query generation</article-title>
          , in: M. A.
          <string-name>
            <surname>Hasan</surname>
          </string-name>
          , L. Xiong (Eds.),
          <source>Proceedings of the 31st ACM International Conference on Information &amp; Knowledge Management</source>
          , Atlanta,
          <string-name>
            <surname>GA</surname>
          </string-name>
          , USA, October
          <volume>17</volume>
          -
          <issue>21</issue>
          ,
          <year>2022</year>
          , ACM,
          <year>2022</year>
          , pp.
          <fpage>4661</fpage>
          -
          <lpage>4666</lpage>
          . URL: https://doi.org/10.1145/3511808.3557711. doi:
          <volume>10</volume>
          .1145/3511808.3557711.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>F. A.</given-names>
            <surname>Acheampong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Nunoo-Mensah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <article-title>Transformer models for text-based emotion detection: a review of bert-based approaches</article-title>
          ,
          <source>Artif. Intell. Rev</source>
          .
          <volume>54</volume>
          (
          <year>2021</year>
          )
          <fpage>5789</fpage>
          -
          <lpage>5829</lpage>
          . URL: https://doi.org/10.1007/s10462-021-09958-2. doi:
          <volume>10</volume>
          .1007/S10462- 021- 09958- 2.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>Q. V.</given-names>
            <surname>Le</surname>
          </string-name>
          , T. Mikolov,
          <article-title>Distributed representations of sentences and documents</article-title>
          ,
          <source>in: Proceedings of the 31th International Conference on Machine Learning, ICML 2014</source>
          , Beijing, China,
          <fpage>21</fpage>
          -
          <lpage>26</lpage>
          June 2014, volume
          <volume>32</volume>
          <source>of JMLR Workshop and Conference Proceedings, JMLR.org</source>
          ,
          <year>2014</year>
          , pp.
          <fpage>1188</fpage>
          -
          <lpage>1196</lpage>
          . URL: http://proceedings.mlr.press/v32/le14.html.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>L.</given-names>
            <surname>Bing</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Niu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Lam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Learning a semantic space of web search via session data</article-title>
          , in: S. Ma, J. Wen,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Dou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. X.</given-names>
            <surname>Zhao</surname>
          </string-name>
          (Eds.),
          <source>Information Retrieval Technology - 12th Asia Information Retrieval Societies Conference, AIRS 2016</source>
          , Beijing, China,
          <source>November 30 - December 2</source>
          ,
          <year>2016</year>
          , Proceedings, volume
          <volume>9994</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2016</year>
          , pp.
          <fpage>83</fpage>
          -
          <lpage>97</lpage>
          . URL: https://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          -48051-
          <issue>0</issue>
          _7. doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>319</fpage>
          - 48051- 0\_7.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>