<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>International Journal of Human</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.1145/3078072.3084330</article-id>
      <article-id pub-id-type="urn">.kb.se/resolve?urn=urn:</article-id>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Theo Huibers</string-name>
          <email>t.w.c.huibers@utwente.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Thomas Beelen</string-name>
          <email>t.h.j.beelen@utwente.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Khiet P. Truong</string-name>
          <email>k.p.truong@utwente.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Roeland Ordelman</string-name>
          <email>roeland.ordelman@utwente.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ella Velner</string-name>
          <email>p.c.velner@utwente.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vanessa Evers</string-name>
          <email>vanessa.evers@ntu.edu.sg</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>NTU Institute of Science and Technology for Humanity</institution>
          ,
          <addr-line>50 Nanyang Avenue, 639798</addr-line>
          ,
          <country>Singapore, Singapore</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Twente</institution>
          ,
          <addr-line>Drienerlolaan 5, 7522NB, Enschede</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Workshop Proce dings</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <volume>2</volume>
      <fpage>10</fpage>
      <lpage>12</lpage>
      <abstract>
        <p>Increasingly, search engines can be accessed via speech, approach employs open-ended elicitation questions to by search engines on a computer, as well as by VAs. Chil- templates that do not require any domain knowledge. often through Voice Assistants (VAs). As outlined before by Beelen et al. [1], children are insuficiently supported by technology during their search process, both dren have more dificulty in formulating efective search queries that represent their information need well due to a smaller knowledge base and vocabulary [2]. Using speech does not inherently solve these obstacles. Furthermore, most VAs provide only a limited question-answer interaction style where the agent directly tries to find results based on the initial query. This interaction style causes several issues for children. Firstly, it is necessary to put all the required context into one query, which they struggle with [3, 4]. Secondly, usually it is not possible to ask follow-up questions, a functionality that children often expect [ 5, 3]. Lastly, children are typically not supported in formulating queries, for instance by query suggestions or clarifying questions [4, 6].</p>
      </abstract>
      <kwd-group>
        <kwd>are added to the memory until a threshold is</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>LGOBE
0000-0002-5650-2830 (V. Evers); 0000-0002-9837-8639
(T. Huibers)</p>
    </sec>
    <sec id="sec-2">
      <title>3. Proposed system</title>
      <sec id="sec-2-1">
        <title>We first develop a robot that uses simplified conversa</title>
        <p>MICROS’22: Mixed-Initiative ConveRsatiOnal Systems workshop at leveraging multiple turns. Furthermore, we are
interconversation with an introduction. It states its name, asks the home. In the future we change to a museum archive
the child for theirs, and says it is pleased to meet them. search task.</p>
        <p>
          Then the robot asks if the child wants help searching for
information, and the child can ask a question. Since the 4.1. Method
child’s speech may contain words that are not relevant
to the search (such as ”uhm, let’s see”), keywords need Our study is a within-subjects comparison with two
conto be extracted. In the pilot study described in section ditions. In one condition the robot uses the simplified
4, keyword extraction was done by matching against a conversational interaction style (see section 3), in the
pre-programmed list of possible keywords. Detected key- other it uses a question-answer interaction style. The
words are added to the robot’s memory. If the number order of the conditions was alternated between
particiof keywords is below a threshold, the robot will pose an pants. The Furhat robot [9] and its software environment
elicitation question. This threshold will be optimized in are used for both conditions. This keeps the two
experithe future. Too few keywords lead to an unspecific rank- mental conditions similar while the style of interaction
ing of results, while trying to elicit too many keywords is varied.
leads to a long interaction and possibly frustration. In There are two search tasks, one for each condition
the pilot study (section 4) this threshold was set to three (since condition order was alternated, each task was used
keywords, which was an estimate based on the number on diferent conditions). Both tasks are factoid questions
of words in the search tasks. The elicitation questions based on the work by Landoni et al. [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. The first was
are based on a simple pattern that includes the keywords to find out what hail is. The second was to find three
that were extracted so far. The two question patterns are endangered animal species. The tasks were presented
(translated from Dutch): in one sentence in Dutch on the task sheet. The search
results that could be retrieved were pre-programmed in
• What is it you want to know about [recognised this pilot, and were the same in both conditions. When
keywords]? presenting results, the robot mentions the website where
• What don’t you know yet about [recognised it found the information, the name of the article, and
keywords]? then reads aloud a snippet of the web page.
        </p>
        <p>In the question-answer condition, the robot mimics the
The word ”and” is added to the list of keywords where interaction of a commercial VA. This means it first waits
necessary. The robot loops over the questions and adds for a wake word, in this case “Hey robot”, or “Hey Furhat”.
new keywords until the threshold is met. After this phase, Then the robot’s LED ring lights up green to signal it
the robot will move on to present search results. Then the is awake, and a question can be asked. The wake word
robot asks the child if it can be of any further assistance. and question can also be combined into one statement.
Otherwise it goes to a closing interaction. The robot will then present results right away in the way
described above. After the results the robot goes back to
waiting for the wake word.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>4. Pilot study</title>
      <p>
        We conducted a pilot study as a first step to find out
if the simplified conversational approach elicits more
keywords (as described in the introduction), and how
children experience such a robot. Furthermore, the pilot
is a way to discover methodological issues that may still
be corrected for the main study. In the pilot, we evaluated
a robot that uses the simplified conversational search
approach. As described in the evaluation framework
by Landoni et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], studies in IR for children can be
described by the intended search strategy, for a particular
user group, given a task, in a certain context. Our user
group are children ages 10–12 years old. These children
are in the final years of primary school in the Netherlands
and are starting to work more on assignments such as
presentations. We compare our conversational robot
strategy to a traditional question-answer style interaction
that is common with commercial VAs. The task in the
experiments is searching information related to school
subjects. The context of the searches is at school or in
      </p>
      <sec id="sec-3-1">
        <title>4.2. Measures</title>
        <p>
          The measurements consist of observational notes, Likert
scales using emojis (Smileyometers [10]), logs containing
a raw transcript, and interview questions. The children
were also asked about their current VA usage.
Observations focused on how the children behaved and spoke to
the robot. The Smileyometer was used to gauge the user
experience of the interactions. The questions are based
on [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], they include questions on: fun, ease of finding
answers, intention to use again, kindness, and ease of
conversing with. The logs are a transcript of how the
robot interpreted the child and how it responded. This
is used to evaluate whether the approach is able to elicit
more keywords from children compared to the traditional
paradigm. The goal of the interview at the end is to study
children’s perception of the diferences and advantages
of the two systems, and to add more qualitative data on
their experience
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>4.3. Participants &amp; procedure</title>
        <p>Eleven children participated in the pilot at a local af- 2,50
ter school care over two days in June 2022. Children
whose parents consented joined the experiments. Our 2,00
user group are children ages 10-12, but the children that
participated were mostly younger (mean age = 8.8, SD = 1,50
1.3). We decided to also let younger children participate
due to the dificulty of recruiting children in our targeted 1,00
age range.</p>
        <p>The robot was set up on a table in a separate room 0,50
with an open door to the main area. The researcher sat at 0,00
the same table and height as the child and explained that
they will be talking to two versions of the same robot. It
was explained that the robots may not be fully functional
yet, and that the children are helping to further develop
them. The children were also explained that one of the
robots will start talking right away, and the other requires
a wake word. The children received a task sheet that
also includes the questionnaire questions. The interview
happened after interacting with both robots.</p>
      </sec>
      <sec id="sec-3-3">
        <title>4.4. Results</title>
        <p>The first day the speech recognition often did not
understand children correctly due to background noise.
Therefore, the second day a headset was used, and
Wizardof-Oz (undisclosed human operator) functionality was
added. This way the researcher could take over when
some responses were not understood correctly by the
automatic speech recognition. Due to speech recogniser
errors the logs were not usable for analysis.</p>
        <p>Enjoyment Easy searching with Use again</p>
        <p>Friendly / nice Easy talking with</p>
        <p>Observation Many children resorted to reading the Interview Four children preferred the question-answer
search task directly from the sheet. This caused the condition, while five preferred the conversational. Two
queries in both conditions to be similar. Mainly for children had no preference. Some of the interesting
stateyounger users the task seemed complex and they re- ments on the robots are in table 1. The statements
inquired more input from the researcher. The children dicate there is a potential trade-of between eficiency
around the target age seemed more comfortable with the and fun. There are also clear individual diferences, as
level of complexity, working more independently. In the some children seemed to enjoy talking more, while
othconversational condition, the system entered the elicita- ers preferred a fast interaction. Concerning participants’
tion question loop in many cases. Sometimes, the child VA usage, five children had no experience, three children
wrongfully assumed they already provided all the words used them a few times, and three used them frequently.
in the search task, and got confused by the elicitation The frequent users ask VAs about the weather, jokes, and
question. Especially younger children tended to look at finding information. No efects of prior VA usage on the
the researcher when they were unsure how to continue outcome could be deterined in this pilot. Some children
the interaction. This also happened at the elicitation gave tips to improve the robot. These tips were: an easier
questions where the researcher had to give a hint. In to understand voice, and a touch screen on the robot’s
other cases the child kept the conversation going and face to be able to select search results visually as well.
answered the elicitation question naturally.</p>
        <sec id="sec-3-3-1">
          <title>Smileyometers The robots scored relatively similar in</title>
          <p>the Smileyometers. The results suggest that the
conversational robot may be more enjoyable to use and easier to
ifnd answers with. Children also indicated being slightly</p>
        </sec>
      </sec>
      <sec id="sec-3-4">
        <title>4.5. Conclusion and limitations</title>
        <p>Based on the pilot, the number of elicited keywords could
not yet be compared due to errors and task design.
Children rated the robots quite similar but possibly find the
conversational system more fun and easier to find
answers with. More children preferred the conversational
system. They perceived the robots as nearly equally
friendly. The interview answers shed light on the
individual preferences regarding the amount of conversation
during the search process.</p>
        <p>The pilot study also gave insights that influence the
method of the main study. Firstly, the small sample size
means the current findings have low confidence. In the
main study we will increase the sample size by relying on
cooperation with partners such as museums. Secondly,
background noise led to errors and required the addition
of Wizard-of-Oz controls. Another limitation is that most
participants were younger than our target audience. A
few years can have a significant developmental diference
in children, therefore the interaction may be less complex
for children in the target age. However, the complexity of
the interaction seemed suitable for the older participants
around the target age. Finally, the search tasks were
mostly read aloud from the task sheet, which likely afects
the naturalness of children’s queries. In the next section
we describe how a diferent task in the museum context
may address this.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>5. Future work</title>
      <p>The next step in creating the conversational robot is to
connect it the API of the Netherlands Institute for Sound
and Vision1, containing Dutch public broadcasting
media. In line with the tip by one of the participants, this
use case will also introduce a display for multi media
search results. The API connection will enable us to
study more natural search tasks and move away from
pre-programmed search results. The tasks that were
used in the pilot are fact finding and stated directly on
the task sheet. The API connection would allow children
to search for TV fragments that they come up with
themselves, which is a more open search task than fact finding.
This enables us to study children in a more natural
setting, where their query formulation process more closely
reflects a realistic scenario instead of reading from a task
sheet. Elicitation questions may become more useful in
this case. A more advanced keyword extraction from
speech method will need to be implemented as well, such
as the one by Habibi and Popescu-Belis [11]. The API
connected robot will be tested in a similar method as the
pilot study described above. The method compares the
style of interaction without changing other aspects about
the robot between conditions. The within-subjects setup
allowed children to reflect on diferences between the
systems and their preference. The Smileyometers worked
well even for participants that are younger than the
target audience. With our pilot findings we can account for</p>
      <sec id="sec-4-1">
        <title>1https://www.beeldengeluid.nl</title>
        <p>some important methodological issues. We look forward
to learning more about children’s conversational search
process in our main study.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <sec id="sec-5-1">
        <title>This research is supported by the Dutch SIDN fund https:</title>
        <p>//www.sidn.nl/ and TKI CLICKNL funding of the Dutch
Ministry of Economic Afairs https://www.clicknl.nl/.
We would also like to thank the staf at BSO Partou de
Vlinder for their time and cooperation.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>T.</given-names>
            <surname>Beelen</surname>
          </string-name>
          , E. Velner,
          <string-name>
            <given-names>R.</given-names>
            <surname>Ordelman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. P.</given-names>
            <surname>Truong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Evers</surname>
          </string-name>
          , T. Huibers,
          <article-title>Does your robot know? Enhancing children's information retrieval through spoken conversation with responsible robots</article-title>
          ,
          <source>arXiv:2106</source>
          .07931 [cs] (
          <year>2021</year>
          ). URL: http://arxiv.org/ abs/2106.07931, arXiv:
          <fpage>2106</fpage>
          .
          <fpage>07931</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>H.</given-names>
            <surname>Hutchinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Druin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. B.</given-names>
            <surname>Bederson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Reuter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rose</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. C.</given-names>
            <surname>Weeks</surname>
          </string-name>
          ,
          <article-title>How do I find blue books about dogs? The errors and frustrations of young digital library users</article-title>
          ,
          <source>Proceedings of HCII</source>
          <year>2005</year>
          (
          <year>2005</year>
          )
          <fpage>22</fpage>
          -
          <lpage>27</lpage>
          . Publisher: Citeseer.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S. B.</given-names>
            <surname>Lovato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Piper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. A.</given-names>
            <surname>Wartella</surname>
          </string-name>
          , Hey Google, Do Unicorns Exist?
          <article-title>Conversational Agents as a Path to Answers to Children's Questions</article-title>
          ,
          <source>in: Proceedings of the 18th ACM International Conference on Interaction Design and Children</source>
          , IDC '19,
          <string-name>
            <surname>Association</surname>
            for Computing Machinery, Boise,
            <given-names>ID</given-names>
          </string-name>
          , USA,
          <year>2019</year>
          , pp.
          <fpage>301</fpage>
          -
          <lpage>313</lpage>
          . URL: https://doi.org/10.1145/ 3311927.3323150. doi:
          <volume>10</volume>
          .1145/3311927.3323150.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Yarosh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Thompson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Watson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Chase</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Senthilkumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yuan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. J. B.</given-names>
            <surname>Brush</surname>
          </string-name>
          ,
          <article-title>Children asking questions: speech interface reformulations and personification preferences</article-title>
          ,
          <source>in: Proceedings of the 17th ACM Conference on Interaction Design and Children</source>
          , IDC '18,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, Trondheim, Norway,
          <year>2018</year>
          , pp.
          <fpage>300</fpage>
          -
          <lpage>312</lpage>
          . URL: https://doi.org/10.1145/3202185. 3202207. doi:
          <volume>10</volume>
          .1145/3202185.3202207.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Landoni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Matteri</surname>
          </string-name>
          , E. Murgia,
          <string-name>
            <given-names>T.</given-names>
            <surname>Huibers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Pera</surname>
          </string-name>
          , Sonny, Cerca!
          <article-title>Evaluating the Impact of Using a Vocal Assistant to Search at School</article-title>
          , in: F. Crestani,
          <string-name>
            <given-names>M.</given-names>
            <surname>Braschler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Savoy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rauber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Müller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. E.</given-names>
            <surname>Losada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. Heinatz</given-names>
            <surname>Bürki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Cappellato</surname>
          </string-name>
          , N. Ferro (Eds.),
          <source>Experimental IR Meets Multilinguality, Multimodality, and Interaction, Lecture Notes in Computer Science</source>
          , Springer International Publishing, Cham,
          <year>2019</year>
          , pp.
          <fpage>101</fpage>
          -
          <lpage>113</lpage>
          . doi:
          <volume>10</volume>
          .1007/ 978- 3-
          <fpage>030</fpage>
          - 28577-
          <issue>7</issue>
          _
          <fpage>6</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Druga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Williams</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Breazeal</surname>
          </string-name>
          , M. Resnick, ”
          <article-title>Hey Google is it OK if I eat you</article-title>
          ?”: Initial Explo-
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>