<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Do lab effects transfer into the real-world? And should we care?</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Geraldine Fitzpatrick HCI Group Vienna Uni of Technology Austria geraldine</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Paul Tennent Mixed Reality Lab University of Nottingham United Kingdom</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Petr Slovak HCI Group Vienna Uni of Technology Austria</institution>
        </aff>
      </contrib-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Presented at RepliCHI2013. Copyright c 2013 for the
individual papers by the papers authors. Copying permitted only for
private and academic purposes. This volume is published and
copyrighted by its editors.</p>
    </sec>
    <sec id="sec-2">
      <title>Introduction</title>
      <p>We report on two of our own studies, each of which has
built on a laboratory based nding and explored if and
how the e ects played out in everyday settings. In each,
we found e ects that in some ways validated the prior lab
studies, but each also pointed to very di erent
implications for HCI than those which were suggested by
the initial lab work. By lab-based work we mean here
empirical studies that are tightly controlled and aim to
uncover causal relationships.</p>
      <p>
        We would like to use these examples to open a discussion
within RepliCHI on whether or not the transferring of such
lab ndings into the eld is a speci c type of replication
that is especially important for HCI research, as (i) we
generally do want our systems/ ndings to transfer into
eld settings; and (ii) it is plausible to expect similar
results to the ones in our studies when transferring other
lab e ects into the real-world. We expect that further
discussion of this topic could well complement the existing
\into the wild" literature in HCI that now focuses more on
open-ended, in-situ exploration (e.g., see [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]).
      </p>
    </sec>
    <sec id="sec-3">
      <title>Moving e ects from the lab to the wild</title>
      <p>The two studies reported below come from di erent
projects, but each builds on a nding that was previously
rigorously analysed in the lab and seemed to be
potentially useful in HCI.</p>
      <sec id="sec-3-1">
        <title>Study 1: Feeling connected by sharing heartbeat</title>
        <p>
          The rst study picked up on a psychology experiment by
Janssen et. al. [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. Their work showed that sharing
heartbeat between people increased feelings of intimacy
and social connectedness. This e ect was shown not only
by statistically signi cant di erences in questionnaire
responses, but also by measuring changes in carefully
chosen non-verbal aspects of the interaction. Implications
of such ndings in HCI could be, for example, the use of
such an e ect to design systems supporting mutual
a ection in couples living apart or helping to create
stronger ties within families and other social groups etc.
To explore the potential for real-world application of the
observed e ect, we1 developed a simple technology probe
package based on a heart rate monitor belt paired with a
standard laptop through a Bluetooth connection. Ten
such packages were distributed amongst 5 couples,
encouraging each couple to use the probe in any way they
wish over the period of two weeks. Furthermore, we
invited additional pairs of friends into our lab, let them
experience the probe in social scenarios and interviewed
them about their reactions and ideas. We analysed the
interview and usage data by qualitative means, identifying
two distinct e ects appearing across most of our sample,
and suggested interpretations as to why the e ects
happen. The results were then presented at CHI'12 [9].
What was interesting in the study with regards to this
workshop is that while our results con rmed the initial
study in many ways, the implications for HCI were
strikingly di erent. For example, our participants reported
feeling much \closer" to another when talking about
situations closely resembling the lab study. However,
1That is, Joris Janssen and two of the authors of this workshop
paper (Geraldine Fitzpatrick and Petr Slovak)
people often felt actually \too close" in these moments,
describing the feeling, e.g., \as if a stranger in an elevator
was keeping eye-contact for a long time". In other
contexts, such as everyday use by the collocated couples,
heart rate sharing did not have any e ect at all (e.g. \as
we are already close enough, this changes nothing"). Such
results led us to suggest more speci c contexts and
situations where the e ects of heart rate sharing could be
used in positive ways, and better scope the potential
applications of the original nding.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Study 2: Linking empathy to synchrony of bio-signals</title>
        <p>
          In another study, we explored work done by Marci et. al.
in psychotherapy and psychophysiology [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. This work
focuses on interaction between patient and therapist, and
it links moments high in empathy to synchronised changes
in skin conductance levels of the therapist/patient pair.
For example, if the changes in skin conductance of patient
and therapists were synchronised for a particular segment,
external raters were more likely to rate such moments as
high in empathy, it would also correspond to higher values
in self-reported empathy etc.
        </p>
        <p>Such a link could be of interest for HCI, e.g., as a novel
indicator to embed in various a ective computing
systems, creating systems to support teaching of empathy
for psychotherapy students, workplace etc. However, the
original research was based on a very speci c setting
(therapy session) and participants with speci c skills
(therapist with many years of training to become highly
empathetic). As such, we were interested to test how
robustly the observed e ects appear in the types of
real-world settings that are of interest of HCI, but which
are also often full of distractions and potential intervening
variables that could not be controlled in real-world
deployment.</p>
        <p>We designed a study [10] in which pairs of friends
discussed a topic of their choice in a public house during
normal opening hours. The rationale was to test the
robustness of the link in a setting that is more extreme (in
terms of potential disruptions and intervening variables)
than those needed for the potential applications. In other
words, we argued that if the e ect is robust enough to
appear in a busy pub and for pairs of friends talking about
any topics of importance, it is then more likely to appear
also in a therapy students class, workplace setting or other
potential application contexts.</p>
        <p>
          Our results followed a similar pattern as in the rst study:
we have seen results that are in line with the original
work, but the implications for HCI application of these
have changed. For example, when we focused on
interactions where participants were instructed to discuss
their topics naturally, then thirty-seconds long video
snapshots chosen purely on the basis of high synchrony
showed also more empathy related non-verbal behaviour
(as judged by independent raters). This ts with the prior
lab results. However, we also found high synchrony in a
condition where we asked one of the participants to ignore
the other, i.e., where then little empathy could be
expected. Such inconsistencies led us to suggest a
re-interpretation of skin conductance synchrony { seeing it
not only as an indicator of empathy, but potentially as a
more general indicator of \mutual reactivity" (i.e., that
people emotionally react to each other). Such reactivity
then just happens to correspond well to empathy in the
right contexts, such as therapy session or a discussion of
two friends about an issue important for one of them. We
were able able to further support this hypothesis through
other psychological literature such as [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>Summary</title>
        <p>To summarise, each of the two studies have shown that
the expected e ects can appear also in an uncontrolled,
real-world setting, and are thus potentially robust enough
for HCI applications. However, and maybe more
importantly, each also clari ed and better scoped the
potential implications of the original nding for HCI.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Do we see a general pattern?</title>
      <p>
        Stepping away from the two examples here, it does seem
that, at least for results in psychology, transfer of e ects
from the lab to the eld is far from an obvious claim. For
example, Mitchell [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] shows in a recent meta-review that
many lab e ects either become much weaker when tested
in the eld, or even change direction entirely. Mitchell
also shows how the extent of such \failures to transfer"
di ers among various sub- elds of psychology.
      </p>
      <p>
        Can this be expected also of lab-based research in HCI?
To our knowledge, there is little literature on this within
HCI so far. It is also not discussed in the recent \into the
wild" literature, e.g., [
        <xref ref-type="bibr" rid="ref1 ref3">1, 3</xref>
        ], which seems to have a more
\open orientation towards nding out what happens and
drawing design principles or recommendations about
users' reactions" [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>We think it would be interesting to discuss in more depth
how this focus on lab-to- eld transfer of e ects di ers and
complements the existing work on research \in the wild".
One immediate di erence is the focus, i.e., whether a
well-understood lab e ect is robust enough to also appear
in more realistic (and thus messy) conditions. Among
other things, this will probably also raise methodological
questions, as the main aim of such work is to test if an
e ect appears (thus pointing to more quantitative,
experimental work), but in a setting where one cannot
control many of the potentially intervening variables. See
Oulavirta [8] for an initial discussion of similar topic in the
context of Pervasive computing.</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>We intended to demonstrate that examining whether the
results of lab studies appear robustly `in the wild' may be
a speci c kind of replication research, and one that could
be of signi cant bene t to the CHI community. Drawing
on our two studies, we saw that while the core e ect did
translate, the implications about how it might be used
within HCI were changed markedly. We referenced
additional literature in psychology suggesting that such
results might also be expected for other lab-based ndings.</p>
    </sec>
    <sec id="sec-6">
      <title>Short Bio</title>
      <p>Petr Slovak is a researcher and PhD student at the HCI
Group at Vienna University of Technology. Drawing on his
background in both psychology and computer science, his
research focuses on support for teaching of empathy in
medical and therapeutic settings, with speci c interest in
the use of biosensors.</p>
      <p>Paul Tennent is a researcher at the Mixed Reality Lab at
the University of Nottingham. He has worked on a
number of systems designed to support the transformation
of complex system log data into accountable, queryable
objects that can be used in qualitative analysis. More
recently he has turned to the analysis and representation
of biodata with a particular focus on television and the
public understanding of science.</p>
      <p>Geraldine Fitzpatrick is Professor at Vienna University
of Technology in Austria and heads the Institute of Design
and Assessment of Technology. She is interested in how
we design pervasive, tangible and ubiquitous technologies
to t in with everyday contexts, with a particular interest
in supporting social interaction and collaboration, and
health and well being.
Laboratory. Perspectives on Psychological Science 7,
2 (Mar. 2012), 109{117.
[8] Oulasvirta, A. Rethinking Experimental Designs for
Field Evaluations. IEEE Pervasive Computing 11, 4
(Oct. 2012), 60{67.
[9] Slovak, P., Janssen, J., and Fitzpatrick, G.</p>
      <p>Understanding heart rate sharing: towards unpacking
physiosocial space. In CHI '12 (2012), 859{868.
[10] Slovak, P., Tennent, P., and Fitzpatrick, G. Exploring
physiological synchrony in everyday meaningful
interactions. Submitted for Interact'13 (2013).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Brown</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reeves</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Sherwood</surname>
            ,
            <given-names>S. Into</given-names>
          </string-name>
          <article-title>the wild: Challenges and Opportunities for Field Trial Methods</article-title>
          .
          <source>In CHI '11</source>
          , ACM Press (May
          <year>2011</year>
          ),
          <fpage>1657</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Consolvo</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harrison</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>M. Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Everitt</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Froehlich</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Landay</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          <article-title>Conducting In Situ Evaluations for and With Ubiquitous Computing Technologies</article-title>
          .
          <source>International Journal of Human-Computer Interaction 22</source>
          ,
          <fpage>1</fpage>
          -
          <lpage>2</lpage>
          (
          <issue>Apr</issue>
          .
          <year>2007</year>
          ),
          <volume>103</volume>
          {
          <fpage>118</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Hutchinson</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hansen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roussel</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , Eiderback,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Mackay</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            ,
            <surname>Westerlund</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Bederson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. B.</given-names>
            ,
            <surname>Druin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Plaisant</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Beaudouin-Lafon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Conversy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            , and
            <surname>Evans</surname>
          </string-name>
          ,
          <string-name>
            <surname>H.</surname>
          </string-name>
          <article-title>Technology probes: inspiring design for and with families</article-title>
          .
          <source>In CHI '03</source>
          , ACM Press (New York, New York, USA,
          <year>2003</year>
          ),
          <volume>17</volume>
          |-
          <fpage>24</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Janssen</surname>
            ,
            <given-names>J. H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bailenson</surname>
            ,
            <given-names>J. N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>IJsselsteijn</surname>
          </string-name>
          , W. A., and
          <string-name>
            <surname>Westerink</surname>
            ,
            <given-names>J. H.</given-names>
          </string-name>
          <string-name>
            <surname>Intimate</surname>
          </string-name>
          <article-title>Heartbeats: Opportunities for A ective Communication Technology</article-title>
          .
          <source>IEEE Transactions on A ective Computing</source>
          <volume>1</volume>
          ,
          <issue>2</issue>
          (
          <year>July 2010</year>
          ),
          <volume>72</volume>
          {
          <fpage>80</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Levenson</surname>
            ,
            <given-names>R. W.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Ruef</surname>
            ,
            <given-names>A. M.</given-names>
          </string-name>
          <article-title>Physiological aspects of emotional knowledge and rapport</article-title>
          . In Empathic accuracy.
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Marci</surname>
            ,
            <given-names>C. D.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Orr</surname>
            ,
            <given-names>S. P.</given-names>
          </string-name>
          <article-title>The e ect of emotional distance on psychophysiologic concordance and perceived empathy between patient and interviewer</article-title>
          .
          <source>Applied psychophysiology and biofeedback 31</source>
          , 2 (
          <year>June 2006</year>
          ),
          <volume>115</volume>
          {
          <fpage>28</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7] Mitchell,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>Revisiting Truth or Triviality: The External Validity of Research in the Psychological</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>