=Paper=
{{Paper
|id=Vol-2481/paper6
|storemode=property
|title=How do Physiotherapists and Patients talk? Developing and annotating RiMotivAzione dialogue corpus.
|pdfUrl=https://ceur-ws.org/Vol-2481/paper6.pdf
|volume=Vol-2481
|authors=Andrea Bolioli,Francesca Alloatti,Mariafrancesca Guadalupi,Roberta Iolanda Lanzi,Giorgia Pregnolato,Andrea Turolla
|dblpUrl=https://dblp.org/rec/conf/clic-it/BolioliAGLPT19
}}
==How do Physiotherapists and Patients talk? Developing and annotating RiMotivAzione dialogue corpus.==
How do Physiotherapists and Patients talk?
Developing the RiMotivAzione dialogue corpus.
Andrea Bolioli[1], Francesca Alloatti[1,2], Mariafrancesca Guadalupi[1],
Roberta Iolanda Lanzi[1], Giorgia Pregnolato[3], Andrea Turolla[3] 1
1
CELI - Language Technology, Italy
2
Department of Computer Science - Università degli Studi di Torino, Italy
3
IRCCS Fondazione Ospedale San Camillo, Italy
Abstract 2018). A recent review of scientific literature
about Artificial Intelligence and IoT in healthcare
The research project RiMotivAzione aims at can be found in (Shah and Chircu, 2018).
helping post-stroke patients who are following
The research project RiMotivAzione aims at
an arm and hand rehabilitation path. In this pa-
per we present the RiMotivAzione corpus, the helping the patients who suffered from a stroke
first collection of dialogues between physio- and are following an arm and hand rehabilitation
therapists and patients recorded in an Italian path. The goal is to motivate the patients to follow
hospital and annotated following the RIAS an- the assigned exercises through the use of a new
notation protocol. We describe the dataset, the wearable device with motion sensors developed by
methodologies applied and our first investiga- the Istituto Italiano di Tecnologia (IIT), integrated
tions on relevant features of the dialogue pro-
with a visual App and a conversational interface.
cess. The corpus was the basis for the design
of a conversational interface integrated with a
This last component guides the user through the
wearable device for rehabilitation, to be used therapeutic path proposing the exercises, giving
by the patient during the exercises that he or advice and asking for feedback.
she may perform independently.1 The implementation of voice technologies in the
healthcare domain allows for patients with motor
1 Introduction impairments to interact with devices through spo-
ken language (Moore et al., 2018), while arm and
In recent years, computational linguistics and
hand are busy performing the assigned exercises.
medical research have started to collaborate in or-
The interaction is seamless and spontaneous. The
der to analyze the communication in the health-
patient can keep up autonomously with the ther-
care domain, in particular between clinicians and
apy thanks to the guidance provided by the voice
patients. From a medical perspective, linguistic
assistant. The physiotherapist can monitor the pa-
analysis and dialogue modeling can be used to
tients at a distance, to evaluate their progress, and
better understand and potentially enhance com-
he can prevent a situation of therapy neglect by the
munication in different healthcare settings (Sen
patient, while the latter is motivated to stick to the
et al., 2017; Chang et al., 2013; Marzuki et al.,
path and he can reach his rehabilitation goals on
2017), as well as to identify "preclinical" or "pre-
time. Needless to say, these digital assistants are
symptomatic" diseases for specific ranges of pa-
not meant to substitute the clinician.
tients, e.g. discovering early linguistic signs of
cognitive decline (Beltrami et al., 2018). 2 Methodological Background and
Natural Language Processing (NLP) technolo- Related Work
gies are also used to develop new communicative
tools, e.g. virtual assistants, to alleviate the bur- As we described in the previous section, the study
den on medical personnel or shift to a home-based of communication and conversation in the medi-
patient-centered model of care. Through mHealth cal domain is growing in the last years, as well
(mobile health), for example, people can receive as the introduction of conversational agents in the
assistance at home, and monitoring devices can healthcare sector. A review of current applications
check the well-being of a person (Sezgin et al., and evaluation measures of conversational agents
1
used for health-related purposes can be found, for
Copyright c 2019 for this paper by its authors. Use
permitted under Creative Commons License Attribution 4.0 example, in (Laranjo et al., 2018). Otherwise,
International (CC BY 4.0). there is no systematic review of scientific literature
concerning the linguistic analysis of dialogues in specification of the Dialogue Act Markup Lan-
healthcare. Some scientific studies describe how guage (DiAML), used in many annotated corpora.
communication can influence clinical outcomes in In RiMotivAzione project, we deemed RIAS as the
the rehabilitation setting, e.g. how patient satis- most useful one for its specific focus on medical
faction, decision-making, and stress level correlate conversation. Even though RIAS is the closest do-
with physicians’ communicative acts (Hall and main tagset to annotate our corpus, some problems
Roter, 2012). Some researchers propose methods still emerged and they will be presented in next
to detect and track topics in psycho-therapeutic section.
conversations (Chaoua et al., 2018). Other re-
searchers conducted an analysis of actual commu- 3 Corpus Annotation
nicative behaviors, including nonverbal ones, be-
tween physicians and patients in rehabilitation, us- The RiMotivAzione corpus includes two complete
ing transcription and coding of utterances (Chang cycles of physiotherapy sessions with two patients
et al., 2013). in post-stroke rehabilitation (namely, P1 and P2)
and three physiotherapists (T1, T2, T3). The inter-
The analysis of speech acts and conversational views were video recorded in IRCCS Fondazione
interaction can play a relevant role in dialogue Ospedale "San Camillo" in Venice. Each session
modeling for healthcare thanks to the classifica- lasted about 1 hour. The physiotherapy cycle for
tion of utterances, the analysis of dialogue turns patient P1 included 14 sessions, while P2 took 16
and threads, the discovery of recurrent patterns. sessions. Therefore the total duration of record-
Speech acts have been investigated in linguistics ings is about 30 hours.
and computational linguistics for long. Specifi- The patients were carefully selected by the doc-
cally, the task of automatic speech act recogni- tors, since they must present some features. Above
tion has been addressed leveraging both super- all, they had to agree to be part of the experimen-
vised and unsupervised approaches (Basile and tation and they needed to talk in Italian. In an en-
Novielli, 2018). Otherwise, in the healthcare do- vironment where dialect is still strong, their ability
main there is still much room for investigation. to speak Italian was not to be treated lightly. More-
In the RiMotivAzione project, we deal with over, the patients did not have to present any issues
physiotherapy sessions in a hospital. The task is related to aphasia. These requirements restrained
to collect and analyze para-linguistic and linguis- the viable options to two candidates.
tic data, according to the aforementioned goal of Both speakers were encouraged to talk freely
the research project. In this specific setting, i.e. about any topic that may have emerged. Their
conversational analysis of physician-patient dis- only constraint was the use of Italian; when peo-
course, the most widely used method is the Roter ple slipped into dialectal terminology (in this
Interaction Analysis System (RIAS). RIAS was case, Venetian), it was explicitly marked with
developed as a tagset for coding medical dialogue the tag in the corpus. The audio
since 1991 by Debra Roter et al. (Roter, 1991; tracks were transcribed and annotated following
Roter and Larson, 2002) and it has been con- Savy’s (2005) guidelines for orthographic tran-
structed as to be viable for all kind of sessions, e.g. scription for spoken Italian, where applicable. As
conversations in the oncological setting (2017), a pre-processing, we used two Automatic Speech
between patients and psychotherapists or even pa- Recognition (ASR) systems, i.e. Google Speech-
tients and pharmacists. Moreover, RIAS was orig- to-Text and Nuance Transcription Engine. Auto-
inally developed to annotate audio, while we tran- matic transcriptions were corrected manually and
scribed the speech and annotated the transcrip- anonymized. Video and audio tracks have been
tions. This is motivated by the NLP analysis we separately saved for future projects.
wanted to perform on the text, e.g. syntactic and Overlapping between the two speakers and
semantic analysis, machine learning, automatic di- pauses were not marked, as it was not relevant to
alogue act classification. Other dialogue annota- our study. Similarly, any intervention in the dia-
tion schemes exist, namely (Bunt et al., 2017; Ser- logue from a third party was not transcribed since
ban et al., 2017; Stolcke et al., 2000), that includes our interest was solely in the doctor and patient’s
rich taxonomies of communicative functions. The linguistic behaviours. Each dialogue turn of the
ISO 24617-2 standard, for example, includes the corpus was annotated by two different annotators
following the RIAS guidelines. All the annotators hen’s score, was promising (k = 0.63). In case of
have a background in linguistics and a specific ed- disagreement (about 25% of the data), the process
ucation about linguistic corpora. As a single di- was followed by reconciliation or a final decision
alogue turn may contain more than one sentence by a super annotator, where the two annotators
and more than one speech act, the tags assigned to could not overcome the disagreement.
each turn may be more than one. The RiMotivAzione corpus has been built and
RIAS tagset includes 29 categories divided in archived according to GDPR norms. It is not pub-
four macro-categories called Medical Interview licly available but it can be requested to the authors
Functions (MIF) that cover the majority of the for research purposes.
exchanges between a doctor and a patient: Data
Gathering, Information Exchange, Emotional Ex- 4 Corpus Analysis
pression and Responsiveness, Partnership Build-
The RiMotivAzione corpus contains about 98778
ing and Activation. Table 1 contains the list of
tokens. The total number of dialogue turns is
categories occurring at least 200 times in the cor-
7670: 3377 dialogue turns in P1 sessions, 4293
pus, together with examples.
in P2 sessions.
To the best of the authors’ knowledge, the RIAS
In Table 2 and Table 3 we reported the number
system has never been used to annotate sessions
of types, tokens, the ratio between types and to-
of physiotherapy until now. This means that not
kens (the Lexical Richness Index) and the number
all of the tags applied completely to the situa-
of questions for the two patients.
tion, or that some tags may be under-represented
It is worth noticing that Lexical Richness Index
compared to other studies: for instance, the tag
ranges from 0 to 1 and it is closer to 0 in the doc-
Concerns was applied to few sentences, since
tors’ speech, meaning that medical personnel em-
patients in physiotherapy sessions may inherently
ploy a poorer vocabulary while talking to a patient.
express less concern than oncological patients.
This is due to the fact that a therapist needs to stick
All the categories defined in Roter et al. (2017) to a protocol and cannot digress over a certain
were used. Moreover, two more tags were limit. On the other hand, the patient talks quan-
added to include all the exchanges: Unclear and titatively less: he pronounces fewer words, and
Technical problems. The first applied to incom- most of the time those words are simple answers
plete sentences, unintelligible ones (also marked to the questions posed by the clinician. The patient
with the tag), or even in cases where talks less but he can wander more across conversa-
the sentence referred to the physical context, mak- tion topics: he may disclose some personal detail
ing the general meaning impossible to retrieve for about his life or just chit chat. This behavior is ac-
the annotator. The second tag applied to situations tually encouraged by the therapist, since it makes
where the wearable device wasn’t working prop- the therapy session less dull and more spontaneous
erly, therefore resulting in some technical issue out for both the participants (Delany et al., 2010; Ed-
of the scope of the therapy. wards et al., 2004). To sum up, the doctor needs
Another issue concerns the use of irony. Specif- to talk a lot to instruct the patients about the ex-
ically, Patient 2 heavily employed irony while ercise they need to fulfill, as well as to ask ques-
talking to the therapist, even when the dialogue tions (mainly regarding general well-being and in-
concerned his health and well-being. Irony is hard quiries about the therapy itself). Meanwhile, the
to interpret, resulting in the difficulty to assign patient may talk less because most of the time he
correctly a tag to those sentences. Tag Jokes just has to answer short questions (such as "Does
was used in this case, and where inappropriate, it hurt?"); or, when he talks more, it is about some
a discussion between the annotators oriented the external topic which generates an increment in the
choice. vocabulary richness index.
As the annotation task was difficult and it was As the main goal of the study is to replicate
inherently affected by subjectivity, we measured the clinician’s communicative style onto a con-
the resulting inter-annotator agreement and we put versational interface, the major interest is on how
in place strategies to solve the disagreement, in or- the therapists talk, rather than the patients. Pa-
der to annotate all the dialogue turns. The agree- tients’ manner of speaking is taken into consid-
ment calculated at this stage, according to the Co- eration when imagining all the orders or phrases
Specific RIAS code Examples
Social talk non vedevo l’ora di venirla a trovare.
Directions per scendere chiudo, per salire apro la mano.
Agreements esatto, perché lo abbiamo registrato proprio cosí.
Medical condition un po’, poco, fastidio piú che male.
Approvals bravissimo.
Unclear [dialect] vara!
Therapeutic regimen venerdí faremo la parte clinica ti faró io la scala di valutazione.
Jokes and laughter ci vediamo domani, è piú una minaccia che un invito.
Asking for understanding vorrei portarla cosí, hai capito?
Checking for understanding chiudo le dita. cosí?
Concerns sei sicura che funziona?
CeQ Medical condition a fare gli esercizi non ha dolore?
Table 1: Tags and examples of categories occurring at least 200 times in the corpus.
Parameters Patient 1 Therapist Word Frequency
Types 2065 3017 vai 1166
Tokens 10533 39305 apri 432
Lexical Richness Index 0,19 0,07 rilassa 400
Questions 40 667 bravissimo 353
mantieni 314
Table 2: Patient 1 corpus. bravo 288
lascia 199
Parameters Patient 2 Therapist fare 187
Types 2451 2406 prova 156
Tokens 18233 30707 ottimo 153
Lexical Richness Index 0,13 0,07
Questions 380 805 Table 4: Most frequent Verbs and Adjectives used by
therapist 1.
Table 3: Patient 2 corpus.
iotherapy, especially for patients that suffered a
that the user could say to the voice assistant to ex- stroke (Palma and Sidoti, 2019).
press his needs. Table 4 and Table 5 list the most The quantitative analysis operated over the an-
frequent Verbs and Adjectives pronounced by the notated corpus confirms the qualitative remarks
physiotherapists. Apart from "Okay", which is the made so far. In Figure 1 we present the distribution
most frequent word for both therapists (1231 and of dialogue tags, both for patients and therapists,
1019 occurrences), both therapists often use adjec- i.e. the distribution of utterance type according
tives of positive value: bravissimo, bravo, ottimo, to RIAS categories. We plotted on a logarithmic
buono. Other frequent words are mainly verbs ex- scale the frequencies of the tags.
pressed at the first plural person, such as we do, Sentences annotated as Social talk were
we’ll try, or equivalent expressions (let’s relax). abundant, while those marked as Concerns were
The use of the "we" is a communication element copious just for a patient, because he was frus-
that aims at putting on the same level the clini- trated about his health situation and the difficul-
cian and the patient; the goal is to make the pa- ties to manage the physiotherapy. During the ses-
tient feel more comfortable and therefore enhanc- sions with Patient 1, the physiotherapist was able
ing the probability of therapy adherence. At the to engage a conversation about a hobby of his
same time, adjectives such as "good" and "very (motorcycles); even though this discussion topic
good" praise the patient’s efforts, underlining the is not relevant to the therapy, the fact that they
progress he is making. The psychological com- were talking about something interesting for the
ponent is of paramount importance during phys- patient contributed to the improvement of his med-
Figure 1: Distribution of dialogue tags in RiMotivAzione corpus
Word Frequency tic behaviors emerged during the conversations.
vai 340 These patterns were used to build the conversa-
proviamo 199 tional style and infrastructure of the dialogue sys-
apro 198 tem.
pronto 174
facciamo 134 5 Conclusions and Next Steps
attento 124
We created a corpus of conversations between pa-
andare 123
tients and clinicians, in Italian, and we annotated
scendere 120
the dialogue turns according to the Roter Interac-
vediamo 115
tion Analysis System (RIAS). This corpus was the
fare 111
first step in the design of a conversational inter-
face integrated with a smart wearable device, to
Table 5: Most frequent Verbs and Adjectives used by
therapist 2.
guide and assist the patients through the exercises
assigned by the physiotherapist.
The first step in the future work will be to
ical condition (Gard and Gyllenstein, 2000). deepen the linguistic analysis conducted on the
All of these conversational elements are put in corpus, especially regarding the tagged dialogue
place willingly by the clinician and, even more, it acts. A stronger qualitative investigation over the
is the style patients are used to. In the voice assis- data will be carried out. The second step will be
tant design we try to mirror these strategies, pro- to enrich the dataset: unfortunately, only two pa-
viding praises when appropriate and asking ques- tients were deemed appropriate for the experimen-
tions to constantly monitor the user’s well-being. tation, while a corpus should contain dialogues
The data extracted from the transcription and the from more speakers.
annotation represents the most frequent linguis- The RiMotivAzione corpus can be requested to
the authors for research purposes. G. Gard and A. L. Gyllenstein. 2000. The importance
The system prototype will be tested in San of emotions in physiotherapeutic practice. Physical
Therapy Reviews, 5(3):155–160.
Camillo Hospital by a set of stroke patients, fol-
lowing the clinical trial procedures. Thanks to the J. A. Hall and D. L. Roter. 2012. Physician-patient
results of the test, we will produce experimental communication. In H. A. Friedman, editor, The Ox-
data to investigate if and how a voice assistant in- ford Handbook of Health Psychology. Oxford Uni-
versity Press.
tegrated with a wearable device can increase the
effectiveness of the therapy. L. Laranjo, A.G. Dunn, H.L. Tong, A.B. Kocaballi, and
al. 2018. Conversational agents in healthcare: a sys-
6 Acknowledgments tematic review. Journal of the American Medical
Informatics Association, 25(9):1248–1258.
RiMotivAzione is a two-year Research and In- E. Marzuki, C. Cummins, H. Rohde, H. Branigan, and
novation project supported by POR FESR 2014- G. Clegg. 2017. Resuscitation procedures as multi-
2020 Regione Piemonte. The partners are Koiné party dialogue. In Proc. SEMDIAL 2017 (SaarDial)
Sistemi, CELI, IRCCS Fondazione Ospedale San Workshop on the Semantics and Pragmatics of Dia-
logue, pages 60–69.
Camillo, Synesthesia, Istituto Italiano di Tecnolo-
gia (IIT) and Morecognition. We are thankful to R.J. Moore, M.H. Szymanski, R. Arar, and G. J.
our colleagues and project partners, in particular Ren. 2018. Studies in Conversational UX Design.
Springer.
Paolo Ariano and Nicoló Celadon.
S. Palma and E Sidoti. 2019. La comunicazione
nei processi di cura. COMUNIT IMPERFET,
References 46(4):243–251.
P. Basile and N. . Novielli. 2018. Overview of the D. Roter. 1991. The Roter method of interaction pro-
evalita 2018 italian speech act labeling (ilisten) task. cess analysis (RIAS manual). The Johns Hopkins
In Proceedings of the Sixth Evaluation Campaign of University, Baltimore.
Natural Language Processing and Speech Tools for
Italian. Final Workshop (EVALITA 2018) co-located D. Roter, S. Isenberg, and L. Czaplicki. 2017. The roter
with the Fifth Italian Conference on Computational interaction analysis system: Applicability within the
Linguistics (CLiC-it 2018). context of cancer and palliative care. Oxford Text-
book of Communication in Oncology and Palliative
D. Beltrami, G. Gagliardi, Rossini Favretti, E. R., Ghi- Care, pages 717–726.
doni, F. Tamburini, and L. Calzá. 2018. Speech
analysis by natural language processing techniques: D. Roter and S. Larson. 2002. The roter interaction
a possible tool for very early detection of cognitive analysis system (rias): utility and flexibility for anal-
decline? Frontiers in Aging Neuroscience, 10:369. ysis of medical interactions. Patient education and
counseling, pages 128–132.
Harry Bunt, Volha Petukhova, David Traum, and Jan
R. Savy. 2005. Specifiche per la trascrizione or-
Alexandersson. 2017. Dialogue act annotation with
tografica annotata dei testi in Italiano Parlato. Anal-
the iso24617-2 standard. Multimodal Interaction
isi di un dialogo. Liguori, Napoli.
with W3C Standards.
T. Sen, M.R. Ali, M.E. Hoque, R. Epstein, and P. Du-
C.L. Chang, B.K. Park, and S.S. Kim. 2013. Conver- berstein. 2017. Modeling doctor-patient commu-
sational analysis of medical discourse in rehabilita- nication with affective text analysis. In 2017 Sev-
tion: A study in korea. The journal of spinal cord enth International Conference on Affective Comput-
medicine, 36(1):24–30. ing and Intelligent Interaction (ACII), pages 170–
177.
I. Chaoua, D. R. Recupero, S. Consoli, A. Harma, and
R. Helaoui. 2018. Detecting and tracking ongoing Iulian Vlad Serban, Ryan Lowe, Peter Henderson, and
topics in psychotherapeutic conversations. AIH@ Joelle Pineau Laurent Charli and. 2017. A survey of
IJCAI, pages 97–108. available corpora for building data-driven dialogue
systems. arXiv:1512.05742.
C.M. Delany, I. Edwards, G.M. Jensen, and E. Skinner.
2010. Closing the gap between ethics knowledge E. Sezgin, S. Yildirim, S. Ozkan-Yildirim, and
and practice through active engagement: an applied E. Sumuer. 2018. Current and Emerging MHealth
model of physical therapy ethics. Physical Therapy, Technologies: Adoption, Implementation, and Use.
90(7):1068–1078. Springer.
I. Edwards, M. Jones, J. Carr, A. Braunack-Mayer, and R. Shah and A. Chircu. 2018. Iot and ai in healthcare:
G.M. Jensen. 2004. Clinical reasoning strategies in A systematic literature review. Issues in Information
physical therapy. Physical Therapy, 84(4):312–330. Systems, 19(3):33–41.
Andreas Stolcke, Klaus Ries, and Elizabeth Shriberg
Noah Coccaro. 2000. Dialogue act modeling for
automatic tagging and recognition of conversational
speech. Computational Linguistics, 26(3).