=Paper= {{Paper |id=Vol-1772/paper7 |storemode=property |title=Syncretic Text Composition in Artificial Museum Guides |pdfUrl=https://ceur-ws.org/Vol-1772/paper7.pdf |volume=Vol-1772 |authors=Antonio Sorgente,Antonio Calabrese,Gianluca Coda,Paolo Vanacore,Francesco Mele |dblpUrl=https://dblp.org/rec/conf/aiia/SorgenteCCVM16 }} ==Syncretic Text Composition in Artificial Museum Guides== https://ceur-ws.org/Vol-1772/paper7.pdf
         Syncretic Text Composition in Artificial
                     Museum Guides

    Antonio Sorgente, Antonio Calabrese, Gianluca Coda, Paolo Vanacore, and
                                Francesco Mele

    Institute of Applied Sciences and Intelligent Systems “Eduardo Caianiello” of the
                                National Research Council
                   Via Campi Flegrei 34, 80078 Pozzuoli (Naples) Italy
      {a.sorgente, a.calabrese, g.coda, p.vanacore, f.mele}@isasi.cnr.it



        Abstract. In this paper, we present our ongoing research about the
        composition of syncretic text for artificial museum guides. During a mu-
        seum visit, the visitors receive information about the cultural assets and
        responses to their questions. The aim is to reuse existing texts(for exam-
        ple those already published on the web) to compose responses for visitors
        that take into account the time at their disposal, and are balanced with
        respect to possible insights. Finally, system responses will result from a
        composition process that coherently synchronises media elements with a
        synthetic voice related to selected text.


Keywords: syncretic text, multimedia composition, cultural heritage


1     Introduction
Nowadays, the diffusion of new technologies (such as mobile and wearable de-
vices) has allowed the practitioners and organisations operating in the area of
Cultural Heritage to propose new approaches for the fruition of cultural assets.
This approaches allow us to access to museum collections in multiple ways, both
in and off site. Also, the amount of information related to the domain of Cultural
Heritage built by experts, and published on the web, is growing day by day. In
this scenario, an important aspect is related the extraction of information that
must be coherent with the query submitted to the system.
    The goal of a generation system is to produce text in response to a given
stimulus. It must be able to choose what to include in such text and how to
organise this information so that it can be easily understood, and increase the
knowledge of the user. The most common information available on cultural her-
itage is related to the story of the asset or what it depicts. The best way to
represent stories consists of using natural language. Our aim in this work is to
propose an approach that allows us to dynamically generate information that
can be close to the user request, re-using textual information provided by ex-
perts and/or already published on the web, integrating these lastly with media
resources (photos and video) to generate a unique multimedia response.
    In this paper, we present an ongoing research about the production of syn-
cretic text for artificial museum guides. The texts have to take into account the
time available to visitors, and to be balanced with respect to the possible insights.
The latter means that the response provides an explanation of equal length for
each topic involved in the dialogue. The main characteristic of this methodology
is to propose an approach based on the thematic structure of the text, selecting
appropriate contents related to a cultural item and then aggregating them with
media resources (photo and video).
    The construction of the thematic structure is based on CSWL formalism[1].
Also, we want to use the thematic progression as pattern for the selection and
composition of the text to be proposed to the user. The application of the the-
matic progression permits us to improve the cohesion and coherence of the com-
posed text provided to visitors.
    This activity has been developed within the SIMArt project1 . The aim of the
project is to design interactive multimedia systems for the use of the cultural
heritage based on the augmented knowledge paradigm[2].
    We will briefly introduce the concept of thematic organisation of text and
how we define it using CSWL annotation. We will present the approach adopted
to compose text and how to synchronise it with media resources. Finally, some
conclusions and future work will be presented.


2     A dialog model based on Theme-Rheme structure
In the construction of text, the speakers/writers construct their messages gradu-
ally introducing concepts in such a way that the message is clear, like a touristic
guide. A way to achieve this aim is to organise the text through a thematic struc-
ture[3]. This structure is based on two elements: theme and rheme. The theme
(called also topic) is related to ‘what’ the text is talking about, and rheme (called
also comment or focus) is related to ‘what’ is said about the topic. This structure
is known as thematic organisation of text. For the automatic text composition,
our model is based on such structure.
    As reported in [4], the theme typically contains information which has been
previously mentioned or refers to the context of discussion, for example in a
museum it can be a cultural asset. It is followed by the rheme that is the part of
text that explains the theme introducing new information. An example of theme
and rheme in a sentence is the following: (The Basilica of Saint Clara)theme (in
Naples was built between 1310 and 1340)rheme . In the sentence, the goal is to
talk about the Saint Clara church (theme) and to say something about the story
of its construction (rheme).
    In order to build the response text in a dialog, some principles, based on the
thematic organisation (called thematic progression[5]) have been defined. The
thematic progression defines how the theme and rheme are introduced in order to
have cohesion and coherence in the text. It can be seen as the skeleton of the plot.
1
    SIMArt (Interactive Multimedia Systems for the use of Art objects) is a project of
    National Research Council (CNR)




                                       51
The main types of thematic progression, called patterns, are: linear progression,
constant (or parallel) progression, split-theme, and split-rheme progression[5, 6].
    In the Constant Progression (CP) the same theme appears in sequence in a
series of sentences, in some cases using words having equivalent meaning (see
Fig. 1.a). With Linear Progression (LP), the rheme of a sentence will be the
theme of the next sentence (see Fig. 1.b). Instead, in Derived-Theme Progres-
sion (DTP), the theme of a sentence is linked to the theme of the next sentences.
This means there exists a relation (hypertheme) between the themes. An exam-
ple is the meronomic relation as shown in Fig. 1.c. Finally, the Split-Rheme
Progression (SRP), that can be considered a general case of LP, is applied when
a sentence introduces more rhemes. In this case, each rheme becomes the theme
of a sentence (see Fig. 1.d). Analysing some cultural texts, it has been neces-
sary to introduce another pattern that we called: Inverse Progression (IP). With
respect to this pattern, the theme of the sentence becomes a rheme of the new
sentence (see Fig. 1.e).




Fig. 1. Examples of Thematic Progression Pattern: (a) Constant; (b) Linear;
(c) Derived-Theme; (d) Split-Rheme; (e) Inverse.




2.1   Correspondence between event components and theme-rheme
      elements

One of the key ideas of this work is the use of sentences annotated by the CSWL
formalism to represent the thematic structure. Generally, a theme corresponds to



                                     52
the first noun phrase of a sentence that is participant, circumstance, or process,
while the last part of sentence contains the rhemes[7].
    Our starting point is the annotation in CSWL[1]. CSWL is an event-based
formalism and defines three types of entities changing over time: simple events,
complex events and fluents. Simple Events are represented by four compo-
nents: When - the time interval in which the event happens; What - the action
happening in the event; Where - the location where an event takes place; and
Who - the participants in the event. In CSWL, stories are represented through
complex events. A Complex Event is constituted by a set of events, causal
and temporal relationships between them, and all properties holding true over
the time in which the story unfolds. Through the Fluents one can represent
properties, mental events, spatial relations, and meronomic relations, which are
all entities that can change over the time.
    Using event-based annotation, for each sentence S we have one or more events
annotated, and for each event ei there are four components:

                  Event(S)      = e1 , e2 , ...en
                Components(ei ) =< whati , whoi , wherei , wheni >

In our approach, we select as theme the first event component that appears in the
sentence and the remaining components as rhemes. If we consider the following
sentence with related CSWL annotation:

S1 : La basilica di Santa Chiarapo1 in Napoliloc1 fu costruitaact1 tra 1310 e 1340int1 .
 – Event(S1 ) = e1
 – Compoments(e1 )= < act1 , po1 , loc1 , int1 >
       with act1 : Action, po1 : P hysicalObject, loc1 : Location, int1 : Interval

In accordance with this representation, the theme is po1 , that is “la basilica di
Santa Chiara” and the rhemes are act1 , loc1 , int1 . Each thematic schema de-
fined starting from the CSWL annotation, can be enriched with other semantic
relations such as: meronomic, hyperonomic, synonymous. Then, using this rep-
resentation, we implement the thematic progression presented in the previous
section. Through this approach we choose the next sentences according to both
thematic progression and semantic relations.


3    Response as regular expansion

The construction of natural and self-explanatory responses for the visitor needs
to be built with respect to some criteria. In a museum, a key constraint of the
visit is time. Each visitor has a limited amount of time to spend for museum
visits. So, the systems to generate responses have to take in to account such
value. Also, during a presentation, the text need to be clear, so it is necessary to
present and explain all new terms introduced for the first time. Of course, in such
process, the text does not explain obvious things. The choice whether to explain
a concept, or not, can be made in accordance to the visit context and/or user



                                       53
     profile (background). For example, if we consider the sentence The Basilica of
     Saint Clara in Naples was built between 1310 and 1340, and the visitor is located
     in Naples, it’s pointless to explain something about Naples, vice versa, it can
     be useful if the visitor is listening to the story in another country. Finally, the
     response built for the user has to be balanced, ensuring that the text does not
     present insights too large with respect to specific subjects and thus, deflecting
     attention from the main topic.

     3.1    Dialogue responses using thematic progression
     In this section we present, through some examples, how we can expand a sentence
     using a thematic propagation that takes into account the semantic annotation
     of sentences. An important step is to define the procedures for browsing the
     thematic structure of sentences, to research the text that composes the response.

                                 Listing 1.1. Pseudocode 1
1    def spQuery_Expansion(DialogueState,Sx, Tr)
2        Tr = Tr - time(Sx) # response time
3        Expansion = Sx     # expanded response
4        DS = DialogueState # sentences already uttered
5

6          if Tr>0:
7              for each S in (Text -(DS U Expansion)):
8                  if (thematic_progression(S,SX) and
9                      semantic_relation(S, Sx) and time(S) <= Tr+Dt):
10                        Expansion = Expansion + S
11                        Tr = Tr - time(S)
12         if Tr>0:
13             for each S in (Text - (DS U Expansion))
14                 if (linear_expansion(S,Sx) and time(S)<= Tr+Dt):
15                        Expansion = Expansion + S
16                        Tr = Tr - time(S)
17         if Tr>0:
18             for each S in (Text- (DS U Expansion))
19                 if (constant_progression(S,Sx) and Time(S)<= Tr+Dt):
20                        Expansion = Expansion + S
21                        Tr = Tr - time(S)
22         return Expansion

         Generally, the main theme is related to a cultural asset, but in an interactive
     system based on dialogue, the starting theme for the search depends on the user
     query (Q). As a first step, the system identifies the event, and the corresponding
     sentence (Sx)[8], that answers the user query Q. So in the expansion phase we
     will have as the starting point the sentence Sx. The listing 1.1 shows the pseudo-
     code of algorithm for expansion of sentence Sx with respect to a specific query
     Q (e.g. “When did Vaccaro renovated the Basilica?” or “When was it bombed?”.
         As a first step (lines 6..10), the algorithm finds all sentences strongly con-
     nected with Sx. We assume that two sentences S1 and S2 are strongly connected



                                          54
     if there is a thematic progression and a semantic relation between them. For se-
     mantic relations, we have used causal relations and hyperonomy relations. In
     the first case, this means that there is a cause and effect relation between two
     sentences, and in the composition process it is more useful to choose both sen-
     tences for the explanation. While in the second case, there exists a specialisation
     of some sentence component, which is an insight.
         After this selection, the algorithm, for the next text, searches (lines 11..16)
     the sentences S which have a Linear Progression (T heme(S) ∈ Rheme(Sx)),
     which are sentences that have as theme one of the rhemes belonging to Sx.

                                 Listing 1.2. Peudocode 2
1    def genQuery_Expansion(DialogueState,Sx, Tr)
2        Tr = Tr - time(Sx) # response time
3        Expansion = Sx     # expanded response
4        DS = DialogueState # contains the sentences already uttered
5

6        if Tr>0:
7            for each S in (Text- (DS U Expansion)):
8                if (constant_progression(S,Sx) and time(S)<= Tr+Dt):
9                       Expansion = Expansion + S
10                      Tr = Tr - time(S)
11       if Tr>0:
12           for each S in (Text- (DS U Expansion)):
13               if (inverse_progression(S,Sx) and time(S)<= Tr+Dt):
14                      Expansion = Expansion + S
15                      Tr = Tr - time(S)
16       if Tr>0:
17           for each S in (Text- (DS U Expansion)):
18               if (derived_progression(S,Sx) and time(S)<= Tr+DX):
19                      Expansion = Expansion + S
20                      Tr = Tr - time(S)
21       if Tr>0:
22           for each S in (Text-(DS U Expansion)) and St in Expansion:
23               if (linear_progression(S,St) and time(S)<=Tr+DX):
24                      Expansion = Expansion + S
25                      Tr = Tr - time(S)
26       return Expansion

         If the user’s query contains a generic request (e.g. “can you tell me something
     about the basilica?” or “can you give me some information about the style”, this
     means that he/she does not ask specific information about the topic expressed
     in the query. The listing 1.2 shows the pseudo-code of the algorithm for the
     expansion of sentences with respect to a generic query Q. As in the previous
     case, the starting point is the first sentence Sx that the system provides as a
     response. In this case, the request is generic, and it asks information about a
     topic. So, the algorithm finds all sentences that have a Constant progression with
     respect to Sx (lines 6..10) and if there is more narration time, it searches (lines
     11..15) sentences that have inverse progression (T heme(Sx) ∈ Rhemes(S)) with



                                          55
        Fig. 2. Answer of the query “Can you tell me about the Basilica?”




               Fig. 3. Answer of the query “When was it bombed?”


respect to the theme of the query, or try to find sentences with derived theme
progression (lines 16..20). If there is more available time for the narration, the
algorithm finds some deepening of the sentences that were already selected in
the previous steps (lines 21..24). Using this approach, the composed text will
present a generic description of the topic required and, in accordance with the
available time for the response, some deeper insights.
    As experimentation of the adopted approach, we have considered a cultural
text about the Basilica of Santa Chiara in Naples. Asking the query “can you
tell me about the basilica?” we can obtain the answer shown in Fig. 2. In this
figure, the first sentence is selected as the response by the system. To build an
expansion, the algorithm in listing 1.2 is applied. It adds two sentences to the
response: the first with constant progression and the second with derived theme
progression. In the latter, the system can detect the theme through a meronomic
relation. In fact, the interior (po4 ) is part of of the basilica (loc2 ).
    If we consider the query “When was it bombed?” we can obtain an answer
as shown in Fig. 3. In this case, starting from the sentence that contains the
response, for the expansion the algorithm presented in listing 1.1 is used. We
can observe that between the two sentences exist a linear progression and a
causal relation, so they are strongly connected.

3.2   Syncretic approach for multimedia responses
To build the composed text for responding to the user question, we create a mul-
timedia response temporally synchronising text and media according to semantic



                                     56
annotations. This approach is called syncretic text[9], namely a text composed
of heterogeneous languages within a unitary communications model[2], having
features of cohesion and coherence, respect to a same enunciation instance. For
these goals, the system selects, and carries out a ranking, using available mul-
timedia objects, that can be associated to the composed text. The selection is
based on annotated entities using the semantic of CSWL[8] formalism. Then,
multimedia objects selected are synchronised with synthesised text, so that me-
dia items are coherently visualised with the time intervals in which a synthetic
voice talks about the content represented in the media.

4    Conclusions and Future Work
In this work, we presented an ongoing research activity about the composition of
balanced texts that uses a thematic progression structure built through an event
based formalism. What we have presented here represents just a first application
that composes texts using information coming from a single document, but we
believe that this approach can be adopted for building texts integrating more
documents. Future work will consist of analysing more texts to validate the
patterns of thematic progression and discover new ones. We also believe that
presented algorithms can be improved taking into account some characteristics of
the user profile. In addition, because the approach is based on CSWL annotation,
to reduce the time in such phase, we are working on an assisted tool that helps
the users in the annotation.

References
1. Mele, F., Sorgente, A.: CSWL - Cultural Stories Web Language. Technical Report
   180/15, Institute of Cybernetics “E. Caianiello” of CNR (06 March 2015)
2. Sorgente, A., Calabrese, A., Coda, G., Vanacore, P., Mele, F.: Building multimedial
   dialogues annotating heterogeneous resources. In Bordoni, L., Mele, F., Sorgente,
   A., eds.: Artificial Intelligence for Cultural Heritage. Cambridge Scholars Publishing
   (2016) 49–82
3. Ebrahimi, S.F., Ebrahimi, S.: Information development in efl students composition
   writing. Advances in Asian Social Science 1(2) (2012) 212–217
4. Halliday, M.A.K., Matthiessen, C.M.: Halliday’s introduction to functional gram-
   mar. Routledge (2013)
5. Danes, F.: Functional sentence perspective and the organization of the text. In
   Papers on Functional Sentence Perspective (1974) 106–128
6. Soepriatmadji, L.: Thematic progression in thesis abstacts written by english stu-
   dents of fbib unisbank semarang. Jurnal Ilmiah Dinamika Bahasa dan Budaya 3(1)
   (2009) 28–40
7. Jing, W.: Theme and thematic progression in english writing teaching. Journal of
   Education and Practice 6(21) (2015) 178–187
8. Sorgente, A., Vanacore, P., Origlia, A., Leone, E., Cutugno, F., Mele, F.: Multimedia
   responses in natural language dialogues. In: Proceedings of AVI*CH 2016. Volume
   1621 of CEUR Workshop Proceedings., CEUR-WS.org (2016) 15–18
9. Cosenza, G.: Semiotica dei nuovi media. Laterza (2010)




                                        57