<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Eleni Gregoromichelaki</string-name>
          <email>elenigregor@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>King's College London University of Osnabrueck</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>The view of NLs as codes mediating a mapping between “expressions” and the world is abandoned to give way to a view where utterances are seen as actions aimed to locally and incrementally alter the affordances of the context. Such actions employ perceptual stimuli composed not only of “words” and “syntax” but also elements like visual marks, gestures, sounds, etc. Any such stimuli can participate in the domain-general processes that constitute the “grammar”. The function of the grammar is dynamic categorisation of various perceptual inputs and their integration in the process of generating the next action steps. Given these assumptions, a challenge that arises is how to account for the reification of such processes as exemplified in apparent metarepresentational practices like quotation, reporting, citation etc. It is argued that even such phenomena can receive adequate and natural explanations through a grammar that allows for the ad hoc creation of occasion-specific content through reflexive mechanisms.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        1 Language as action and grammar
Standard models that describe natural languages
(NLs) as representational systems belong to the
‘language-as-product’ paradigm
        <xref ref-type="bibr" rid="ref4">(Clark, 1992)</xref>
        ,
concerned with the definition of linguistic
representations, the “product” of linguistic processing.
In this tradition, it has been a standard assumption
that NL properties should be explained by reifying
NLs as abstract codes, mapping forms (strings of
symbols) to propositional intentions. However, a
substantial amount of evidence indicates that NL
use substantially affects NL structuring indicating
an alternative characterisation: within a ‘language
as action’ paradigm, NL properties can be
explicated as coinciding with those of human action;
an agent’s linguistic actions are structured
sequentially, directed by predictions of upcoming inputs,
interleaved and interacting with other activities and
agents. Accordingly, in everyday conversation,
utterances are not expected to display evidence of
necessary hierarchical constituency, e.g.
sentential structuring: non-sentential utterances are
adequate to underpin interlocutor coordination and all
linguistic dependencies are resolvable across more
than one turn:
(1) Angus: But Domenica Cyril is an intelligent
and entirely well-behaved dog who
Domenica: happens to smell
[radio play, 44 Scotland Street]
In such cases, postulating a notion of
wellformedness based on a code licensing units
ranging over strings of words, as an independent level
of structuring, impedes a natural account of such
phenomena. This is because joining overt forms
together often results in illformedness or misleading
interpretations:
(2) A: I heard a bang. Did you hurt
      </p>
      <p>B: myself? No, but Mary is in a state
Moreover, at the level of semantics/pragmatics of
dialogue, the issue of recoverability of
propositional intentions is also problematic, e.g., in cases
such as (5) where various speech acts are
accomplished within the unfolding of a shared single
proposition (see Gregoromichelaki et al. (2011)):
(3) Jack: I just returned</p>
      <p>Kathy: from . . .</p>
      <p>Jack: Finland. [Lerner (2004)]
(4) Eleni: A: Are you left or</p>
      <p>Yo: Right-handed. [natural data]
(5) Hester Collyer: It’s for me.</p>
      <p>Mrs Elton the landlady: And Mr. Page?
Hester Collyer: is not my husband. But I’d
rather you think of me as Mrs. Page. [The
Deep Blue Sea (film)]</p>
      <p>
        This endemic context-sensitivity and
situatedness of NL use is indicative of the fact that both
content and structure are emergent products of the
processes and practices underpinning human
interaction. For these reasons, the more general
approach to NL analysis argued for here revolves
around the idea that structures, objects, concepts,
concrete reality (and even the individual self) can
all be taken as metaphysically emergent categories
with processes, mechanisms, and change as
ontologically primary.1
2 DS-TTR
A grammar architecture adopting this perspective
can be articulated within DS-TTR (Cann et al.
(2005); Purver et al. (2010); Gregoromichelaki
(in press)). Here NLs are conceived as
comprising sets of processes modelled formally as
procedures. Both NLs’ temporal structuring (syntax)
and lexical specifications are analysed as
involving stored sequences (macros) of elementary
(epistemic) actions, defined in an IF-THEN-ELSE
format. Such actions incrementally and predictively
build or linearise conceptual categories expressed
in TTR-representations
        <xref ref-type="bibr" rid="ref6">(Cooper, 2012)</xref>
        . The model
assumes tight interlinking of NL perception and
action: production uses simulation and testing of
parse states in order to license the generation of
strings; comprehension predictively builds
structures to accommodate upcoming inputs in order to
constrain efficiently the usual overwhelming
ambiguity of NL stimuli. By imposing top-down
predictive and goal-directed processing at all
comprehension and production stages, interlocutor feedback
is incrementally anticipated and integrated. The
model includes subsentential tracking of the
shifting contextual parameters of each word-utterance
event (Eshghi et al. (2015); Gregoromichelaki (in
press)). Context constitutes an integral part of the
grammar, not only as a record of the shifting
parameters that provide for the interpretation of
various indexical elements (e.g. myself in (2)), but also
storing (a) the emergent (partial) structures
constructed from the contributions of all participants;
(b) the phonological/graphical elements that have
been employed; (c) the actions used, recorded as
traversals of paths in a graph display; (d) processing
paths that have been considered as probabilistically
live options but not eventually pursued
        <xref ref-type="bibr" rid="ref16 ref25">(Sato, 2011;
Hough, 2015)</xref>
        . Storing the action paths is necessary
1This view has its roots in an ancient philosophical
programme starting in the Western world with Heraclitus, situated
within a tradition following, among others, Martin Heidegger,
Ilya Prigogine, Gilles Deleuze, and even encompassing
current notions like the concept of the extended mind
        <xref ref-type="bibr" rid="ref3 ref5">(Clark and
Chalmers, 1998; Clark, 2008)</xref>
        .
for the resolution of anaphora and ellipsis,
especially “sloppy” or “paycheck” readings, whose
resolution relies on re-executing (‘rerunning’)
previous action sequences in an updated processing
environment. Maintaining abandoned options is
required for the modelling of backtracking in
subsententially occurring conversational phenomena
like clarification, self-/other-corrections, etc. but
also humour effects and puns (Gregoromichelaki,
in press). Consequently, coordination among
interlocutors is seen not as inferential
metarepresentational activity but as the outcome of the fact that
the grammar consists of a set of licensed
complementary actions that both speakers and hearers have
to perform in synchrony
        <xref ref-type="bibr" rid="ref15">(Gregoromichelaki et al.,
2013)</xref>
        .
2.1
      </p>
      <p>Quotation in DS-TTR
Given these assumptions, a challenge that arises
is how to account for the reification of
grammatical processes as exemplified in apparent
metarepresentational practices like quotation, reporting,
citation etc. As we saw earlier in (1)-(5), perfectly
intelligible moves in dialogue can be achieved
simply by initiating a grammatical dependency which
prompts either interlocutor to fulfill it without
specific determination or identifiability of a given
speech-act. In various other cases though, the
interlocutor completing somebody else’s utterance
might be seen as offering the completion along with
a query as to whether such a (meta)representation
is what the other interlocutor would have said (e.g.
(2)). There are further such phenomena in cases of
citation, quotation, reports, echoing uses, and
codeswitching:
(6) “Cities,” he said, “are a very high priority.”
(7) Wright won’t disclose how much the Nike
deal is worth, saying only that “they treat me
well”. [De Brabanter (2010)]
(8) A doctor tells him [Gustave Flaubert] he is
like a “vieille femme hysterique”; he agrees.
[De Brabanter (2010)]
(9) Alice said that life is “difficult to</p>
      <p>
        understand”. [Cappelen and Lepore (1997)]
(10) Mary felt relieved. If Peter came tomorrow,
she would be saved. [Recanati (2010)]
Despite recent attempts to integrate such
phenomena within standard grammars (e.g.,
        <xref ref-type="bibr" rid="ref11 ref19 ref20 ref23 ref7">(Ginzburg and
Cooper, 2014; Maier, 2014; Potts, 2007)</xref>
        ), certain
data are not amenable to appropriate treatment due
to the lack of modelling incrementality within these
formalisms. For example, as can be seen in (6)-(9),
quotation can appear subsententially, and
discontinuously, at any point, which means that contextual
parameters regarding the utterance event and
semantic evaluation need to be able to shift
incrementally at each word-by-word processing stage.
Additionally, quotation is one of the environments where
the phenomenon of split-utterances is observed
frequently as an opportunity arises for co-constructing
a vivid unified perspective of some (actual or
imaginary) speech/thought event (Gregoromichelaki, in
press):
(11) Clinician: So I watch this person being
killed and then I go to bed and I’m you know
lying there going, “well”
Patient: “did I hear something?” [Duff et al.
(2007)]
The contextual parameters relevant to the
resolution of indexicals (e.g. I) in such cases, even though
needing to shift mid-sentence, do not necessarily
track the current speaker/hearer roles. Moreover,
such role-switches include cases where the same
structure can be employed both as expressing a
speaker’s own voice and as a subsequent quotation:
(12) A: SOMEONE is keen [BBC]
      </p>
      <p>
        B: says the man who slept here all night
In all such cases, issues of “footing”
        <xref ref-type="bibr" rid="ref12">(Goffman,
1979)</xref>
        , namely changes in perspectives and roles
assumed by interlocutors, intersect with
syntactic/semantic issues of direct/indirect speech
constructions and speech-act responsibility and
echoing. For these reasons, an adequate account of
the function of such NL devices can be given
straighforwardly in DS-TTR due to its
incremental modelling of context shifting, the potential for
sharing of syntactic/semantic dependencies, and
the fact that there is no requirement to derive a
global propositional speech act (Gregoromichelaki
(in press); Gregoromichelaki &amp; Kempson (2016)).
      </p>
      <p>On the other hand, modelling the potential of
partially assuming another speaker’s role, being
perceived as “demonstrating” what somebody else
was going to say, and the “metalinguistic”
appearance of various such phenomena might seem
especially problematic aspects for the DS-TTR stance:
(13) “Life is difficult” is grammatical.
(14) James says that “Quine” wants to speak to us.</p>
      <p>[James thinks that McPherson is Quine]
(15) “I talk better English than the both of
youse!” shouted Charles, thereby convincing
me that he didn’t.</p>
      <p>A DS-TTR grammar takes words (and the
operation of “syntax” in general) as offering
affordances exploited by the interlocutors to facilitate
interaction. This means that words and linguistic
constructions are NOT conceptualised as abstract
code elements, expression types, that are associated
with referential/semantic values (cf Cooper (2014)
where string structure is still presumed). With no
privileged semantic entities corresponding to (types
of) expressions, only domain-general mechanisms
for processing stimuli, quotation thus offers a
crucial test for the legitimacy of these DS-TTR claims:
when processing a quoted/cited string, what
happens within the quotation marks (or any other
indications) according to these assumptions?</p>
      <p>In fact, it turns out that such cases are also
unproblematic for the DS-TTR model, and can
be explicated in a natural manner that conforms
with intuitions and parallels the modelling of
anaphora/ellipsis. First, in order to model cases
like (6)-(10), (14), (15), as well as mid-sentence
general code-switching, it has to be assumed that
the context keeps track incrementally, through a
designated metavariable (g in (16)), of which and
whose grammar is being employed at each
particular subsentential stage (cf Ginzburg and Cooper
(2014)). Next, consider the most challenging cases,
namely, metalinguistic uses, for example (13),
socalled pure quotation, where an NL-string appears
in a regular NP position. Under DS-TTR
assumptions, this will be a pointer position where the
grammar has already generated a prediction for
the processing of a singular term (?T y(e), other
cases might involve ?T y(cn), etc.). The
explanation of what happens here is based on the fact
that actions are first-class citizens in DS-TTR. This
means that previous actions can be invoked by the
grammar to be re-executed (‘rerun’) in order to
provide parallel but distinct contents as needed in
cases of sloppy-ellipsis or paycheck-pronoun
readings. From this perspective, metalinguistic, echoic,
and similar uses are cases where the actions
specified by some grammar g for processing a
particular string, e.g. the embedded sentential string in
(13), come to be executed on the spot to provide
an ad hoc conceptualisation of a demonstrated
action. The formalisation of the basic mechanism
is shown in (16) below. Different variants of this
macro and combinations with other independently
needed components of the grammar account for all
such phenomena:
(16)</p>
      <p>(a) demonstration action
In (16), the higher-order action run, also employed
in cases of sloppy anaphora, is triggered. run is
parameterised to some grammar g (replacing the
metavariable g), which can be distinct from the
grammar used for parsing/producing the rest of the
string (see (8), (15)). At the same time, the
executed sequence of actions hαi, ..., αni, bound to
the rule-level variables hai,...,ani, confers the ad
hoc conceptual type of the quoting utterance event
uq which therefore functions as a demonstration.
The performance of this demonstration event is
then categorised as belonging to the already
predicted semantic type, here, in (13), a referential
term (T y(e)) (feasible due to TTR’s subtyping
definitions). The rest of the string then delivers a
content that combines with the reification of this ad hoc
execution. In (13), this delivers the interpretive
result that this demonstration of the execution of the
grammatical actions is characterised as having the
property derived from processing is grammatical.
For echoic cases, where the interpretation of the
indexicals shifts following parameter values
supplied by the invoked context, e.g. (7), (15), a
similarly triggered action execution is accomplished,
this time, with parallel introduction of the quoting
context as a mentioned utterance event u, replacing
the metavariable u of type es, i.e. eventuality:
(17)</p>
      <p>(b) demonstration-and-echoing action
Cases of direct quotation (e.g. (11), (12), (15))
are those where such a freely-available contextual
switch has been grammaticalised in English.</p>
      <p>
        Notably, given that the DS-TTR grammar does
not provide form-meaning correspondences but
only provides for the parsing/generation of stimuli
in context, the same mechanism can be applied to
non-linguistic signals/demonstrations: reifying the
processing of some upcoming element to provide
ad hoc content of another already predicted type
explains how non-linguistic signals can compose
subsententially with linguistic ones as the
conceptualisation of some experience being demonstrated:
(18) John saw the spider and was like “ahh!”
(19) John was eating like [gobbling gesture]
(20) She went “Mm Mmmrn Mphh”
The existence of such compositions, along with all
the previous data, might be challenging, under one
construal, for the account of NL-gesture
coordination in Rieser (2014; 2015). Rieser presents a
framework (the λ-π calculus) where NL and
gesture are modelled as independent but
communicating processes. Even though the process
metaphysics incidentally mentioned there is a
welcome development, the assumption of
independence might be questioned. First, this
assumption seems to be an artifact of presupposing that
NLs are structured codes mediating arbitrary
mappings from standard syntactic forms (trees
inhabited by words) to propositional meanings (e.g.
λcalculus formulae). Since the co-speech gestures
examined are related to imagery (aural, visual, etc.)
in an iconic manner, modelling their contribution
in the standard way needs to abstract
representations from the kinematics that cannot be unified
with NL syntactic representations. In contrast, the
view taken here is that NL actions do not require
an independent syntax relying on the hierarchical
structuring of stimuli sequences. Hence
production/comprehension of stimuli in various
modalities need not be segregated. Second, the major
argument in Rieser’s analysis comes from SaGA
data
        <xref ref-type="bibr" rid="ref18">(Lu¨cking et al., 2013)</xref>
        where NL segments and
gesture-strokes seem not to synchronise perfectly.
However, this is not an argument for considering
such stimuli qualitatively distinct. Perfect
synchronisation is not necessary within a single modality
either, e.g. dialogue participants do not perfectly
synchronise their turns. In a predictive framework
like DS-TTR, such asynchrony might reveal a
purpose, for example, co-speech gestures can be
modelled as elaborating or narrowing down predictions
that precede the processing of NL input. But then,
under this view, there is a viable and useful
application of the λ-π calculus in the DS-TTR framework
too. Given that DS-TTR processing is strictly
incremental pursuing only one path at a time, it is
possible that various sources of information might
compete for sequential positions. Introducing
adhoc channel interfaces, modelled with resources
from the λ-π calculus, can provide for the
implementation of a sequentiality-repair mechanism,
ordering inputs/outputs, even within the same
modality, so that they can be processed strictly
incrementally.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Ronnie</given-names>
            <surname>Cann</surname>
          </string-name>
          , Ruth Kempson, and
          <string-name>
            <given-names>Lutz</given-names>
            <surname>Marten</surname>
          </string-name>
          .
          <year>2005</year>
          .
          <article-title>The Dynamics of Language</article-title>
          . Elsevier, Oxford.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Herman</given-names>
            <surname>Cappelen</surname>
          </string-name>
          and
          <string-name>
            <given-names>Ernie</given-names>
            <surname>Lepore</surname>
          </string-name>
          .
          <year>1997</year>
          .
          <article-title>Varieties of quotation</article-title>
          .
          <source>Mind</source>
          ,
          <volume>106</volume>
          (
          <issue>423</issue>
          ):
          <fpage>429</fpage>
          -
          <lpage>450</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Andy</given-names>
            <surname>Clark</surname>
          </string-name>
          and
          <string-name>
            <given-names>David</given-names>
            <surname>Chalmers</surname>
          </string-name>
          .
          <year>1998</year>
          .
          <article-title>The extended mind</article-title>
          .
          <source>Analysis</source>
          , pages
          <fpage>7</fpage>
          -
          <lpage>19</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Herbert H.</given-names>
            <surname>Clark</surname>
          </string-name>
          .
          <year>1992</year>
          .
          <article-title>Arenas of Language Use</article-title>
          . University of Chicago Press.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Andy</given-names>
            <surname>Clark</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>Supersizing the mind: Embodiment, action, and cognitive extension</article-title>
          .
          <source>OUP USA.</source>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Robin</given-names>
            <surname>Cooper</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Type theory and semantics in flux</article-title>
          . In R. Kempson,
          <string-name>
            <given-names>T.</given-names>
            <surname>Fernando</surname>
          </string-name>
          , and N. Asher, editors,
          <source>Handbook of the Philosophy of Linguistics</source>
          , volume
          <volume>14</volume>
          : Philosophy of Linguistics, pages
          <fpage>271</fpage>
          -
          <lpage>323</lpage>
          . Elsevier.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Robin</given-names>
            <surname>Cooper</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Phrase structure rules as dialogue update rules</article-title>
          . In V. Rieser and P. Muller, editors,
          <source>Proceedings of DialWatt - Semdial 2014: The 18th Workshop on the Semantics and Pragmatics of Dialogue</source>
          , pages
          <fpage>4352</fpage>
          , Edinburgh, 13
          <year>September 2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Philippe De Brabanter</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>The semantics and pragmatics of hybrid quotations</article-title>
          .
          <source>Language and Linguistics Compass</source>
          ,
          <volume>4</volume>
          (
          <issue>2</issue>
          ):
          <fpage>107</fpage>
          -
          <lpage>120</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Melissa C Duff</surname>
          </string-name>
          , Julie A Hengst, Daniel Tranel, and
          <string-name>
            <surname>Neal</surname>
          </string-name>
          J Cohen.
          <year>2007</year>
          .
          <article-title>Talking across time: Using reported speech as a communicative resource in amnesia</article-title>
          .
          <source>Aphasiology</source>
          ,
          <volume>21</volume>
          (
          <issue>6-8</issue>
          ):
          <fpage>702</fpage>
          -
          <lpage>716</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Arash</given-names>
            <surname>Eshghi</surname>
          </string-name>
          , Christine Howes, Eleni Gregoromichelaki, Julian Hough, and
          <string-name>
            <given-names>Matthew</given-names>
            <surname>Purver</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Feedback in conversation as incremental semantic update</article-title>
          .
          <source>In Proceedings of the 11th International Conference on Computational Semantics</source>
          , pages
          <fpage>261</fpage>
          -
          <lpage>271</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Jonathan</given-names>
            <surname>Ginzburg</surname>
          </string-name>
          and
          <string-name>
            <given-names>Robin</given-names>
            <surname>Cooper</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Quotation via dialogical interaction</article-title>
          .
          <source>Journal of Logic, Language and Information</source>
          ,
          <volume>23</volume>
          (
          <issue>3</issue>
          ):
          <fpage>287</fpage>
          -
          <lpage>311</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Erving</given-names>
            <surname>Goffman</surname>
          </string-name>
          .
          <year>1979</year>
          . Footing. Semiotica,
          <volume>25</volume>
          (
          <issue>1-2</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>30</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Eleni</given-names>
            <surname>Gregoromichelaki</surname>
          </string-name>
          and
          <string-name>
            <given-names>Ruth</given-names>
            <surname>Kempson</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Reporting, dialogue, and the role of grammar</article-title>
          . In Alessandro Capone, Ferenc Kiefer, and Franco Lo Piparo, editors,
          <source>Indirect reports and pragmatics</source>
          , pages
          <fpage>115</fpage>
          -
          <lpage>150</lpage>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Eleni</given-names>
            <surname>Gregoromichelaki</surname>
          </string-name>
          , Ruth Kempson, Matthew Purver,
          <string-name>
            <given-names>Gregory J.</given-names>
            <surname>Mills</surname>
          </string-name>
          , Ronnie Cann, Wilfried Meyer-Viol, and
          <string-name>
            <surname>Patrick</surname>
            <given-names>G. T.</given-names>
          </string-name>
          <string-name>
            <surname>Healey</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Incrementality and intention-recognition in utterance processing</article-title>
          .
          <source>Dialogue and Discourse</source>
          ,
          <volume>2</volume>
          (
          <issue>1</issue>
          ):
          <fpage>199</fpage>
          -
          <lpage>233</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>E.</given-names>
            <surname>Gregoromichelaki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kempson</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C. Eshghi</given-names>
            <surname>Howes</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>On making syntax dynamic: the challenge of compound utterances and the architecture of the grammar</article-title>
          . In Ipke Wachsmuth, Jan de Ruiter, Petra Jaecks, and Stefan Kopp, editors,
          <source>Alignment in Communication: Towards a New Theory of Communication</source>
          . John Benjamins, “
          <article-title>Advances in Interaction Studies”</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <given-names>Julian</given-names>
            <surname>Hough</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Modelling Incremental SelfRepair Processing in Dialogue</article-title>
          .
          <source>Ph.D. thesis</source>
          , Queen Mary University of London.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>Gene H.</given-names>
            <surname>Lerner</surname>
          </string-name>
          .
          <year>2004</year>
          .
          <article-title>Collaborative turn sequences. In Conversation analysis: Studies from the first generation</article-title>
          , pages
          <fpage>225</fpage>
          -
          <lpage>256</lpage>
          .
          <string-name>
            <surname>Gene H. Lerner</surname>
          </string-name>
          , John Benjamins.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>Andy</given-names>
            <surname>Lu</surname>
          </string-name>
          ¨cking, Kirsten Bergman, Florian Hahn, Stefan Kopp, and
          <string-name>
            <given-names>Hannes</given-names>
            <surname>Rieser</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Data-based analysis of speech and gesture: The Bielefeld Speech and Gesture Alignment Corpus (SaGA) and its applications</article-title>
          .
          <source>Journal on Multimodal User Interfaces</source>
          ,
          <volume>7</volume>
          (
          <issue>1</issue>
          -2):
          <fpage>5</fpage>
          -
          <lpage>18</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <given-names>Emar</given-names>
            <surname>Maier</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Mixed quotation: The grammar of apparently transparent opacity</article-title>
          .
          <source>Semantics &amp; pragmatics, 7</source>
          (
          <issue>7</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>67</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <given-names>Christopher</given-names>
            <surname>Potts</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>The dimensions of quotation</article-title>
          . In Chris Barker and Polly Jacobson, editors,
          <source>Proceedings from the workshop on direct compositionality</source>
          , pages
          <fpage>405</fpage>
          -
          <lpage>431</lpage>
          . Oxford: Oxford University Press.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <given-names>Matthew</given-names>
            <surname>Purver</surname>
          </string-name>
          , Eleni Gregoromichelaki, Wilfried Meyer-Viol, and
          <string-name>
            <given-names>Ronnie</given-names>
            <surname>Cann</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Splitting the i's and crossing the you's: Context, speech acts and grammar</article-title>
          . In P. Łupkowski and M. Purver, editors,
          <source>Proceedings of SemDial 2010 (PozDial)</source>
          , pages
          <fpage>43</fpage>
          -
          <lpage>50</lpage>
          , Poznan, Poland,
          <year>June 2010</year>
          .
          <article-title>Polish Society for Cognitive Science</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <source>Franc¸ois Recanati</source>
          .
          <year>2010</year>
          .
          <article-title>Truth-conditional pragmatics</article-title>
          .
          <source>Clarendon Press Oxford.</source>
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <given-names>Hannes</given-names>
            <surname>Rieser</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Gesture and speech as autonomous communicating processes</article-title>
          . MS, University of Bielefeld.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <given-names>Hannes</given-names>
            <surname>Rieser</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>When hands talk to mouth. gesture and speech as autonomous communicating processes</article-title>
          .
          <source>In Proceedings of SEMDIAL 2015-GoDIAL</source>
          , page
          <volume>122</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <given-names>Yo</given-names>
            <surname>Sato</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Local ambiguity, search strategies and parsing in Dynamic Syntax</article-title>
          . In R. Kempson, E. Gregoromichelaki, and C. Howes, editors,
          <source>The Dynamics of Lexical Interfaces. CSLI Publications.</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>