=Paper=
{{Paper
|id=None
|storemode=property
|title=Formalization of Word-Order Shifts by Restarting Automata
|pdfUrl=https://ceur-ws.org/Vol-1003/2.pdf
|volume=Vol-1003
|dblpUrl=https://dblp.org/rec/conf/itat/LopatkovaP13
}}
==Formalization of Word-Order Shifts by Restarting Automata==
<pdf width="1500px">https://ceur-ws.org/Vol-1003/2.pdf</pdf>
<pre>
ITAT 2013 Proceedings, CEUR Workshop Proceedings Vol. 1003, pp. 2–9
http://ceur-ws.org/Vol-1003, Series ISSN 1613-0073, c 2013 M. Lopatková, M. Plátek


                Formalization of Word-Order Shifts by Restarting Automata∗

                                                 Markéta Lopatková and Martin Plátek

                                   Charles University in Prague, Faculty of Mathematics and Physics
                                        Malostranské nám. 25, 118 00 Prague, Czech Republic
                               lopatkova@ufal.mff.cuni.cz, martin.platek@ufal.mff.cuni.cz

Abstract: The phenomenon of word order freedom plays                    a number of word order shifts within the analysis by re-
an important role in syntactic analysis of many natural lan-            duction. This interplay was exemplified on a sample of
guages. This paper introduces a notion of a word order                  Czech sentences with clitics and non-projectivities (long
shift, an operation reflecting the degree of word order free-           distance dependencies, see esp. [10, 4]). This approach is
dom of natural languages. The word order shift is an op-                based upon the method of analysis reduction – a stepwise
eration based upon word order changes preserving syntac-                simplification of a sentence preserving its correctness. The
tic correctness, individual word forms, their morphologi-               analysis by reduction had been applied in a relatively free
cal characteristics, and their surface dependency relations             manner with no constraints on the projectivity of its indi-
in a course of a stepwise simplification of a sentence, a so            vidual steps. Our investigation focused on a set of syntac-
called analysis by reduction.                                           tically analyzed Czech sentences from the Prague Depen-
   The paper provides linguistic motivation for this opera-             dency Treebank (PDT) [2]. The experiment on treebank
tion and concentrates on a formal description of the whole              data showed that with only basic constraints on the analy-
mechanism through a special class of automata, so called                sis by reduction, see [6], the minimal number of necessary
restarting automata with the shift operation enhanced with              shifts for any sentence from the test set did not exceed one.
a structured output, and their computations.                            This number actually confirmed the initial assumption for
   The goal of this study is to clarify the properties of com-          Czech as a language with relatively high degree of word-
putations needed to perform (enhanced) analysis by reduc-               order freedom; nevertheless, we have continued the search
tion for free word order languages.                                     for more complicated sentences. This lead to the discov-
                                                                        ery of a special construction requiring (at least) two shifts,
                                                                        as it is discussed in Section 2.1.
1    Introduction
                                                                           The analysis by reduction has served as a motivation for
The phenomenon of word order freedom plays an impor-                    new a family of automata, so called restarting automata,
tant role in syntactic analysis of many natural languages.              see esp. [14, 13]. As a first step, analysis by reduction
A relatively free word order combined with additional lin-              was modelled as a formal string to string translation using
guistic constraints constitutes a big challenge for a for-              a suitable model of restarting automata [9]. Then a class of
mal description of syntactic structures.Here we introduce               enhanced restarting automata with an output consisting of
a notion of a word order shift, an operation reflecting the             a single DR-tree was introduced, [16]. Finally, the model
degree of word order freedom of natural languages. The                  was further enhanced – a model of restarting automata that
word order shift is a word order change preserving syntac-              assign a set of dependency structures (DR-structures) to
tic correctness, individual word forms, their morphologi-               every reduction of an input sentence was discussed, [15].
cal characteristics, and their surface dependency relations             DR-structures can capture a set of dependency trees repre-
in the course of a stepwise simplification of a sentence un-            senting sentence on different layers of linguistic descrip-
der investigation, referred to as an analysis by reduction.             tion in a parallel way. Here we further enrich the perfor-
Section 2 provides a linguistic motivation and informal de-             mance of these automata with the shift operation.
scription of the process.
   Section 3 concentrates on a formal description of the
whole mechanism through a special class of automata,                    2   Analysis by Reduction
so called restarting automata with the shift operation en-
hanced with structured output, pRLe -automata, and their
                                                                        Let us first describe the main ideas behind the method used
computations.
                                                                        for sentence analysis. Analysis by reduction (AR) is based
                                                                        on a stepwise simplification of an analyzed sentence, see
Previous Work. The paper [6] introduced a specific ap-                  [8]. It defines possible sequences of reductions (deletions)
proach to the problem of measuring the word-order free-                 in the sentence – each step of AR is represented by deleting
dom. It focused upon an interplay of two phenomena re-                  a sentence member, possibly accompanied by rewriting a
lated to word order: a (non-)projectivity of a sentence and             word of the sentence (and thus its shortening); in specific
  ∗ The paper reports on the research supported by the grant of GAČR   cases, deleting is accompanied by a shift of a word to an-
No. P202/10/1333.                                                       other word order position.
Formalization of Word-Order Shifts by Restarting Automata                                                                                   3


  Let us stress the basic constraints imposed on the anal-          We can notice that the order of reductions reflects the
ysis by reduction:                                               dependency relations in the corresponding dependency
                                                                 tree. Informally, the words are ‘cut from the bottom of the
  (i) the obvious constraint on preserving individual words      tree’ (in other words, the ‘pruning’ operation is applied on
      (word forms), their morphological characteristics          the tree); i.e., a governing node must be preserved in a sim-
      and/or their surface dependency relations;                 plified sentence until all its dependent words are deleted.1
 (ii) the constraint on preserving the correctness (i.e., a      In other words, AR makes it possible to identify the de-
      grammatically correct sentence must remain correct         pendency tree for sentence (1):
      after its simplification);                                                               bojí
                                                                                              Z
(iii) moreover, the application of the shift operation is lim-                               JJZ
                                                                                        Petr se o .
      ited only to cases when a shift is enforced by the                                            JJ
      correctness preserving principle of AR, i.e., a simple                                          otce
      deletion would result in an incorrect word order and          The analysis by reduction has been designed to model a
      thus violate the principle (ii) above.                     human who is able to check the correctness of the analysis.
                                                                 However, it is possible to model correct reductions on the
   As a consequence of AR, it is possible to derive for-
                                                                 basis of large corpora of natural language sentences, see
mal dependency relations between individual sentence
                                                                 [11].
members – pairs of governing and dependent nodes –
based on the possible order(s) of reductions, as it is de-
scribed in [8, 15], see ex. (1) below. Compare also with         2.1 The role of shifts. The above mentioned experiment
other approaches aiming to determine dependency rela-            on a set of non-projective sentences from PDT [2] (namely
tions summed up esp. in [1] and [12].                            on the sentences with a non-projectivity given by a modal
Example:                                                         verb, see [3]) constituted only the first step.
(1) Petr se bojí o otce.                                            As a follow-up experiment, we have decided to supple-
    ‘Peter – REFL – fears – for – father’                        ment the data with ‘suspicious’ types of sentences identi-
    ‘Peter fears for his father.’                                fied in previous work on Czech word order freedom, esp.
The analysis by reduction can be summarized in the fol-          in [4]. It allowed to include also less frequent phenom-
lowing scheme:                                                   ena into our investigations. As a result, we found a well-
                     Petr se bojí o otce.                        formed Czech sentence that requites at least two shifts in
                       delete  Z delete
                                                                 the course of AR, see the following example (2).
                           =
                                 ~
                                  Z                                 Let us stress that the role of shifts in the course of the
             Petr se bojí.         * se bojí o otce.             analysis by reduction was limited to cases where a shift
                    delete
                       ?
                                        shift
                                          ?                      ‘corrects’ word order of a simplified sentence – this con-
               * se bojí.            bojí se o otce.             cerns primarily the cases when an input sentence contains
                             ZZ                                  a clitic (as in sentences (1)-(2)).
                         shift
                               ~   =delete
                               bojí se.                          Example:
Example (1) can be simplified in two ways (for simplicity,       (2) S těžkým se bála pomoci úkolem.
we do not make distinction between upper and lower case              ‘with – difficult – REFL – (she) was afraid – to help – task’
letters at the beginning of the sentence):                           ‘With a difficult task, she was afraid to help.’
(i) Either by simple deletion of the prepositional group         Although the sample sentence is rather strange, the split-
o otce ‘for father’ (following the constraint on correctness     ting of the prepositional group is a grammatical construc-
of the simplified sentence, the pair of word forms must be       tion in Czech allowing to put a strong stress on an adjec-
deleted in a single step; see the left branch of the scheme).    tive modifying the noun and not on the whole prepositional
(ii) Or by deleting the subject Petr (the right part of the      group.
scheme); however, this simplification results in an incor-          Due to the dependency relations present in the sentence
rect word order variant starting with the clitic se (such po-    there is only one possibility how to reduce it, the reduction
sition of a clitic is forbidden in Czech). Thus the shift op-    of the adjective těžkým ‘difficult’. Unfortunately, it results
eration correcting the word order is enforced →shi f t bojí      in a syntactically incorrect word order:
se o otce.                                                       →delete * s se bála pomoci úkolem.
As these possible reductions are independent of each other,      This situation can be corrected in two possible ways, let us
we can conclude that the words Petr and o otce are inde-         focus on one of them:
pendent – in other words, both of them depend on / modify             1 Let us note that some relations – e.g., the relation between a prepo-

a word remaining in the simplified sentence, i.e., the verb      sition and its ‘head’ noun as well as a verb and some clitic – are rather
                                                                 technical (not only) from the point of view of AR: such pairs must be
and its clitic bojí se.
                                                                 reduced in a single step (despite being represented as two nodes in a
Now, we can proceed in a similar way until we get the            dependency tree); here we adhere to the practice used in the PDT anno-
minimal correct simplified sentence → bojí se.                   tation.
4                                                                                                              M. Lopatková, M. Plátek


→shi f t s úkolem se bála pomoci. (A shift of the noun          We have chosen a nondeterministic model to enable vari-
úkolem ‘task’ next to the preposition.)                         ous sequences of reductions. This can serve for the verifi-
→delete * se bála pomoci. (Unfortunately, the next reduc-       cation of the (in)dependency between the individual parts
tion must remove the prepositional group s úkolem ‘with         of a sentence, as was sketched in Section 2.
task’ making the sentence again ungrammatical due to the           A pRL-automaton M is (in general) a nondeterministic
clitic on the sentence first position.)                         machine with a finite alphabet Γ, a head q (window of size
→shi f t bála se pomoci. (Now we can repair the sentence        1) working on a flexible tape delimited by the left sentinel
by shifting the verb bála to the left.)                         c and the right sentinel $ (c, $ 6∈ Γ), and two finite sets
→delete bála se. (The reduction of the word pomoci ‘help’       of restarting meta-instructions R(M) and accepting meta-
ends the process.)                                              instructions A(M), respectively. Such an automaton makes
Similarly for the second sequence of reductions and shifts.     use of k pebbles p1 · · · , pk .
Regardless of the selected sequence, we must apply at least        Let us first describe informally the way how an pRL-
two shifts in the course of AR.                                 automaton works. Each of its finite computations consists
Remark: Note that we limit our observations to simple           of certain phases called cycles, and a last – halting – phase
sentences, i.e., to sentences with a single clause. The rea-    called a tail. In each cycle, the automaton performs two
son is obvious, complex sentences might blur the whole          passes through the tape: during the first pass, it marks se-
picture by bringing additional phenomena into the play.         lected symbols of a processed sentence with pebbles; then
For example, a coordination of clauses might completely         during the second pass, it performs its editing operations
change the total number of shifts in a complex sentence re-     described by meta-instructions from R(M); the operations
gardless of the phenomena contained in individual clauses.      can be applied (only) on the symbols marked by pebbles.
In general, each (coordinated) clause can induce a shift in     In each (accepting) tail, the automaton marks all symbols
the course of AR, thus the overall number of shifts cannot      by pebbles – according to a meta-instruction from A(M) –,
be bounded.                                                     halts, and accepts the analyzed sentence.

                                                                3.1.1 Restarting meta-instructions. Each cycle of the
3 Restarting Automaton with a Structured                        pRL-automaton M is controled by a single restarting meta-
  Output                                                        instruction IR ∈ R(M) of the form

                                                                               IR = (E0 , I1 , . . . Is ; O p ; Restart)          (1)
The crucial issue of the application of a word-order shift
in the process of AR is actually identifying WHAT should        where:
be shifted WHERE. In order to model the whole process           – each Ii is an instruction of the form (ai , Ei ), 1 ≤ i ≤ s,
by means of a special kind of an automaton, we have to          – each Ei is a regular language over Γ; Ei is interpreted as
use special markers – pebbles, which mark the possible          the i-th context of IR (the contexts are usually represented
targets of shift locations. For this purpose we are going       by regular expressions);
to define a restarting automaton with a finite set of special   – each instruction Ii marks the symbol ai ∈ Γ from the tape
meta-instructions using pebbles – an pRL-automaton. The         with the pebble pi within the first pass;
automaton with pebbles was originally introduced in [15]        – O p = {o1 , · · · , o p } is an ordered set of editing operations,
in order to define dependency output structures, and we         o j ∈{dl[i], wr[i, b], sh[i, l]|1 ≤ i, l ≤ s, i 6= l, b ∈ Γ},
are going to enhance its abilities with the shift operation.    – o j is the j-th editing operation performed in the second
   We proceed in several steps. First, a restarting automa-     pass:
ton with pebbles is enhanced with a shift operation (Sec-           — if o j = dl[i] then it deletes the symbol ai ,
tion 3.1). Second, a structured output is added (Section            — if o j = wr[i, b] then it rewrites ai by b,
3.2). Finally, a computation of such enhanced automaton             — if o j = sh[i, l] then it shifts ai to the position behind
is described (Section 3.3).                                     al .
                                                                We require a ‘uniqueness’ of operations on a single sym-
                                                                bol, i.e., if dl[i] ∈ O p then O p does not contain the opera-
3.1   Restarting Automaton with the Shift Operation             tions wr[i, b], sh[i, l] for any b, l.
      Operating on Strings                                      Performing a meta-instruction: For an (input) sentence
                                                                w ∈ Γ∗ , we define the tape inscription cw$, and a reduction
Here we present a class of nondeterministic pRL-automata        w `cIR w0 . When trying to execute a cycle C by performing
that are suitable for modeling AR – these automata allow        the meta-instruction IR , it starts with the tape inscription
for checking the whole input sentence (and marking se-          cw$, M will get stuck (and so reject) if w does not permit a
lected words with pebbles) prior to any changes. It re-         factorization of the form w = v0 a1 v1 a2 . . . vs−1 as vs (vi ∈ Ei
sembles a linguist who can read the whole sentence first,       for all i = 0, . . . , s). On the other hand, if w permits factor-
and then can reduce the sentence in a correct way. Thus         izations of this form, then one of them is (nondeterminis-
he/she makes correct steps of the analysis by reduction.        tically) chosen.
Formalization of Word-Order Shifts by Restarting Automata                                                                              5


   The automaton processes the input word in two passes:          serve for building the structured output, as it is described
1. During the first pass of C, M puts the pebble pi on            in the following sections.
each ai (this corresponds to the instruction Ii , for each i,        The notation u `cM v (a reduction of u to v by M) denotes
1 ≤ i ≤ s).                                                       a reduction performed during a cycle of M that begins with
2. During the second pass of C controlled by the or-              the tape inscription cu$ and ends with the tape inscription
                                                                                       ∗
dered set of operations O p , the tape inscription cw$ is         cv$; the relation `cM is the reflexive and transitive closure
transformed (reduced) into a new tape inscription w0 =            of `cM . By REM(M) we denote the set of reductions per-
v0 r1 v1 · · · vs−1 rs vs , where 1 ≤ i ≤ s:                      formed by M.
                                                                     A string w ∈ Γ∗ is accepted by M, if there is an accept-
   • In the second pass of C, M performs its operations in        ing computation which starts with the tape inscription cw$
     the prescribed order o1 , o2 , · · · , o p .                 and ends by executing an accept operation. By L(M) we
                                                                  denote the language consisting of all words accepted by
   • ri = λ if there is a deleting operation o j ∈ O p such       M; we say that M recognizes (accepts) language L(M).2
     that o j = dl[i];
     i.e., ai is deleted during the cycle C;                         Note that we limit ourselves only to such combinations
                                                                  of operations that are adequate (and necessary) for known
   • ri = b if there is a rewriting operation o j ∈ O p such      linguistic examples.
     that o j = wr[i, b];                                         Example: Let us illustrate the notions of restarting and
     i.e., ai is rewritten with b during the cycle C;             accepting meta-instructions on the analysis of sentence (1)
                                                                  from Section 2. This example formalizes the AR of the
   • ri = λ and r j = al ai if there is a shifting operation
                                                                  sentence Petr se bojí o otce. ‘Peter fears for his father.,
     o j ∈ O p such that o j = sh[i, l];
                                                                  namely the left branch of its AR (see the scheme in Section
     i.e., ai is shifted behind the symbol marked by the
                                                                  2). The respective pRLe -automaton M is described by two
     pebble pl during the cycle C;
                                                                  restarting meta-instructions IR1 and IR2 , and one accepting
   • ri = ai if there is no operation o j ∈ O p applicable on     meta-instruction IA , namely:
     ai .                                                             IR1 = (E0 , I11 , I21 , I31 ; O12 ; Restart)
     (Note that such an ai may – together with symbols                E0 = {Petr se},                           I11 = ( bojí , {λ })
     where some of above mentioned operation is applied                 1
                                                                      O2 = {dl[2], dl[3]},                      I21 = ( o , {λ })
     – create the output structure, see the following sec-                                                      I31 = ( otce , {·})
     tion, and thus they must be marked with pebbles.)                IR2 = (E0 , I1 , I2 , I3 ; O2 ; Restart)
                                                                                    2     2     2    2

                                                                      E0 = {λ },                                I12 = ( Petr, {λ })
   • At the end of each cycle, the Restart operation is per-          O22 = {dl[1], sh[2, 3]},                  I22 = ( se, {bojí})
     formed: all the pebbles are removed from the new                                                           I32 = ( bojí , {λ })
     tape inscription w0 , and the head is placed on the left         IA = ( bojí , se, . , Accept)
     sentinel.                                                    The computation consists of two cycles. Within the first
                                                                  cycle described by the meta-instruction IR1 , the instruc-
Of course, we can delete, rewrite or shift neither the left       tions I11 , I21 and I31 put pebbles p1 , p2 and p3 on the sym-
sentinel c nor the right sentinel $. It is required that during   bols containing the words bojí (with the left context Petr
a cycle, M executes at least one delete operation.                se and empty right context), o, and otce, respectively; the
   If no further cycle is performed, each finite (accepting)      operations dl[2] and dl[3] delete the word o (marked with
computation necessarily finishes in a tail performed ac-          the pebble p2 ) and otce (marked with the pebble p3 ), re-
cording to an accepting meta-instruction.                         spectively. Then the automaton restarts (in particular, all
                                                                  pebbles are removed). Similarly in the second cycle (meta-
3.1.2 Accepting meta-instructions. Tails of accepting             instruction IR2 ), the instructions I12 , I22 and I32 mark the re-
computations are described by a set accepting meta-               spective words and then operation dl[1] deletes the word
instructions A(M), each of the form:                              Petr (marked with p1 ) and the operation sh[2, 3] shifts the
                                                                  word se with the pebble p2 behind the word bojí (with the
                   IA = (a1 , . . . , as , Accept),         (2)   pebble p3 ). The accepting instruction IA just marks the
                                                                  remaining words bojí and se with pebbles.
where ai are symbols from Γ.                                        The following property – obviously ensured by pRL-
Performing a meta-instruction: The tail performed by              automata – has a crucial role in our applications of restart-
the meta-instruction IA starts with the inscription on the        ing automata.
tape cz$; if z = a1 · · · as , then M marks each symbol ai
with the pebble pi , and accepts z (and the whole compu-          Weak Correctness Preserving Property. Any pRL-auto-
tation as well). Otherwise, the computation halts with re-        maton M is weakly correctness preserving, i.e., (for all
jection. Note that only operations distributing pebbles are           2 Note that language L(M) accepted by the automaton M is the same

allowed within an accepting meta-instruction – they will          language as LC (M) in [15].
6                                                                                                                  M. Lopatková, M. Plátek


u, v ∈ Γ∗ )                                                             • U p contains the root of G of the form [i, ai ] (unless the
                     ∗
   v ∈ L(M) and u `cM v imply that u ∈ L(M).                              root is already contained in Uw ).
Our goal is to model the analysis by reduction by automata
with the following stronger property:                                2. An edge (u, v) ∈ H represents:
Correctness Preserving Property. A pRL-automaton                        • either deleting: if dl[i] ∈ O p then the edge has the
M is correctness preserving, if (for all u, v ∈ Γ∗ ) u ∈                  form u = [i, ai ], and v = [ j, a j ] for some j 6= i ( j is in-
               ∗
L(M) and u `cM v imply that v ∈ L(M).                                     terpreted as a position of respective governing node);

3.2    Restarting Automaton with the Shift Operation                    • or rewriting: if wr[i, b] ∈ O p then the edge has the
       Enhanced with Structured Output                                    form u = [i, ai ], and v = [i, b] (i.e., u represents the
                                                                          original symbol ai and v the rewritten)
In this section, we introduce enhanced restarting automata
with structured output – the so-called pRLe -automata.
During their enhanced computations, these automata build             3.2.2 Enhanced accepting meta-instruction IA .
– with the help of pebbles – so called DR-structures, i.e.,          Let IA be an accepting instruction of the form IA =
graphs consisting of items representing individual occur-            (a1 , . . . , as , Accept) (i.e., as it is introduced in Section 3.1,
rences of symbols (words) in the input sentence and di-              X = A). Then a corresponding enhanced accepting meta-
rected edges between them.                                           instruction has a form IEA = (IA , G), where G = (U, H)
   In order to represent a structured output by an pRLe -            is an enhancing graph; the symbols a1 , . . . , as are pebbled
automaton, the items of flexible tape (representing just             and they correspond to the nodes {[1, a1 ], [2, a2 ] . . . , [s, as ]}
sentinels and symbols (= words) of a processed sentence)             of U. For each edge h = (u, v) ∈ H, we suppose that
are enriched with two integers denoting horizontal and ver-          u = [ai , i], v = [a j , j] (1 ≤ i, j ≤ s and i 6= j).
tical position of a respective word in the resulting DR-             Example: Let us return to the analysis of sentence (1)
structure.                                                           Petr se bojí o otce. ‘Peter fears for his father.’, see Section
   An pRLe -automaton is specified in two steps: First, in-          2. In Section 3.1, we have already presented two restart-
structions of an pRL-automaton are enhanced with graphs              ing instructions IR1 and IR2 , and one accepting instruction
(Sections 3.2.1 and 3.2.2). Second, an application of these          IA . Here we enrich these instructions with a graph de-
enhanced instructions (i.e., its computations) are specified         scribing the (part of) resulting DR-structures – the respec-
with the help of DR-structures (Sections 3.2.3 and 3.3).             tive pRLe -automaton Me is described by enhanced instruc-
   A pRLe -automaton Me = (Γ, c, $, ER(Me ), EA(Me ))                tions IER1 = (IR1 , G1 ), IER2 = (IR2 , G2 ), and IEA = (IA , GA ),
consists of an alphabet Γ, sentinels c, $ 6∈ Γ, a set of en-         namely:
hanced restarting meta-instructions ER(Me ), and a set of                G1 = (U1 , H1 )      U1 = {[1, bojí ], [2, o], [3, otce]}
enhanced accepting meta-instructions EA(Me ).                                                 H1 = {([2, o], [1, bojí ]), ([3, otce], [2, o])}
   An enhanced meta-instruction is a pair IEX = (IX , G),                G2 = (U2 , H2 )      U2 = {[1, Petr], [3, bojí ]}
where IX is a (restarting or accepting) meta-instruction                                      H2 = {([1, Petr], [3, bojí ])}
of a pRL-automaton, and G is a directed acyclic graph                    GA = (UA , HA )      UA = {[1, bojí ], [2, se], [3, ·]}
G = (U, H). The graph G represents the required structure                                     HA = {([2, se], [1,bojí ]), ([3, ·], [1,bojí ])}
of symbols processed during the application of the meta-
instruction IX . In this paper we request G to be a rooted
tree. (This restriction ensures linguistically adequate out-         3.2.3 DR-structures.
put structures.)                                                     In this paragraph, we introduce so-called DR-structures
                                                                     (Delete Rewrite structures), which serve for the represen-
                                                                     tation of output structures of restarting automata. A DR-
3.2.1 Enhanced restarting meta-instruction IER .                     structure captures syntactic units (words and their cate-
Let IR be a restarting meta-instruction of the form (1) intro-       gories used in an input sentence and also in its reduced
duced in section 3.1.1. We define the corresponding graph            forms) as nodes of a graph and their mutual syntactic re-
G = (U, H) in the following way:                                     lations as edges; moreover, word order is represented by
1. The set of nodes U consists of three disjunct subsets Un ,        means of total ordering of the nodes.
Uw , and U p :                                                          A DR-structure is a directed acyclic graph D = (V, E),
                                                                     where V is a finite set of nodes, and E ⊂ V ×V is a finite
    • Un = {[i, ai ] | dl[i] ∈ O p or wr[i, b] ∈ O p , 1 ≤ i ≤ s}.
                                                                     set of edges. A node u ∈ V is a tuple u = [i, j, a], where a is
      We can see that Un covers all (original) words which
                                                                     a symbol (word) assigned to the node, and i, j are natural
      are deleted or rewritten words within IR (ai ∈ Γ is the
                                                                     numbers, i represents the horizontal position of the node
      word marked with a pebble pi ).
                                                                     u (i.e., the word order in an input sentence), j represents
    • Uw = {[i, b] | wr[i, b] ∈ O p , 1 ≤ i ≤ s}}. The set Uw        the vertical position of u (it is equal to a number of rewrit-
      represents words resulting from the rewritten opera-           ings of a word on a particular horizontal position). Each
      tions within IR .                                              edge h = (u, v) ∈ E, where u = [iu , ju , a] and v = [iv , jv , b]
Formalization of Word-Order Shifts by Restarting Automata                                                                              7


for some non-negative integers iu , iv , ju , jv and a, b ∈ Γ, is    continues by performing several cycles, and it ends by a
either                                                               tail.
                                                                        A cycle means an application of an enhanced restart-
   • oblique edge iff iu 6= iv ;                                     ing meta-instruction on a given configuration, resulting in
     such edges are interpreted as representing syntactic            a new configuration. During each cycle of C, some items
     relations between respective symbols (words), or                of the tape content are deleted, rewritten, or shifted (as in
                                                                     a cycle of the original automaton without structures); fur-
   • vertical edge iff iu = iv and jv = ju + 1;
                                                                     ther, some edges and (rewritten) nodes are added to the
     such edges are interpreted as representing a rewriting
     of a symbol (word) on the respective position iu .
                                                                     DR-structure of the corresponding configuration, in accor-
                                                                     dance with the graph attached to the respective enhanced
We say that D = (V, E) is a DR-tree if the graph D is a              meta-instruction (see below for more formal description).
rooted tree (i.e., all maximal paths in D end in its single             A tail consists in an application of an enhanced accept-
root).                                                               ing meta-instruction – it results in an acceptance of a con-
                                                                     figuration; further, some edges are added to the current
Remark: It is important to say that a DR-structure pre-
                                                                     DR-structure in accordance with a graph attached to this
serves word order of an original sentence, i.e., the order of
                                                                     enhanced meta-instruction.
nodes in DR-structure (corresponding to words and their
                                                                        Let us stress that during a computation, some nodes and
categories) is not affected by (possible) shifts. However,
                                                                     edges may be added to the DR-structures of the computa-
shifts affect the way how a DR-structure corresponds to
                                                                     tion, and never removed. At the end of each computation,
individual reductions of the automaton – an information
                                                                     we obtain a DR-tree covering the whole input sentence.
of both original and changed positions of individual words
(and their categories) must be recorded during the compu-
tation.                                                              3.3.1 Application of an enhanced restarting meta-
                                                                     instruction IER .
                                                                     Let IER = (IR , G) be an enhanced restarting instruction in
3.3    Computations and (Structured) Languages by                    the previously defined form, with G = (U, H) (see Section
       pRLe -automata                                                3.2). An application of IER on an enhanced configuration
                                                                     Cw = (Tw , Dw ) of a word w results in a new configuration
Based on the definition, it is clear that for a pRLe -
                                                                     Cw0 = (Tw0 , Dw0 ). The application consists in the following
automaton Me , a reduction relation `Me can be defined in
                                                                     steps:
the same way as the reduction relation `M of the corre-
sponding pRL-automaton M; further, Me accepts the same                 1. Choosing a factorization of w of the form w =
language as the M (leaving aside output DR-structures),                   v0 a1 v1 a2 . . . vs−1 as vs such that vi ∈ Ei for all i =
and we can see that a set of reductions REM(Me ) =                        0, . . . , s (see the form of a restarting meta-instruction
REM(M), as well.                                                          (1)). Let us consider that the items from Tw contain-
   Similarly as in [15], computation of an pRLe -automaton                ing the symbols a1 , · · · , as are pebbled.
Me is characterized by sequences of its configurations. A                 If w does not permit any factorization of this form,
configuration is a pair of the form Cw = (Tw , Dw ) , where               then IER cannot be applied on the configuration Cw .
Tw (so called tape content of Cw ) is a string of items rep-
resenting current sentence w on the tape and Dw = (V, E)               2. Transforming the tape content Tw containing the sen-
is a DR-structure (see above). The items of Tw create a                   tence w into the tape Tw0 containing the sentence
subset of V such that Tw contains just all roots of individ-              w0 = v0 r1 v1 · · · vs−1 rs vs ; ri (1 ≤ i ≤ s), are described
ual components (trees) of the DR-structure Dw (thus each                  in section 3.1). The transformation of Tw to Tw0 keeps
item [i, j, xi ] of Tw consists of a word position i (i.e., the           the following rules:
horizontal index), a symbol xi on this position (word of
                                                                              • each item vi ∈ Tw (1 ≤ i ≤ s) which does not
the original sentence), and the number of its rewritings j).
                                                                                undergo an editing operation is copied to Tw0 ,
Moreover, Tw contains at most one item for each horizontal
position i.                                                                   • no deleted item from Tw is transferred to Tw0 ,
   Note that – contrary to [15] – the items of Tw are or-                       and
dered; however, their order need not reflect the original                     • each item [ki , ji , ai ] from Tw that corresponds to
word order of an input sentence as the items (triples) may                      a rewritten symbol ai (hence oi = wr[i, b], b ∈ Γ,
be re-ordered by a shift operation.                                             1 ≤ i ≤ s ) is replaced by [ki , ji + 1, b] in Tw0 ;
   An (accepting) computation C of Me on an input sen-                        • the shift of an item [i, j, ai ] is performed in such
tence w starts in the initial configuration Cwin = (Twin , Dinw ):              a way that the indices i, j remain unchanged in
for the given sentence w = x1 . . . xn (x j ∈ Γ, 1 ≤ j ≤ n),                    Tw0 .
the initial tape content consists in the string of items Twin =
[1, 0, x1 ] · · · [n, 0, xn ], and the DR-structure Din    in /
                                                     w = (Vw , 0),     3. All edges and nodes from the DR-structure Dw are
             in
where Vw = {[1, 0, x1 ], · · · , [n, 0, xn ]}. The computation            transferred into the new DR-structure Dw0 .
8                                                                                                                                  M. Lopatková, M. Plátek


    4. For each edge e ∈ H, a new edge is inserted into the                        Let AC(Me ) denote the set of all accepting computations
       new DR-structure Dw0 = (V 0 , E 0 ):                                     of the pRLe -automaton Me . Then the DR-language of Me
                                                                                is the set DR(Me ) = {DR(C ) | C ∈ AC(Me )}.
             • if e = ([i, ai ], [r, ar ]) is a deleting edge from
                                                                                Remark. From the linguistic point of view, DR(Me )
               H (1 ≤ r ≤ s), and [ki , ji , ai ], [kr , jr , ar ] ∈ Tw
                                                                                should be considered as a set of (shallow) dependency-
               are the items covering the pebbled symbols ai
                                                                                based syntactic trees / analyses computed by the automa-
               and ar , respectively, then an oblique edge eo =
                                                                                ton Me .
               ([ki , ji , ai ], [kr , jr , ar ]) is inserted into Dw0 (i.e.,
               into E 0 ).                                                      Example: Let us return to the example (1), section 2,
                                                                                Petr se bojí o otce. ‘Peter fears for his father.’ In
             • if e = ([i, ai ], [i, bi ]) is a rewriting edge and
                                                                                Sections 3.1.2 and 3.2.2, the pRLe -automaton Me – de-
               [ki , ji , ai ] ∈ Tw is the item covering the pebbled
                                                                                scribed by two restarting meta-instructions and one ac-
               symbol ai , then a vertical edge ev is inserted into
                                                                                cepting meta-instruction together with the graphs enhanc-
               E 0 : ev leads from the item [ki , ji , ai ] to the item
                                                                                ing these meta-instructions – was presented. The sequence
               [ki , ji + 1, bi ].
                                                                                S = IER1 , IER2 , IEA performs the left branch of the analysis
               At the same time the node [ki , ji + 1, bi ] is added
                                                                                by reduction. In the course of the corresponding computa-
               to the set V 0 .
                                                                                tion, the DR-tree D3 is gradually constructed.
   We say that the configuration Cw0 can be reached in one                         Now we can present the accepting computation C =
(restarting) step from the configuration Cw by Me (using                        C0 ,C1 ,C2 ,C3 on the sentence (1) determined by the se-
the instruction IER ) and denote it by Cw |=IMERe Cw0 , or simply               quence of meta-instructions S. Let Cin = (T in , Din ), C1 =
by Cw |=Me Cw0 .                                                                (T1 , D1 ), C2 = (T2 , D2 ), C3 = (T3 , D3 ), C0 is the initial con-
                                                                                figuration of C , C3 is the accepting configuration. Further
                                                                                we gradually describe T in , T1 , T2 , T3 , and Din , D1 , D2 , D3 ,
3.3.2 Application of an enhanced accepting meta-                                where D3 = DR(C ).
instruction IEA .                                                                   T in = [1, 0, Petr][2, 0, se][3, 0, bojí ][4, 0, o][5, 0, otce][6, 0, ·]
Let IEA = (I, G) be an enhanced accepting meta-                                     Din = (V in , 0)
                                                                                                  /
instruction, where I = (a1 , . . . , as , Accept) and G = (U, H).                   V in = {[1, 0, Petr], [2, 0, se], [3, 0, bojí ], [4, 0, o], [5, 0, otce][6, 0, ·]}
Let Cw = (T, D) be an enhanced configuration of Me with                             —————–
the tape content T = [i1 , j1 , a1 ], · · · , [is , j j , as ] covering the         T1 = [1, 0, Petr][2, 0, se][3, 0, bojí ][6, 0, ·]
word w= a1 · · · as . Then the application of IEA on the con-                       D1 = (V in , E1 )
figuration Cw results in a new configuration Cw0 = (T 0 , D0 ),                     E1 = {([4, 0, o], [3, 0, bojí ]), ([5, 0, otce], [4, 0, o])}
where D0 = (V 0 , E 0 ):                                                            —————–
    1. T 0 = [i1 , j1 , a1 ], · · · , [is , j j , as ] = T ;                        T2 = [3, 0, bojí ][2, 0, se][6, 0, ·]
                                                                                    D2 = (V in , E1 ∪ E2 )
    2. all edges and nodes from the DR-structure D are                              E2 = {([1, 0, Petr], [3, 0, bojí ])}
       transferred into the new DR-structure D0 ;                                   —————–
                                                                                    T3 = T2
    3. moreover, for each edge e = ([r, ar ], [s, at ]) ∈ H the                     D3 = (V in , E1 ∪ E2 ∪ E3 )
       new edge ([ir , jr , air ], [it , jt , ait ]) is inserted into D0 .          E3 = {([2, 0, se], [3, 0, bojí ]), ([6, 0, ·], [3, 0, bojí ])}
   Similarly as in the previous case, we say that the con-                      We can see that the presented example is very simple –
figuration Cw0 can be reached in one (accepting) step from                      the context languages from the instructions contain always
the configuration Cw by Me using the instruction IEA , and                      one string (sometimes the empty one λ ). Moreover, we
denote it by C |=IMEAe C0 .                                                     have not used the operation wr[i] at all. That caused that
                                                                                all items used in C have their vertical index equal to 0.
   We can finally define an accepting computation of an                            We can notice that D3 is a formalization of the depen-
enhanced pRLe -automaton and subsequently also a DR-                            dency tree from the example (1). (Let us note that a com-
language consisting from the output DR-structures.                              putation performing the another branch of AR should cre-
   Let C = C0 ,C1 , . . . ,Ck be a sequence of configurations                   ate the same DR-tree.)
such that
– C0 is the initial configuration for a word w, – Ci |=IMi e Ci+1
holds for all i = 0, . . . , k − 1,                                             Observations. Let us finally (and just informally) formu-
– Ii are restarting instructions for all i = 0, . . . , k − 2,                  late some observations concerned the presented notions.
– Ik−1 is an accepting instruction.                                             1. The classes of introduced languages, sets of reductions,
Then we say that C is an accepting computation of Me ,                          and DR-languages create infinite hierarchies with respect
the word w is accepted by Me , and the DR-structure Dk                          to the number of used pebbles. We are able to prove ex-
(from the last configuration Ck = (Tk , Dk )) is the output                     actly such propositions.
DR-structure of the computation C . We write DR(C ) =                           2. On the other hand, we can observe that very few peb-
Dk .                                                                            bles (six, according to our preliminary hypothesis) suffice
Formalization of Word-Order Shifts by Restarting Automata                                                                         9


to analyze adequately any sentence from the Prague De-          References
pendency Treebank (see [2]). It means that the complexity
of analysis by reduction of natural languages is not too         [1] Gerdes, K., Kahane, S.: Defining dependencies (and con-
hight; however, it is not too trivial.                               stituents). In: Gerdes, Hajičová, Wanner (eds.) Proceedings
                                                                     of Depling 2011. pp. 17–27. Barcelona (2011)
3. In a similar way, we can see that the number of used
                                                                 [2] Hajič, J., Panevová, J., Hajičová, E., Sgall, P., Pajas, P.,
shifts in one cycle does not need to exceed one.                     Štěpánek, J., Havelka, J., Mikulová, M., Žabokrtský, Z.,
4. As far as we can foresee, the use of rewritings within            Ševčíková-Razímová, M.: Prague Dependency Treebank
the analysis by reduction is forced only by two linguistic           2.0. Linguistic Data Consortium, Philadelphia (2006)
phenomena: coordinations and noun groups with numer-             [3] Hajičová, E., Havelka, J., Sgall, P., Veselá, K., Zeman, D.:
als. Moreover, the number of rewritings in one cycle does            Issues of Projectivity in the Prague Dependency Treebank.
not need to exceed one.                                              The Prague Bulletin of Mathematical Linguistics 81, 5–22
5. Further, it not hard to see that the shift operation can-         (2004)
not be simulated by rewritings within the class of pRLe -        [4] Holan, T., Kuboň, V., Oliva, K., Plátek, M.: On Complex-
automata, due to the limited number of rewritings in en-             ity of Word Order. Les grammaires de dépendance – Traite-
                                                                     ment automatique des langues 41(1), 273–300 (2000)
coded in meta-instructions.
                                                                 [5] Kuboň, V., Lopatková, M.: A case study of a free word
6. Finally, the measure of word-order complexity from [4]            order (manuscript)
is not bounded by any natural number for DR-languages
                                                                 [6] Kuboň, V., Lopatková, M., Plátek, M.: Studying formal
of pRLe -automata.                                                   properties of a free word order language. In: Youngblood,
                                                                     G., McCarthy, P. (eds.) Proceedings of the FLAIRS 25 Con-
Conclusion and Perspectives                                          ference. pp. 300–305. AAAI Press, Palo Alto (2012)
                                                                 [7] Kunze, J.: Abhängigkeitsgrammatik, Studia Grammatica,
The main novelty of the work presented in the paper is an            vol. XII. Akademie Verlag, Berlin (1975)
enhancement of a formal system in a way that makes it            [8] Lopatková, M., Plátek, M., Kuboň, V.: Modeling Syntax
possible to capture adequately relations between linguistic          of Free Word-Order Languages: Dependency Analysis by
notions in the course of (enhanced) analysis by reductions,          Reduction. In: Matoušek, V. et al. (ed.) Proceedings of TSD
namely it describes dependency structures on the one side,           2005. LNCS, vol. 3658, pp. 140–147. Springer (2005)
and (complexity of) word order on the other side (for the        [9] Lopatková, M., Plátek, M., Sgall, P.: Towards a Formal
clarity reason, we do not distinguish between input sym-             Model for Functional Generative Description: Analysis by
bols and their categories in this paper). However, the in-           Reduction and Restarting Automata. The Prague Bulletin
                                                                     of Mathematical Linguistics 87, 7–26 (2007)
vestigation of these notions is complete neither from the
linguistic point of view, nor from the point of view of the     [10] Marcus, S.: Sur la notion de projectivité. Zeitschrift fur
                                                                     Mathematische Logik und Grundlagen der Mathematik
formal automata theory.
                                                                     11(1), 181–192 (1965)
   As for the future work, we plan to continue the research
                                                                [11] Mareček, D., Žabokrtský, Z.: Exploiting reducibility in un-
in the direction of combining more types of linguistic phe-
                                                                     supervised dependency parsing. In: Proceedings of EMNL-
nomena which influence the degree of word order freedom              CoNLL 2012. pp. 297–307. ACL (2012)
for natural languages as well as enhancing the proposed
                                                                [12] Mel’čuk, I.A.: Dependency in language. In: Gerdes, Ha-
formal system so that it would capture the added phenom-             jičová, Wanner (eds.) Proceedings of Depling 2011. pp. 1–
ena. Further research of the presented type of automata              16. Barcelona (2011)
should present its properties in an exact way, as well.         [13] Otto, F.: Restarting Automata and Their Relation to the
   Further, we plan to propose a refinement of AR allow-             Chomsky Hierarchy. In: Ésik, Z., Fülöp, Z. (eds.) Proceed-
ing to set constraints on possible reduction steps, namely a         ings of DLT 2003. LNCS, vol. 2710, pp. 55–74. Springer,
constraint on projective reductions only, as the additional          Berlin (2003)
experiments with a different or modified set of constraints     [14] Otto, F.: Restarting Automata. In: Ésik, Z., Martin-Vide,
applied on the shift operation will probably provide new             C., Mitrana, V. (eds.) Recent Advances in Formal Lan-
measures of word order freedom. An interesting research              guages and Applications, Studies in Computational Intelli-
direction in this respect would be the investigation of pro-         gence. vol. 25, pp. 269–303. Springer-Verlag, Berlin (2006)
jectivization of sentences prior to the individual reductions   [15] Plátek, M., Mráz, F., Lopatková, M.: (In)Dependencies
of AR, which might completely change the picture and al-             in Functional Generative Description by Restarting Au-
low for a more detailed investigation of an interplay be-            tomata. In: Bordihn, H. et al. (ed.) Proceedings of NCMA
tween clitics and non-projective sentences, see also [5].            2010. books@ocg.at, vol. 263, pp. 155–170. Österreichis-
   Further investigation of the shift operation might also           che Computer Gesellschaft, Wien, Austria (2010)
help to describe the relationship between different levels of   [16] Plátek, M., Mráz, F., Lopatková, M.: Restarting Automata
formal description of natural languages in the Functional            with Structured Output and Functional Generative Descrip-
Generative Description, namely between the tectogram-                tion. In: Dediu A. et al. (ed.) Proceedings of LATA 2010.
                                                                     LNCS, vol. 6031, pp. 500–511. Springer, Berlin Heidelberg
matical and analytical layer. This would strengthen the
                                                                     (2010)
formal background of the theory.

</pre>