On Reducing Automata and Their Normalizations
Martin Procházka


                                    Abstract
                                    The reducing automaton, a variant of the restarting automaton, is introduced and its normalizations are studied. They
                                    provide useful features such as prefix-correctness and state minimality. The LR(0) grammar generating the same language is
                                    constructed for any monotone reducing automaton. This grammar is used to construct an equivalent monotone reducing
                                    automaton that is prefix-correct. The minimization of a set of states of any reducing automaton using the method already
                                    invented in the theory of Moore machines is described. Both normalizations can be applied to the monotone reducing
                                    automaton sequentially so that the obtained automaton is both prefix-correct and state-minimal.

                                    Keywords
                                    reducing automata, LR(0) grammars, prefix-correctness, state-minimality


1. Introduction                                                                            Figure 1: Reducing automaton with a state 𝑠 in its control
                                                                                           unit and a working head scanning an item of the working list
This paper is a revised translation of the author’s disser-                                at position 2 that contains a symbol + .
tation thesis [1]. Some parts are reworded, some supple-
mented, others omitted.
   The reducing automaton (first mentioned in [2]), like                                                            0     1      2     3     4      5     6     7      8

its predecessor and ideal, the restarting automaton (see                                                            «     a     +      (     (     a      )     )     »
[3]), is a device suitable for describing a syntax of both
formal and natural languages. It is based on the notion
of reduction analysis, i.e., a gradual truncation of the an-
alyzed string that preserves both its incorrectness and
                                                                                                       𝑠
correctness1 . The following example of a gradual sim-
plification of the arithmetic expression a + ( ( a ) ) shows
that the reduction analysis is quite natural.
                                                a+((a))                                    state-minimal reducing automaton (Section 4). On the
                                                a+( a )                                    other hand the mentioned differences does not prevent
                                                a+  a                                      us to adopt results reached for restarting automata as
                                                a                                          any reducing automaton can be simulated by a restarting
                                                                                           automaton and vice versa while preserving (combina-
The reducing automaton differs from the restarting au-                                     tion of) properties like determinism or monotony. Here
tomaton in several details: 1. It lacks a fixed size looka-                                we focus on monotone reducing automata, which recog-
head window that is actually moved to the control unit.                                    nize deterministic context-free languages, as shown for
2. The sequence of the head movement and the state                                         their predecessors, deterministic monotone restarting
change is reversed. 3. Positions of the reduced work-                                      automata in [3].
list items are precisely determined. It allows us to reuse
some of the concepts and methods invented in standard
grammar and automata theory and simplifies the tech-                                       2. Basic notions and properties
niques presented here: (a) construction of an LR(0) gram-
mar that generates the same language as a given mono-                                                            The reducing automaton is shown in Figure 1. It processes
tone reducing automaton (Section 3), (b) construction                                                            a finite list of items. The first and the last item of the list
of an equivalent prefix-correct monotone reducing au-                                                            contains the delimiter « and » respectively. Any other
tomaton, and (c) construction of a strongly equivalent                                                           item contains a symbol of the finite input alphabet dif-
                                                                                                                 ferent from both delimiters and a natural number, the
ITAT’22: Information technologies – Applications and Theory, Septem-
ber 23–27, 2022, Zuberec, Slovakia
                                                                                                                 position of the item in the list. The positions of the items
Envelope-Open martproc@gmail.com (M. Procházka)                                                                  from left to right form an increasing sequence. The posi-
                    © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License tion of any item does not change during the computation.
                   Attribution 4.0 International (CC BY 4.0).
 CEUR
 Workshop
 Proceedings        CEUR Workshop Proceedings (CEUR-WS.org)
               http://ceur-ws.org
               ISSN 1613-0073
                                                                                                                 The reducing automaton resembles a finite-state automa-
1
    Restarting automata preserve correctness if they are deterministic. ton. It consists of a control unit and a working head. At
    Reducing automata are deterministic by definition, as we shall see
    below.
                                                                                                                 any given time the control unit is in one of the finitely
Figure 2: Reducing the working list.                               in the initial state with the head on the left delimiter and
                                                                   finishes in any final or reducing state.
                                                                      Formally, we define the reducing automaton 𝑀 (red-
                   0    1   2    3    4   5    6    7    8
                                                                   automaton, for short) as a 7-tuple

                   «   a    +    (    (   a    )    )   »                            𝑀 = (Σ, «, », 𝑆, 𝑠0 , 𝐹 , 𝑓 ),

                                                                   where Σ is the finite input alphabet, «,» ∉ Σ is the left
                                                                   and right delimiter of an input word, respectively, 𝑆 is the
                                                                   finite set of transition states, 𝑠0 ∈ 𝑆 is the initial state, 𝐹
     RED (101 )                                                    is the finite set of final and reducing states, 𝑓 ∶ 𝑆 × (Σ ∪
                                                                   {»}) ⟶ (𝑆 ∪ 𝐹 ) is the transition function. Sets 𝐹 and 𝑆 are
                                                                   disjunctive. 𝐹 optionally contains accepting and rejecting
                                                                   final states ACC and ERR and finitely many reducing states
                   0    1   2    3        5         7    8
                                                                   of a form RED (𝑛), where 𝑛 is a sequence of 0 and 1 that
                   «   a    +    (        a         )   »          begins with 1 (𝑛 ∈ 1 ⋅ (1 |0 )∗ ). 0 means “leave a symbol at a
                                                                   corresponding position in a working list”, 1 means “delete
                                                                   a symbol at a corresponding position from a working list”.
                                                                   If 𝑓 (𝑠, 𝑎) = 𝑠 ′ , we say that the automaton 𝑀 moves from
                                                                   the state 𝑠 over the symbol 𝑎 to the new state 𝑠 ′ , or also
         𝑠0                                                        that the transition function switches the control unit of
                                                                   the automaton from the state 𝑠 over the symbol 𝑎 to the
                                                                   new state 𝑠 ′ .
                                                                       We extend the transition function 𝑓 to the domain
many states and the working head reads one item of the             (𝑆 ∪𝐹 ∪{RED })×(Σ∗ ⋅{𝜆, »}), where RED is the new auxiliary
list. The automaton starts its computation in the initial          state different from all states from 𝑆 and 𝜆 is an empty
state with the head placed on the very first item of the           word. We mark the new extended function as 𝛿. The
list containing the left delimiter «. At each step of the          function 𝛿 equals to the transition function 𝑓 for all pairs
computation the automaton moves its working head to                (𝑠, 𝑎) ∈ 𝑆 × (Σ ∪ {»}). For any other pair from its domain
the right onto the next item and changes its state accord-         it is defined as follows:
ing to the current state and the symbol contained in the
scanned item of the list. The new state is determined by                   𝛿(ACC , 𝑎) = ACC           𝛿(RED (𝑛), 𝑎) = RED
a transition function.                                                     𝛿(ERR , 𝑎) = ERR               𝛿(RED , 𝑎) = RED
   Some states of the automaton are final, some others
are reducing. The final states are ACC , the accepting state,      We define the reflexive and transitive closure 𝛿 ∗ of the
and ERR the rejecting state. Reaching the ACC state means          function 𝛿 in the usual way. We consider only automata
finishing the computation and accepting the word con-              with the extended transition function satisfying the fol-
tained in the working list. By switching into the ERR              lowing conditions for any 𝑠 ∈ 𝑆, 𝑎 ∈ Σ ∪ {»}, 𝑠 ′ ∈ 𝑆 ∪ 𝐹,
state, the automaton terminates the computation by re-             and 𝑢 ∈ Σ∗ ⋅ {𝜆, »}:
jecting the word being processed. RED (𝑛), 𝑛 ∈ 1 ⋅ (1 |0 )∗ , is
the reducing state. Just after transition into this state, the             𝛿(𝑠, ») = 𝑠 ′          ⟹ 𝑠′ ∈ 𝐹
automaton truncates the working list as prescribed by                      𝛿(𝑠, ») = RED (𝑛) ⟹ 0 is the suffix of 𝑛
the reducing sequence 𝑛, moves its working head to the                     𝛿(𝑠, 𝑎) = ACC          ⟹ 𝑎=»
beginning of the working list, and transfers its control                   𝛿 ∗ (𝑠0 , 𝑢) = RED (𝑛) ⟹ |𝑢| ≥ |𝑛|
unit to the initial state. The reducing sequence deter-              Characteristic constant of the automaton 𝑀 is the
mines items to be removed from the list in the following           length of the longest binary sequence contained in re-
way: If its 𝑖-th symbol from the right equals to 1 then the        ducing states of the automaton 𝑀,
automaton removes the 𝑖-th item from the list counting
from the item under the working head to the left (count-                          𝑘(𝑀) = max {|𝑛| ∣ RED (𝑛) ∈ 𝐹}                   (1)
ing the item under the working head as the first one). If
it equals to 0 the corresponding item remains in the list.            We describe the reduction of a word by a reduction se-
An example of truncation of the word « a+((a)) » by the            quence using the reduction operation / which we define
reducing sequence 101 can be seen in the figure 2. Final           as follows:
state closes a certain stage of automaton computation. By
                                                                       𝑎/0 = 𝑎     𝜆/𝑛 = 𝜆            (𝑢 ⋅ 𝑎)/𝜆 = 𝑢 ⋅ 𝑎
stage, we mean any part of the computation that starts
                                                                       𝑎/1 = 𝜆     𝑢/𝜆 = 𝑢       (𝑢 ⋅ 𝑎)/(𝑛 ⋅ 𝑖) = (𝑢/𝑛) ⋅ (𝑎/𝑖)
where 𝑢 ∈ Σ∗ , 𝑎 ∈ Σ∗ ∪ {»}, 𝑛 ∈ (10 ∗ )∗ and 𝑖 ∈ {0 , 1 }. Here           𝛿1∗ (𝑠1 , 𝑢) = ACC ⟺ 𝛿2∗ (𝑠2 , 𝑢) = ACC
we restrict neither the word 𝑢 nor the reduction sequence                  𝛿1∗ (𝑠1 , 𝑢) = ERR ⟺ 𝛿2∗ (𝑠2 , 𝑢) = ERR
𝑛 by any constant, 𝑢 can be longer than 𝑛 and vice versa.
The truncation of the words (a) , +a » by the reduction            Using the reduction relation, we can express a basic
sequences 101 , 110 respectively looks like this:                property of reducing automata called error and correctness
                                                                 preserving property.
              ( a )                       + a »
          / 1 0 1                   / 1 1 0                      Lemma 2.1. If «𝑤1 » ⇒𝑀 «𝑤2 », then 𝑤1 ∈ 𝐿(𝑀), iff
          =        a                =              »             𝑤2 ∈ 𝐿(𝑀).

Based on the reduction operation, we introduce reduction           Proof. Let’s assume that «𝑤1 » ⇒𝑀 «𝑤2 ». 1. If 𝑤2 ∈
relation. This allows us to describe how the automaton             𝐿(𝑀), then «𝑤2 » ⇒∗𝑀 «𝑤» ∈ «𝐿0 (𝑀)» for some 𝑤 and
successively rewrites the processed word 𝑤 ∈ Σ∗ . The              «𝑤1 » ⇒𝑀 «𝑤2 » ⇒∗𝑀 «𝑤» is the accepting analysis for the
reducing automaton 𝑀 reduces the word «𝑤» to the word              word 𝑤1 . 2. If 𝑤1 ∈ 𝐿(𝑀), then «𝑤1 » ⇒∗𝑀 «𝑤» ∈ «𝐿0 (𝑀)»
«𝑤 ′ »,                                                            for some 𝑤. This analysis starts by «𝑤1 » ⇒𝑀 «𝑤2 »,
                     «𝑤» ⇒𝑀 «𝑤 ′ »,                                otherwise 𝑀 wouldn’t be deterministic. So, we have
                                                                   «𝑤2 » ⇒∗𝑀 «𝑤» ∈ «𝐿0 (𝑀)».
if 𝛿 ∗ (𝑠0 , 𝑢) = RED (𝑛), 𝑤» = 𝑢𝑣, 𝑤 ′ » = (𝑢/𝑛)⋅𝑣 for some 𝑢 ∈
Σ∗ ⋅ {𝜆, »}. Obviously, |𝑤| > |𝑤 ′ | and the above reduction          Similarly to [3], we introduce monotony for reducing
is shortening. We refer to the reflexive and transitive            automata. A reducing automaton is monotone reducing
closure of the reduction relation as ⇒∗𝑀 . By reduction            automaton (mon-red-automaton, for short), if – at any
analysis of the automaton 𝑀 we mean any sequence of                stage – it visits all the items visited at the previous stage
reductions                                                         that remain in the working list. Formally, the reducing
                                                                   automaton 𝑀 is monotone if the following condition is
               «𝑤1 » ⇒ «𝑤2 » ⇒ … ⇒ «𝑤𝑛 »,                          satisfied for each 𝑤 ∈ Σ∗ ⋅ {𝜆, »}:
which cannot be extended further. If 𝛿 ∗ (𝑤𝑛 ») = ACC ,
                                                                    𝛿 ∗ (𝑠0 , 𝑤) = RED (𝑛) ⟹ 𝛿 ∗ (𝑠0 , (𝑤/𝑛)) ∉ {ERR , RED }
we are talking about an accepting reduction analysis,
otherwise it is a rejecting reduction analysis. We often use       Monotony is important for characterizing the deter-
the shorter term analysis instead of reduction analysis. ministic context-free languages (DCFL for short). De-
    The words over the alphabet Σ accepted by the reduc- terministic monotone restarting automata (det-mon-R-
ing automaton 𝑀 in one stage form its simple language automata) recognize just all deterministic context-free
                                                                languages. In the same way, we can characterize the
            𝐿0 (𝑀) = {𝑤 ∈ Σ∗ ∣ 𝛿 ∗ (𝑠0 , 𝑤») = ACC }.
                                                                DCFL class using mon-red-automata. An R-automaton
It is obvious that 𝐿0 (𝑀) is regular. We define the language differs from a red-automaton only in that its working
accepted or also recognized by the reducing automaton head is extended by a so-called lookahead window, which
𝑀 as the set of all words over the alphabet Σ for which scans a continuous subword of fixed length to the right
there exists an accepting analysis of the automaton 𝑀,          of the working head, and that it reduces the symbols
                                                                scanned by the working head and the lookahead win-
       𝐿(𝑀) = {𝑤 ∈ Σ∗ ∣ «𝑤» ⇒∗𝑀 «𝑤 ′ » ∈ «𝐿0 (𝑀)»}.             dow. Thus, any R-automaton can be simulated by a red-
                                                                automaton that stores the contents of the lookahead win-
    Let us assume that 𝑀1 = (Σ1 , «, », 𝑆1 , 𝑠1 , 𝐹1 , 𝑓1 ) and dow in its control unit. Conversely, any red-automaton
𝑀2 = (Σ2 , «, », 𝑆2 , 𝑠2 , 𝐹2 , 𝑓2 ) are any reducing automata. 𝑀 can be simulated by an R-automaton with a lookahead
We say that 𝑀1 and 𝑀2 are equivalent, if                        window of size 𝑘(𝑀). It is obvious that at each stage the
                                                                simulated and simulating automaton visit the same items
                         𝐿(𝑀1 ) = 𝐿(𝑀2 ).                       in the working list, so that monotony is preserved.
                 ′       ∗       ∗                                 Representation. A red-automaton can be represented
If for any 𝑤, 𝑤 ∈ (Σ1 ∪ Σ2 )
                                                                by a transition table in the same way as a finite-state
           𝐿0 (𝑀1 ) = 𝐿0 (𝑀2 ) and                              machine. A transition table of a red-automaton recog-
                            ′                       ′           nizing a language {a 𝑛 b 𝑛 ∣ 𝑛 ≥ 0} is shown in Table 1.
           «𝑤» ⇒𝑀1 «𝑤 » ⟺ «𝑤» ⇒𝑀2 «𝑤 »,                         Each state encodes a regular expression mentioned in
then these automata are reductionally equivalent. They the first column of the Table. So, a lookahead window of
are strongly equivalent if for any prefix «𝑢 of the word R-automata is in case of red-automata moved into their
«𝑤», the following holds                                        states.

        𝛿1∗ (𝑠1 , 𝑢) = RED (𝑛) ⟺ 𝛿2∗ (𝑠2 , 𝑢) = RED (𝑛)
Table 1                                                                  Table 2
Transition table.                                                        Graphical representation of grammar rules.
            𝑓                 a          b              »                                      classical         our
      →     𝑠0 = «            𝑠1         ERR            ACC                     rule        representation       representation
            𝑠1 = «a ∗ a       𝑠1         𝑠2             ERR                                       𝑋                   𝑎
            𝑠2 = «a ∗ ab      ERR        RED (110 )     RED (110 )              𝑋 → 𝑎𝑌           ↙ ↘                  ↑
                                                                                                𝑎    𝑌                𝑋→𝑌
                                                                                                  𝑋
3. Grammars                                                                     𝑋 →𝑌              ↓                   𝑋→𝑌
                                                                                                  𝑌
For any mon-red-automaton 𝑀, we construct a gram-                                                                     𝑎
mar 𝐺𝑀 whose derivation trees correspond closely to                                               𝑋                   ↑
computations of a given automaton 𝑀. We show that                               𝑋 → 𝑎𝑌𝑍          ↙↓↘                  𝑋→𝑌
this grammar generates the language recognized by the                                           𝑎 𝑌 𝑍                 ↓
automaton 𝑀 and that it is an LR(0) grammar.                                                                          𝑍
   Let’s assume that 𝑀 = (Σ, «, », 𝑆, 𝑠0 , 𝐹 , 𝑓 ) is a mon-                                       𝑋                  𝑋→𝑌
                                                                                𝑋 → 𝑌𝑍            ↙ ↘                 ↓
red-automaton. The grammar of the automaton 𝑀 is the
                                                                                                𝑌    𝑍                𝑍
grammar 𝐺𝑀 , which is obtained by reducing the grammar
                                                                                                   𝑋                  𝑋
                        𝐺 = (𝑉 , 𝑁 , 𝑆, 𝑃),                                     𝑋 →𝑎               ↓                  ↑
                                                                                                   𝑎                  𝑎
where 𝑉 = Σ ∪ {«, »} is the set of terminals; 𝑁 is the set                                         𝑋
of nonterminals containing 5-tuples (𝑎, 𝑠, 𝑢, 𝑜, 𝑣), where                      𝑋 →𝜆               ↓                  𝑋
𝑎 ∈ Σ ∪ {«, »}, 𝑠 ∈ 𝑆 ∪ (𝐹 ⧵ {ERR }), 𝑢 ∈ {𝜆, «} ⋅ Σ∗ ⋅ {𝜆, »},                                    𝜆
|𝑢| ≤ 𝑘(𝑀), 𝑣 ∈ Σ∗ ⋅ {𝜆, »}, |𝑣| ≤ 𝑘(𝑀), and 𝑜 ∈ {ACC } ∪
{RED (𝑛) ∣ ∃𝑛′ ∶ RED (𝑛𝑛′ ) ∈ 𝐹 }; 𝑆 = («, 𝑠0 , 𝜆, ACC , 𝜆) ∈ 𝑁
is the initial nonterminal; 𝑃 is the set of rules defined as                We draw derivation trees of grammar 𝐺𝑀 differently
follows:                                                                 than we are used to for context-free grammars. We start
                                                                         with the initial nonterminal 𝑆0 at the top right and expand
   𝑋 → 𝑎 𝑌 ∈ 𝑃,         if    𝑋 = (𝑎, 𝑠, 𝜆, RED (𝑛), (𝑏/𝑖) ⋅ 𝑣)          each nonterminal as indicated in Table 2. For example
                                  𝑌 = (𝑏, 𝑓 (𝑠, 𝑏), 𝜆, RED (𝑛 ⋅ 𝑖), 𝑣)   the rule 𝑋 → 𝑎𝑌 𝑍 is drawn so that its right hand side is
                                                                         written clockwise around the left hand side nonterminal
                        or    𝑋 = (𝑎, 𝑠, 𝜆, ACC , 𝜆)
                                                                         𝑋, the terminal 𝑎 at 12 o’clock, the nonterminal 𝑌 at 3
                                  𝑌 = (𝑏, 𝑓 (𝑠, 𝑏), 𝜆, ACC , 𝜆),         o’clock and the nonterminal 𝑍 at 6 o’clock. This yields
   𝑋 → 𝑌 ∈ 𝑃,           if    𝑋 = (𝑎, 𝑠, 𝑎𝑢, RED (𝑛), (𝑏/𝑖) ⋅ 𝑣)         derivation tree pictures in which each maximum non-
                                  𝑌 = (𝑏, 𝑓 (𝑠, 𝑏), 𝑢, RED (𝑛 ⋅ 𝑖), 𝑣)   terminal branch resembles a working list visited by the
                                                                         automaton during the stage of its computation. When
                        or    𝑋 = (𝑎, 𝑠, 𝑎𝑢, ACC , 𝜆)                    drawing the derivation tree we prevent the edges from
                                  𝑌 = (𝑏, 𝑓 (𝑠, 𝑏), 𝑢, ACC , 𝜆),         crossing. We omit 𝜆-rules.
   𝑋 → 𝑎 𝑌 𝑍 ∈ 𝑃,       if    𝑋 = (𝑎, 𝑠, 𝜆, 𝑜, 𝑣)                           Suppose 𝑤 ⇒𝑀 𝑤 ′ for some words 𝑤 and 𝑤 ′ . We show
                                                                         how to move from a derivation tree that gives the word
                                  𝑌 = (𝑏, 𝑓 (𝑠, 𝑏), 𝜆, RED (1 ), 𝑥)
                                                                         𝑤 to a derivation tree that gives the word 𝑤 ′ . First, we
                               𝑍 = (𝑎, 𝑠, 𝑎𝑥, 𝑜, 𝑣),                     introduce the following notation: Let
   𝑋 → 𝑌 𝑍 ∈ 𝑃,         if    𝑋 = (𝑎, 𝑠, 𝑎𝑢, 𝑜, 𝑣)
                                                                                              𝑋 = (𝑎, 𝑠, 𝑢, 𝑜, 𝑣)
                                  𝑌 = (𝑏, 𝑓 (𝑠, 𝑏), 𝑢, RED (1 ), 𝑥)
                               𝑍 = (𝑎, 𝑠, 𝑎𝑥, 𝑜, 𝑣),                     be any nonterminal of the grammar 𝐺𝑀 . Then
   𝑋 → 𝑎 ∈ 𝑃,           if    𝑋 = (𝑎, RED (𝑛), 𝜆, RED (𝑛), 𝜆)                                𝑋 𝜆 = (𝑎, 𝑠, 𝜆, 𝑜, 𝑣).
                        or    𝑋 = (𝑎, ACC , 𝜆, ACC , 𝜆),
                                                                         The following implications follow directly from the defi-
   𝑋 → 𝜆 ∈ 𝑃,           if    𝑋 = (𝑎, RED (𝑛), 𝑎, RED (𝑛), 𝜆)            nition of the rules of the grammar 𝐺𝑀 , where 𝑎 is a symbol
                        or    𝑋 = (𝑎, ACC , 𝑎, ACC , 𝜆),                 in the first component of 𝑋 and 𝜔 ∈ 𝑁 ∪ {𝜆}:
for any 𝑠 ∈ 𝑆 ∪ (𝐹 ⧵ {ERR }), 𝑢 ∈ {𝜆, «} ⋅ Σ∗ ⋅ {𝜆, »}, |𝑢| ≤ 𝑘(𝑀),                       𝑋 ∈ 𝑉 ⟹ 𝑋𝜆 ∈ 𝑉                          (2)
𝑣 ∈ Σ∗ ⋅ {𝜆, »}, |𝑣| ≤ 𝑘(𝑀), 𝑛 ∈ (10 ∗ )+ , 𝑖 ∈ {0 , 1 }, 𝑎 ∈ Σ ∪ {«},
                                                                                 𝑋 → 𝑎 𝑌𝑍 ∈ 𝑃 ⟹ 𝑋 = 𝑍𝜆                            (3)
and 𝑏 ∈ Σ ∪ {»}. 𝐺 (and therefore 𝐺𝑀 as well) is obviously
a context-free grammar.                                                            𝑋 → 𝑌 𝜔 ∈ 𝑃 ⟹ 𝑋 𝜆 → 𝑎 𝑌 𝜆𝜔 ∈ 𝑃                 (4)
Figure 3: Function 𝑅 applied to a derivation tree 𝑇 with the first two branches shown in Fig. 4a. The function 𝑅 transforms
these two branches into the single one shown in Fig. 4b, the rest of tree keeping unchanged.


                         «      𝑎1     …      𝑎𝑖−1     𝑎𝑖      𝑎𝑖+1        …      𝑎𝑚1         𝑎𝑗′        …      𝑎𝑚′ 2
                         ↑       ↑              ↑       ↑          ↑                ↑           ↑                  ↑
                        𝑆0    → 𝑆1   → ⋯     → 𝑆𝑖−1 → 𝑆𝑖      → 𝑆𝑖+1 → ⋯         → 𝑆𝑚
                                                                                        1

                                                       ↓
                                                       𝑆𝑖′               → ⋯                → 𝑆𝑗′       → ⋯    → 𝑆𝑚′
                                                                                                                       2


                               (a) the first two maximum nonterminal paths of the original tree 𝑇


                         «      𝑎1     …      𝑎𝑖−1     𝑎𝑖′     𝑎𝑖+1        …      𝑎𝑚1         𝑎𝑗′        …      𝑎𝑚′ 2
                         ↑       ↑              ↑       ↑                                       ↑                  ↑
                        𝑆0    → 𝑆1   → ⋯     → 𝑆𝑖−1 → 𝑆𝑖′𝜆               → ⋯                → 𝑆𝑗′𝜆      → ⋯    → 𝑆𝑚′𝜆
                                                                                                                       2


                               (b) the first maximum nonterminal path of the resulting tree 𝑅(𝑇 )


            𝑋 → 𝜆 ∈ 𝑃 ⟹ 𝑋𝜆 → 𝑎 ∈ 𝑃                           (5)         (i) 𝑅(𝑇 ) is the derivation tree of the grammar 𝐺𝑀 ,
        𝑋 → 𝑎 𝑌 𝜔 ∈ 𝑃 ⟹ 𝑋 = 𝑋 𝜆 and 𝑌 = 𝑌 𝜆                  (6)        (ii) 𝑅(𝑇 ) contains one less maximal nonterminal path
                                                                             than 𝑇,
Next, we introduce a function 𝑅 that assigns a tree 𝑅(𝑇 )
to any derivation tree 𝑇 of the grammar 𝐺𝑀 with at least               (iii) if 𝑇 gives the word 𝑤 and 𝑅(𝑇 ) gives the word 𝑤 ′ , then
two maximum nonterminal paths. We show that the                              𝑤 ⇒𝑀 𝑤 ′ .
resulting tree is a derivation tree of the grammar 𝐺𝑀 .
Suppose that the first two maximum nonterminal paths                   Proof. Suppose that the first two maximum nonterminal
of the tree 𝑇, together with terminals they generate, are              paths of the tree 𝑇, together with the generated terminals,
drawn in Figure 4a. Note that the vertex denoted by the                look like paths depicted in Figure 4a.
nonterminal 𝑆𝑖 is the branching point of these nontermi-                  (i). It follows from (2) that the vertices 𝑆𝑖′𝜆 , …, 𝑆𝑗′𝜆 , …,
nal paths. We construct the tree 𝑅(𝑇 ) in the following                 ′𝜆
                                                                       𝑆𝑚2 in Figure 4b are nonterminals of the grammar 𝐺𝑀 .
way:                                                                      The following rules are used in the first maximal non-
    1. Remove the terminal vertices 𝑎𝑖 , …, 𝑎𝑚1 and 𝑎𝑗′ , …,           terminal path of the tree 𝑇 for some 𝜔 ∈ 𝑉 ∪ {𝜆}:
        ′ .
       𝑎𝑚2                                                                                          𝑆𝑖−1 → 𝑎𝑖−1 𝑆𝑖 𝜔
    2. Remove the vertices 𝑆𝑖 , … , 𝑆𝑚1 .                                                              𝑆𝑖 → 𝑎𝑖 𝑆𝑖+1 𝑆𝑖′
    3. For each 𝑙 ∈ {𝑖, … , 𝑚2 } do the following:
                                                                       According to (3), 𝑆𝑖 = 𝑆𝑖′𝜆 . So
           a) replace the vertex 𝑆𝑙′ with the vertex 𝑆𝑙′𝜆
           b) add the terminal vertex 𝑎𝑙′ , where 𝑎𝑙′ is the                    𝑆𝑖−1 → 𝑎𝑖−1 𝑆𝑖′𝜆 𝜔
              symbol in the first component of the non-
              terminal 𝑆𝑙′𝜆                                  is a rule of grammar 𝐺𝑀 . The following rules are used in
                                                             the second maximal nonterminal path of the tree 𝑇 for
           c) add an edge from 𝑆𝑙′𝜆 to 𝑎𝑙′                   some 𝜔𝑖+1 , …, 𝜔𝑚2 ∈ 𝑉 ∪ {𝜆}:
    4. Add a horizontal edge from the vertex 𝑆𝑖−1 to the
       vertex 𝑆𝑙′𝜆 .                                                                                  𝑆𝑖′ → 𝑆𝑖+1
                                                                                                             ′ 𝜔
                                                                                                                 𝑖+1
                                                                                                          ⋮
Function 𝑅 converts the first two maximum nonterminal                                                ′ → 𝑆′ 𝜔
                                                                                                    𝑆𝑗−1     𝑗 𝑗
paths of the tree 𝑇 (including the terminals they generate)
in Figure 4a to the path drawn in Figure 4b.                                                          𝑆𝑗′ → 𝑎𝑗′ 𝑆𝑗+1
                                                                                                                 ′ 𝜔
                                                                                                                     𝑗+1

Lemma 3.1. For every derivation tree 𝑇 of the grammar                                                     ⋮
𝐺𝑀 that lies in the domain of the function 𝑅 (contains                                       𝑆𝑚′ 2 −1 → 𝑎𝑚′     𝑆′ 𝜔
                                                                                                            2 −1 𝑚2 𝑚2
at least two maximal nonterminal paths), the following
                                                                                                 𝑆𝑚′ 2 → 𝑎𝑚
                                                                                                          ′
statements hold:                                                                                            2
According to (4), (5) and (6) the following rules                                Proof. If «𝑤 ′ » ⇒𝑀 «𝑤», then for some 𝑢, 𝑣, and 𝑛 the
                                                                                 word 𝑢𝑣 is a prefix of 𝑤 ′ », 𝛿 ∗ (𝑠, 𝑢𝑣) = RED (𝑛), and |𝑣| = |𝑛|.
                            𝑆𝑖′𝜆 → 𝑎𝑖′ 𝑆𝑖+1
                                        ′𝜆 𝜔
                                            𝑖+1                                  Let 𝑆0 → 𝑆1 → … → 𝑆𝑚 be the first maximal nonterminal
                                   ⋮                                             path of the derivation tree 𝑇 and the first components of
                           ′𝜆 → 𝑎 ′ 𝑆 ′𝜆 𝜔                                       nonterminals on this path contain the terminal symbols
                          𝑆𝑗−1   𝑗−1 𝑗    𝑗                                      «, 𝑎0 , 𝑎1 , …, 𝑎𝑚 . Since the automaton 𝑀 is monotone,
                            ′𝜆    ′ ′𝜆
                           𝑆𝑗 → 𝑎𝑗 𝑆𝑗+1 𝜔𝑗+1                                     𝑚 ≥ |𝑢𝑣/𝑛| = 𝑙 and 𝑢𝑣/𝑛 = 𝑎1 … 𝑎𝑘 … 𝑎𝑙 , where 𝑘 = |𝑢|.
                                                                                    The tree 𝑇 ′ is obtained from the tree 𝑇 in the following
                                   ⋮
                                                                                 way:
                        𝑆𝑚′𝜆2 −1 → 𝑎𝑚
                                    ′     𝑆 ′𝜆 𝜔
                                      2 −1 𝑚2 𝑚2
                                                                                      1. Remove the terminal symbols 𝑎𝑘 , …, 𝑎𝑙 .
                           𝑆𝑚′𝜆2 → 𝑎𝑚
                                    ′
                                      2
                                                                                      2. Replace each nonterminal 𝑆𝑗 , 𝑘 ≤ 𝑗 ≤ 𝑙, on the
are rules of the grammar 𝐺𝑀 . The tree 𝑅(𝑇 ) is therefore                                first nonterminal path with the nonterminals 𝑆𝑗′ ,
the derivation tree of the grammar 𝐺𝑀 .                                                  which we get from 𝑆𝑗 by replacing the empty word
   (ii). The construction of the tree 𝑅(𝑇 ) from the tree 𝑇                              𝜆 in the third component with the word 𝑎𝑗 … 𝑎𝑙 .
involves the removal of the suffix of the first nonterminal
path just behind the branching point with the second                                  3. Add new nonterminals
nonterminal path. All other nonterminals are eventually                                              𝑆𝑘″ = (𝑎𝑘 , 𝑠𝑘 , 𝜆, 𝑜, 𝑣) = 𝑆𝑘 ,
replaced by other nonterminals, and all other edges be-                                              ″ = (𝑏 , 𝑠 , 𝜆, RED (𝑛′ ), 𝑣 /𝑛 ),
tween nonterminals in the tree remain unchanged. Thus,                                              𝑆𝑘+𝑖   𝑖 𝑘+𝑖           𝑖     𝑖 𝑖
the derivation tree 𝑅(𝑇 ) contains one less maximal non-
                                                                                          where 𝑏𝑖 is the 𝑖-th symbol of the word 𝑣 from
terminal path than the derivation tree 𝑇.
                                                                                          the left, 𝑣𝑖′ is the prefix of the word 𝑣 of length 𝑖,
   (iii). If the tree 𝑇 gives the word 𝑤 and its first two
                                                                                          𝑣𝑖′ 𝑣𝑖 = 𝑣, 𝑛𝑖′ is the prefix of the reduction sequence
maximum nonterminal paths are shown in Figure 4a,
                                                                                          of length 𝑖, 𝑛𝑖′ 𝑛𝑖 = 𝑛 and 𝑠𝑘+𝑖 = 𝛿(𝑠𝑘 , 𝑣𝑖′ ) for all
then obviously
                                                                                          𝑖 ∈ {1, … , |𝑣|}.
                       𝑤 = 𝑎0 𝑎1 … 𝑎𝑖 𝑎𝑖+1 … 𝑎𝑚1 𝑥                                    4. Add a terminal 𝑎𝑘 and an edge from the 𝑆𝑘″ ter-
                                                                                         minal to this terminal. Similarly, we add the ter-
for some 𝑥. The nonterminal 𝑆𝑖 is the last common non-
                                                                                         minals 𝑏1 , …, 𝑏|𝑣| and connect them to the new
terminal of the first two nonterminal paths of the tree
                                                                                         derivation tree with an edge leading along the
𝑇 and is rewritten to 𝑎𝑖 𝑆𝑖+1 𝑆𝑖′ using the grammar 𝐺𝑀                                                   ″ , …, 𝑆 ″ .
                                                                                         line from the 𝑆𝑘+1       𝑘+|𝑣|
rule. The definition of the grammar 𝐺𝑀 rule for each
𝑙 ∈ {𝑖, … , 𝑚1 − 1} implies that 𝑠𝑙+1 = 𝑓 (𝑠𝑙 , 𝑎𝑙+1 ), where 𝑠𝑙 is                   5. We join the nonterminal 𝑆𝑘″ to the new derivation
the state contained in the second component of the non-                                  tree by an edge leading from the nonterminal 𝑆𝑘−1 .
terminal 𝑆𝑙 , 𝑠𝑚1 = RED (𝑛) for some reduction sequence                                  We use the grammar rules 𝑆𝑘−1 → 𝑎𝑘−1 𝑆𝑘″ 𝑆𝑘′ . Sim-
𝑛 is the state contained in the second component of the                                                                     ″ , …, 𝑆 ″
                                                                                         ilarly, we join the nonterminals 𝑆𝑘+1      𝑘+|𝑣| to
nonterminal 𝑆𝑚1 ,                                                                        the derivation tree using the rules 𝑆𝑘″ → 𝑎𝑘 𝑆𝑘+1
                                                                                                                                         ″ ,
                                                                                              ″                  ″
                                                                                         …, 𝑆𝑘+|𝑣|−1 → 𝑎𝑘+|𝑣|−1 𝑆𝑘+|𝑣| .
                             𝑎𝑖
                              ↑                                                  The above construction yields a derivation tree 𝑇 ′ which
                                                                                 gives the word «𝑤 ′ » and 𝑅(𝑇 ′ ) = 𝑇. We can check the
      (𝑎𝑖 , 𝑠𝑖 , 𝜆, 𝑜, 𝑣) = 𝑆𝑖         → 𝑆𝑖+1 = (𝑎𝑖+1 , 𝑠𝑖+1 , 𝜆, RED (1 ), 𝑢)
                                                                                 correctness of this construction using the definition of
                                                                                 nonterminals and the rules of the grammar 𝐺𝑀 .
                              ↓
  (𝑎𝑖 , 𝑠𝑖 , 𝑎𝑖 ⋅ 𝑢, 𝑜, 𝑣) = 𝑆𝑖′                                                 Lemma 3.3. Let 𝑝 be any natural number, 𝑇1 , …, 𝑇𝑝 be
                                                                                 any derivation trees of the grammar 𝐺𝑀 , and 𝑤1 , …, 𝑤𝑝
for some words 𝑢 and 𝑣 and some operation 𝑜, and 𝑢 =                             any words from « Σ∗ ». If at the same time
                    ′ … 𝑎 ′ . Together we get that 𝑤 ′ =
(𝑎𝑖+1 … 𝑎𝑚1 )/𝑛 = 𝑎𝑖+1    𝑗−1
𝑎0 𝑎1 … 𝑎𝑖 𝑢𝑥, and hence 𝑤 ⇒𝑀 𝑤 ′ .                                                    • 𝑇𝑖 = 𝑅(𝑇𝑖+1 ) for all 𝑖 ∈ {1, … , 𝑝 − 1},

Lemma 3.2. For any derivation tree 𝑇 of the grammar 𝐺𝑀          • 𝑇𝑖 gives the word 𝑤𝑖 for all 𝑖 ∈ {1, … , 𝑝},
and any words «𝑤» and «𝑤 ′ », such that 𝑇 gives the word        • 𝑇1 contains exactly one maximal nonterminal path,
«𝑤» and «𝑤 ′ » ⇒𝑀 «𝑤», there is a derivation tree 𝑇 ′ of
the grammar 𝐺𝑀 that gives the word «𝑤 ′ » and 𝑅(𝑇 ′ ) = 𝑇. then at the same time
                                                                                       • 𝑤𝑖+1 ⇒𝑀 𝑤𝑖 for all 𝑖 ∈ 1, … , 𝑝 − 1,
     • 𝑤1 is accepted by the 𝑀 automaton in a single stage. Lemma 3.4. Let 𝑝 be any natural number and 𝑤1 , …, 𝑤𝑝
                                                            any words from « Σ∗ ». If at the same time
Proof. We prove the theorem by induction on the nat-
ural number 𝑝. Obviously, 𝑇𝑖 contains just 𝑖 maximal            • 𝑤𝑖+1 ⇒𝑀 𝑤𝑖 for all 𝑖 ∈ {1, … , 𝑝 − 1},
nonterminal paths.
                                                                • 𝑤1 is accepted by the 𝑀 automaton in a single stage.
  𝑝 = 1. Let 𝑇1 be the derivation tree of the grammar
𝐺𝑀 , which gives the word 𝑤1 and contains exactly one then for some derivation trees 𝑇1 , …, 𝑇𝑝 of the grammar
maximal nonterminal path                                    𝐺𝑀

       𝑆0     → 𝑆1   →⋯      → 𝑆𝑖−1   → 𝑆𝑖    →⋯     → 𝑆𝑚 .               • 𝑇𝑖 = 𝑅(𝑇𝑖+1 ) for all 𝑖 ∈ {1, … , 𝑝 − 1},
                                                                 • 𝑇𝑖 gives the word 𝑤𝑖 for all 𝑖 ∈ {1, … , 𝑝}.
This means that no rule with two nonterminals on the
right-hand side is used in this tree. Thus, only the fol- Proof. We prove the theorem by induction on the natural
lowing rules are used:                                       number 𝑝.
                                                               𝑝 = 1. Suppose that the word 𝑤 is accepted by the
                   𝑆𝑖−1 → 𝑎𝑖 | 𝑎𝑖 𝑆𝑖 | 𝑆𝑖 | 𝜆                automaton 𝑀 in one stage. So for some 𝑠0 , …, 𝑠𝑚 (𝑠0 is
The word in the third component of the nonterminal 𝑆𝑖 the initial state of 𝑀, 𝑠𝑚 = ACC ) and 𝑎0 , 𝑎1 , …, 𝑎𝑚 (𝑎0 = «,
is empty (for all 𝑖 ∈ {1, … , 𝑚}) because it is empty in the 𝑎𝑚 = »), and for each 𝑖 ∈ {1, … , 𝑚}
initial nonterminal 𝑆0 and for all rules 𝑆𝑖−1 → 𝑎𝑖 𝑆𝑖 | 𝑆𝑖 ,                                  𝛿(𝑠𝑖−1 , 𝑎𝑖 ) = 𝑠𝑖 .
the word in the third component of the nonterminal 𝑆𝑖
is not longer than the word in the third component of                It follows that
the nonterminal 𝑆𝑖−1 . Thus, only the following rules are
used in the tree 𝑇1 :                                                                      𝑆𝑖 = (𝑎𝑖 , 𝑠𝑖 , 𝜆, ACC , 𝜆)

                        𝑆𝑖−1 → 𝑎𝑖 | 𝑎𝑖 𝑆𝑖                            is the nonterminal of the grammar 𝐺𝑀 , 𝑆0 is the initial
                                                                     nonterminal of the grammar 𝐺𝑀 and
and the calculation tree 𝑇1 for the word 𝑤1 looks like this:
                                                                                 𝑆𝑖−1 → 𝑎𝑖−1 𝑆𝑖                          𝑆𝑚 → 𝑎𝑚
        «      𝑎1      …      𝑎𝑖−1      𝑎𝑖     …      𝑎𝑚
         ↑       ↑              ↑        ↑              ↑            are the rules of the grammar 𝐺𝑀 . So we can construct
                                                                     the following derivation tree of the grammar 𝐺𝑀 , which
        𝑆0    → 𝑆1    →⋯     → 𝑆𝑖−1    → 𝑆𝑖   →⋯     → 𝑆𝑚            gives the word 𝑤 = «𝑎1 … 𝑎𝑚−1 »:

It follows immediately from the definition of the grammar                   «      𝑎1        …       𝑎𝑖−1        𝑎𝑖         …       »
                                                                             ↑         ↑                ↑            ↑               ↑
𝐺𝑀 rules that 𝑓 (𝑠𝑖−1 , 𝑎𝑖 ) = 𝑠𝑖 for all 𝑖 ∈ {1, … , 𝑚}, where 𝑠0
is the state in the second component of the initial nonter-                       → 𝑆1     →⋯       → 𝑆𝑖−1      → 𝑆𝑖       →⋯      → 𝑆𝑚
                                                                            𝑆0
minal 𝑆0 and also the initial state of the automaton 𝑀, 𝑠𝑖
is the state in the second component of the nonterminal
                                                                       Induction step. Assume that the theorem holds for
𝑆𝑖 .
                                                                     𝑝 = 𝑞 ≥ 1. We show that it is also true for 𝑝 = 𝑞 + 1. Let
     Because the fourth components of the nonterminals
                                                                     𝑤 be an arbitrary word of the language « 𝐿(𝑀) » that the
𝑆𝑖−1 and 𝑆𝑖 are the same for all 𝑖 ∈ {1, … , 𝑚} and there is
                                                                     automaton 𝑀 accepts in exactly 𝑞 stages. Let further 𝑤 ′
ACC in the fourth component of the initial nonterminal
                                                                     be any word such that 𝑤 ′ ⇒𝑀 𝑤. Thus, by the induction
𝑆0 , the operation ACC is contained in all nonterminals of
                                                                     assumption, some derivation tree 𝑇 of the grammar 𝐺𝑀
the tree 𝑇1 .
                                                                     yields the word 𝑤. Then, according to 3.2, there is a
     The automaton 𝑀 only accepts after moving over the
                                                                     derivation tree 𝑇 ′ which gives the word 𝑤 ′ and 𝑇 =
right delimiter », so 𝑎𝑚 = ». We get 𝑎0 𝑎1 … 𝑎𝑚−1 𝑎𝑚 = 𝑤1
                                                                     𝑅(𝑇 ′ ). So the theorem holds for the word 𝑤 ′ , which the
and
                                                                     automaton 𝑀 accepts in 𝑞 + 1 stages.
                 𝛿 ∗ (𝑠0 , 𝑎1 … 𝑎𝑚−1 𝑎𝑚 ) = ACC .
   Induction step. Suppose the theorem holds for 𝑝 = 𝑞 ≥              The following theorem is a direct consequence of lem-
1. We show that it is also true for 𝑝 = 𝑞 + 1. Let 𝑇 be any          mata 3.3 and 3.4.
derivation tree of the grammar 𝐺𝑀 with 𝑞 + 1 maximal
                                                                     Theorem 3.1. 𝐿(𝐺𝑀 ) = «𝐿(𝑀)».
nonterminal paths, which yields the word 𝑤. Then, by
3.1, 𝑇 ′ = 𝑅(𝑇 ) is a derivation tree of the grammar 𝐺𝑀                 The next theorem says that the grammar 𝐺𝑀 of the
with 𝑞 maximal nonterminal paths, and if it gives the                monotone reducing automaton 𝑀 is suitable for con-
word 𝑤 ′ , then 𝑤 ⇒𝑀 𝑤 ′ . For the tree 𝑇 ′ the theorem by           structing a classical syntactic parser for the language
induction holds, so it also holds for the tree 𝑇.                    recognized by this automaton.
                                                                     We define the characteristic of the item 𝑝 = 𝑋 → 𝛼 . 𝛽 as
Theorem 3.2. 𝐺𝑀 is an LR(0) grammar.
                                                                     follows:
Proof. First, let us recall some classical notions intro-
duced in the context of LR(0) grammars, as given, for [𝑝] = ([𝑋 ], 𝛼, [𝛽])
example, in [4].
                                                               The characteristic of the set of items 𝐼 of the grammar 𝐺𝑀
   We call any expression of type
                                                               is defined as the following set:
                         𝑋 → 𝛼 . 𝛽,
                                                                [𝐼 ] = {[𝑝] ∣ 𝑝 ∈ 𝐼 }
an item of the context-free grammar 𝐺, where 𝑋 → 𝛼𝛽 is
a rule of the grammar 𝐺. In particular, 𝑋 → . is an item if        Checking all types of rules of the grammar 𝐺𝑀 , we
𝑋 → 𝜆 is a rule of grammar 𝐺. For each rule 𝑋 → 𝛾, we get that the first two components of item’s characteristic
call 𝑋 → 𝛾 . a complete item. We say that 𝐴 → 𝛼 . 𝛽 is a uniquely determine its third component. More precisely,
valid item for the string 𝜔 if there exists a right sentential if (𝑐1,1 , 𝛼1 , 𝑐1,2 ) and (𝑐2,1 , 𝛼2 , 𝑐2,2 ) are characteristics of two
form 𝜉 𝐴𝑢 (𝑢 is a string of terminals) such that 𝜉 𝛼 = 𝜔. items of the grammar 𝐺𝑀 , then
We use the notation 𝐼 (𝜔) for the set of all valid items for
the string 𝜔 .                                                       (𝑐1,1 = 𝑐2,1 and 𝛼1 = 𝛼2 )                   ⟹      𝑐1,2 = 𝑐2,2
   A context-free grammar 𝐺 = (𝑉 , 𝑁 , 𝑆, 𝑃) is called an
LR(0) grammar if it satisfies the following conditions:            A set of items 𝐼 has a continuous characteristic if
     1. The initial symbol 𝑆 does not appear on the right-                    [𝐼 ] = {(𝑐0 , 𝛼, 𝑐1 ), (𝑐1 , 𝜆, 𝑐2 ), … , (𝑐𝑛−1 , 𝜆, 𝑐𝑛 )}
        hand side of any rule.
                                                                     for some 𝑐0 , 𝑐1 , 𝑐2 , …, 𝑐𝑛−1 , 𝑐𝑛 and 𝛼.
     2. No reduce/reduce conflict. For any string 𝛾 ∈
                                                                        By verifying all types of rules of the grammar 𝐺𝑀 ,
        (𝑉 ∪ 𝑁 )∗ , there is at most one complete item in
                                                                     we can prove the validity of the following statements,
        the set 𝐼 (𝛾 ).
                                                                     which say that construction of the states of the item
     3. No shift/reduce conflict. If a complete item occurs          automaton creates only sets of items with a continuous
        in 𝐼 (𝛾 ), then there is no item with a terminal to          characteristic.
        the right of the dot in 𝐼 (𝛾 ).
                                                                          1. 𝐼 (𝜆) has a continuous characteristic.
The first condition is satisfied for grammar 𝐺𝑀 , because
the initial nonterminal 𝑆 does not occur on the right-hand                2. If 𝐼 has a continuous characteristic, 𝑝 = 𝐴 →
side of any rule of 𝐺𝑀 .                                                     𝛼 . 𝐵𝛽 ∈ 𝐼 and 𝐼𝑝 = {𝐵 → . 𝛾 ∣ 𝐵 → 𝛾 ∈ 𝐺𝑀 }, then
   We prove the remaining two conditions using a method                      𝐼 ∪ 𝐼𝑝 has a continuous characteristic.
taken from [4] which is based on the notion of character-                 3. If 𝐼 has a continuous characteristic, then for each
istic. We define the characteristic sequentially, first for                  symbol 𝑥 of the grammar 𝐺𝑀 , the set 𝐼𝑥 = {𝐴 →
terminal and nonterminal symbols, then for strings of ter-                   𝛼𝑥 . 𝛽 ∣ 𝐴 → 𝛼 . 𝑥𝛽 ∈ 𝐼 } also has a continuous
minal and nonterminal symbols, then for item sets of the                     characteristic.
grammar, and finally for item sets obtained by construct-
ing the states of the item automaton of the grammar 𝐺𝑀 .             Thus, any state 𝐼 (𝛾 ) has a continuous characteristic. The
We show that each item set that is a state of the item               definition of a set of items with continuous characteris-
automaton has a continuous characteristic, which results             tic implies that such a set can contain at most one triple
in the absence of both types of conflicts.                           whose third component is empty or terminal . Each com-
   For each terminal or nonterminal symbol 𝑥 of grammar              plete item 𝐴 → 𝛼 . has characteristic ([𝐴], 𝛼, empty ), and
𝐺𝑀 , we define the characteristic [𝑥] of the symbol 𝑥 as             each item 𝐴 → 𝛼 . 𝛽 containing a terminal to the right
follows:                                                             of the dot has characteristic ([𝐴], 𝛼, terminal ). There-
        initial ,    if 𝑥 = 𝑆,                                       fore, any set of items with a continuous characteristic
      ⎧
      ⎪terminal ,                                                    can contain at most one complete item, and if it does, it
[𝑥] =                if 𝑥 ∈ 𝑉
      ⎨(𝑎, 𝑠, 𝑢),                                                    no longer contains any item with a terminal to the right
      ⎪              if 𝑥 = (𝑎, 𝑠, 𝑢, 𝑜, 𝑣) ∈ 𝑁 ⧵ {𝑆}                of the dot.
      ⎩              for some 𝑜 and 𝑣.                                  Thus, each set 𝐼 (𝛾 ) satisfies the second and third con-
The characteristic of the strings 𝛼 formed by terminals              ditions of the definition of LR(0) grammar and 𝐺𝑀 is an
and nonterminals of the grammar 𝐺𝑀 is defined by the                 LR(0) grammar.
following rule:
         empty ,   if 𝛼 = 𝜆,
[𝛼] = {[𝑥],        if 𝑥 ∈ 𝑉 ∪ 𝑁 and 𝛼 = 𝑥𝛼 ′
                   for some 𝛼 ′ .
4. Normalizations                                            the first nonterminal path of any of grammar’s derivation
                                                             tree uses rules of types listed in the first row only, i.e. the
Normalized mon-red-automata form a subclass of reduc- rules with the right hand side starting with the terminal.
ing automata. A normalized mon-red-automaton is any Thus, the last rule used in the first maximal nonterminal
mon-red-automaton that is both prefix-correct and state path of any derivation tree of grammar 𝐺𝑀 is of type 𝑋 →
minimal. In this section, we show how to construct an 𝑎, and the first complete item encountered by the item
equivalent normalized mon-red-automaton to any mon- automaton is of type 𝑋 → 𝑎 .. Therefore, we construct
red-automaton that accepts a non-empty language.             the states of the reducing automaton 𝑀 ′ only from the
   We use a grammar of mon-red-automaton constructed following types of items
in the previous section to construct an equivalent prefix-
correct mon-red-automaton. Then we show how to min-                    𝑋 → .𝑎𝑌𝑍          𝑋 → .𝑎𝑌            𝑋 → .𝑎
imize the set of states of any red-automaton including                 𝑋 → 𝑎.𝑌𝑍          𝑋 → 𝑎.𝑌            𝑋 → 𝑎.
the elimination of unreachable states.
                                                             and we consider the transitions between item sets over
                                                             terminal symbols only.
4.1. Prefix-correctness                                         Now let us describe a construction of the prefix-correct
Suppose that 𝑀 = (Σ, «, », 𝑆, 𝑠0 , 𝐹 , 𝑓 ) is a monotone re- automaton using just mentioned principles. For any 𝑢 ∈
ducing automaton. We say that 𝑀 is prefix-correct if for Σ∗ and 𝑎 ∈ Σ ∪ {«, »} we define the item sets using the
any word 𝑢 ∈ Σ∗ and reduction sequence 𝑛 the following following rules
implications hold:
                                                                 𝐼 (𝜆) = {𝑆 → . « 𝛾 ∣ 𝑆 → « 𝛾 ∈ 𝑃},
             𝛿 ∗ (𝑠0 , 𝑢) ∈ 𝑆 ⟹ ∃𝑣 ∈ Σ∗ ∶ 𝑢𝑣 ∈ 𝐿(𝑀)            𝐼 (𝑢𝑎) = {𝑋 → 𝑎 . 𝛼 ∣ 𝑋 → . 𝑎 𝛼 ∈ 𝐼 (𝑢)}
       𝛿 ∗ (𝑠0 , 𝑢) = RED (𝑛) ⟹ ∃𝑣 ∈ Σ∗ ∶ 𝑢𝑣 ∈ 𝐿(𝑀)                       ∪ {𝑌 → . 𝑏 𝛽 ∣ 𝑋 → . 𝑎 𝑌 𝛾 ∈ 𝐼 (𝑢) a 𝑌 → 𝑏 𝛽 ∈ 𝑃},
       ∗
      𝛿 (𝑠0 , 𝑢») = RED (𝑛) ⟹ 𝑢 ∈ 𝐿(𝑀)
                                                         where 𝑆 is the starting nonterminal of the grammar 𝐺𝑀 .
                                                         We obtain the above item sets utilizing slightly modified
The first two implications say that any word 𝑢 ∈ Σ∗ , over
which the automaton 𝑀 moves its working head while       method already used in classical theory of parsing to
                                                         construct an item automaton for a given LR(0) grammar.
switching its control unit from the initial state 𝑠0 to any
                                                         Here, we consider only items of the above types. Thus,
transition or reducing state, is a prefix of some word from
the language 𝐿(𝑀). The third implication states that any our item sets are subsets of states of the classical item
word 𝑢 » ∈ Σ∗ », over which the automaton 𝑀 moves        automaton, and hence contain neither shift/reduce nor re-
its working head while switching its control unit from   duce/reduce conflict. A set of all nonterminal states of the
                                                         reducing automaton 𝑀 ′ consists of all constructed item
the initial state 𝑠0 to any reducing state, is a word of the
language 𝐿(𝑀)». However, the automaton may need a        sets except 𝐼 (𝜆) and sets containing only a complete item.
few more stages to formally accept it.                   Its initial state is the set 𝐼 («). The reducing states are just
                                                         all RED (𝑛) for which there exists a complete item 𝑋 → 𝑎 .
Theorem 4.1. For each mon-red-automaton, an equiva- of the grammar 𝐺(𝑀 ′ ) and 𝑋 = (𝑎, RED (𝑛), 𝜆, RED (𝑛), 𝜆).
lent prefix-correct mon-red-automata can be constructed. We define the transition function as follows:

Proof. Suppose that 𝑀 = (Σ, «, », 𝑆, 𝑠0 , 𝐹 , 𝑓 ) is a mon-                             RED (𝑛), if 𝐼 (𝑢𝑎) = {𝑋 → 𝑎 .} and
                                                                                      ⎧
red-automaton. First, we construct an LR(0) grammar                                   ⎪          𝑋 = (𝑎, RED (𝑛), 𝜆, RED (𝑛), 𝜆)
𝐺𝑀 . Next, we construct a reducing automaton 𝑀 ′ =                                    ⎪          for some 𝑛,
(Σ, «, », 𝑆 ′ , 𝑠0′ , 𝐹 ′ , 𝑓 ′ ) based on the grammar 𝐺𝑀 . Finally,                  ⎪
                                                                                      ⎪ACC ,     if 𝑎 = » and
we show that 𝑀 ′ is monotone, prefix-correct, and equiv-
                                                                     𝑓 ′ (𝐼 (𝑢), 𝑎) =            𝐼 (𝑢») = {𝑋 → » .} and
alent to the original reducing automaton 𝑀.                                           ⎨
   Construction of the reducing automaton 𝑀 resembles  ′                              ⎪          𝑋 = (», ACC , 𝜆, ACC , 𝜆),
                                                                                      ⎪
a construction of an item automaton for the grammar                                   ⎪ERR ,     if 𝐼 (𝑢𝑎) = ∅,
𝐺𝑀 . Transition states of the automaton 𝑀 ′ are defined                               ⎪
as sets of items of this grammar, in the same way as in                               ⎩𝐼 (𝑢𝑎),   otherwise.
case of an item automaton. However, we are interested
in those grammar rules only that can be used in the first                Equivalence of automata 𝑀 ′ and 𝑀. Apparently,
nonterminal path of 𝐺𝑀 ’s derivation tree. While the                 «𝐿(𝑀)»      = 𝐿(𝐺𝑀 ). We show that
grammar 𝐺𝑀 contains rules of the following types                                            𝐿(𝐺𝑀 ) = «𝐿(𝑀 ′ )».
         𝑋 → 𝑎𝑌𝑍            𝑋 → 𝑎𝑌            𝑋→𝑎                Suppose that «𝑤» ∈ 𝐿(𝐺𝑀 ) and 𝑇 is the derivation tree
         𝑋 → 𝑌𝑍             𝑋→𝑌               𝑋→𝜆                of the grammar 𝐺𝑀 , which gives «𝑤». By induction on
the number 𝑝 of maximal nonterminal paths in the tree            the left column from the sets 𝐼 («), …, 𝐼 («𝑢𝑣) and then use
𝑇, we prove that 𝑤 ∈ 𝐿(𝑀 ′ ).                                    their rules in the derivation listed in the right column:
   If the tree 𝑇 contains only one maximal nonter-
minal path, then 𝐼 («𝑤») = {𝑋 → » . } and 𝑋 =                           𝑆 ⇒ « . 𝑆1 𝛾1        𝑆 ⇒ « 𝑆1 𝛾 1
(», ACC , 𝜆, ACC , 𝜆), so 𝛿 ′∗ (𝐼 («), 𝑤») = ACC and thus 𝑤 ∈         𝑆1 → 𝑎1 . 𝑆2 𝛾2          ⇒ « 𝑎1 𝑆2 𝛾2 𝛾1
𝐿(𝑀 ′ ).                                                                   ⋮                    ⋮
   Assume that the statement holds for all derivation trees
of the grammar 𝐺𝑀 with at most 𝑝 maximal nonterminal                𝑆𝑘−1 → 𝑎𝑘−1 . 𝑆𝑘 𝛾𝑘        ⇒ « 𝑎1 … 𝑎𝑘−1 𝑆𝑘 𝛾𝑘 … 𝛾2 𝛾1
paths. We then prove that it also holds for derivation                 𝑆𝑘 → 𝑎𝑘 . 𝑆𝑘+1 𝑋0       ⇒ « 𝑎1 … 𝑎𝑘−1 𝑎𝑘 𝑆𝑘+1 𝑋0 𝛾
trees with 𝑝 + 1 maximal nonterminal paths. If 𝑇 is a               𝑆𝑘+1 → 𝑎𝑘+1 . 𝑆𝑘+2         ⇒ « 𝑢 𝑎𝑘+1 𝑆𝑘+2 𝑋0 𝛾
derivation tree with 𝑝 + 1 maximal nonterminal paths,
                                                                           ⋮                    ⋮
which gives the word «𝑤», then by Lemma 3.3, 𝑅(𝑇 ) is
a tree with 𝑝 maximal nonterminal paths that yields the                𝑆𝑙 → 𝑎 𝑙 .              ⇒ « 𝑢 𝑎𝑘+1 … 𝑎𝑙−1 𝑎𝑙 𝑋0 𝛾 ,
word «𝑤 ′ » such that «𝑤» ⇒𝑀 «𝑤 ′ ». By the induction
assumption 𝑤 ′ ∈ 𝐿(𝑀 ′ ). Thus, 𝛿 ∗ (𝑠0 , 𝑢𝑎) = RED (𝑛) for      where 𝛾 = 𝛾𝑘 … 𝛾1 , 𝑢 = 𝑎1 … 𝑎𝑘 , and 𝑣 = 𝑎𝑘+1 … 𝑎𝑙 . Since
some prefix «𝑢𝑎 of «𝑤», where 𝐼 («𝑢𝑎) = {𝑋 → 𝑎 . } and           𝐺𝑀 is reduced, some terminal string can be derived from
𝑋 = (𝑎, RED (𝑛), 𝜆, RED (𝑛), 𝜆). So 𝛿 ′∗ (𝐼 («𝑢), 𝑎) = RED (𝑛)   𝑋0 . Suppose that 𝑣/𝑛 = 𝑏1 … 𝑏𝑚 . From the definition of
and «𝑤» ⇒𝑀 ′ «𝑤 ′ ». Therefore 𝑤 ∈ 𝐿(𝑀 ′ ).                      the grammar 𝐺𝑀 , it follows that some of its nonterminals
   We can prove the reverse inclusion («𝐿(𝑀 ′ )» ⊆ 𝐿(𝐺𝑀 ))       𝑋1 , 𝑋2 , …, 𝑋𝑚 contain the symbols 𝑏1 , 𝑏2 , …, 𝑏𝑚 in their
in a similar way by induction on the number of stages            first component, and the grammar contains the following
of the reduction analysis of «𝑤» by the automaton 𝑀 ′            rules
using Lemma 3.2.
                                                                           𝑆𝑘 → 𝑎𝑘 𝑆𝑘+1 𝑋0            𝑆𝑘 = 𝑋0𝜆
   Prefix correctness. Suppose first that the automaton
𝑀 ′ moves from the initial state 𝐼 («) over the prefix 𝑢 ∈ Σ∗             𝑋0 → 𝑋1 𝛾1′                𝑋0𝜆 → 𝑎𝑘 𝑋1𝜆 𝛾1′
of the word 𝑤» to the state 𝐼 («𝑢), formally                              𝑋1 → 𝑋2 𝛾2′                𝑋1𝜆 → 𝑏1 𝑋2𝜆 𝛾2′
                      𝛿 ′∗ (𝐼 («), 𝑢) = 𝐼 («𝑢).                           𝑋2 → 𝑋3 𝛾3′                𝑋2𝜆 → 𝑏2 𝑋3𝜆 𝛾3′
                                                                                ⋮                           ⋮
This means that the items listed below in the left column
can be selected from the sets 𝐼 («), …, 𝐼 («𝑢) and used in                𝑋𝑚   → 𝛾′                  𝑋𝑚𝜆 → 𝑏𝑚 𝛾 ′
the derivation by 𝐺𝑀 listed in the right column
                                                                 The equality of the nonterminals 𝑆𝑘 and 𝑋0𝜆 together with
   𝑆 ⇒ « . 𝑆1 𝛾1           𝑆 ⇒ « 𝑆1 𝛾 1                          the existence of the rules in the second column follows
  𝑆1 → 𝑎 1 . 𝑆2 𝛾 2           ⇒ « 𝑎1 𝑆2 𝛾2 𝛾1                    from the application of implications (3), (4), and (5). These
       ⋮                       ⋮                                 rules allow the automaton 𝑀 ′ to move from the state
                                                                 𝐼 («𝑢) over the word 𝑣/𝑛 = 𝑏1 𝑏2 … 𝑏𝑚 to the transition
  𝑆𝑘 → 𝑎𝑘 . 𝑆𝑘+1 𝛾𝑘+1         ⇒ « 𝑎1 … 𝑎𝑘 𝑆𝑘+1 𝛾𝑘+1 𝛾𝑘 … 𝛾2 𝛾1   state 𝐼 («𝑢𝑣/𝑛) or to ACC or to some reducing state. Thus
                                                                 the reducing automaton 𝑀 ′ is monotone.
where 𝑎1 … 𝑎𝑘 = 𝑢. Since the grammar 𝐺𝑀 is reduced, all
nonterminals in 𝑆𝑘+1 𝛾𝑘+1 𝛾 can be rewritten into some
terminal strings, so that 𝑢 is a prefix of some word from        4.2. State minimality
𝐿(𝑀).
                                                                 We show how to minimize the set of transition, final, and
   Another possibility is that 𝛿 ′∗ (𝐼 («), 𝑢) = RED (𝑛) for
                                                                 reducing states of a reducing automaton while preserving
some reduction sequence 𝑛. Then 𝐼 («𝑢) = {𝑋 → 𝑎𝑘 .},
                                                                 reduction analysis and, moreover, visited working list
where 𝑋 = (𝑎𝑘 , RED (𝑛), 𝜆, RED (𝑛), 𝜆). Just replace the last
                                                                 positions in each stage.
item 𝑆𝑘 → 𝑎𝑘 . 𝑆𝑘+1 𝛾𝑘+1 in the list above with the com-
                                                                     Suppose 𝑀 = (Σ, «, », 𝑆, 𝑠0 , 𝐹 , 𝑓 ) is any reducing au-
plete item 𝑆𝑘 → 𝑎𝑘 . to get the derivation 𝑆𝑘 ⇒∗𝐺𝑀 « 𝑢 𝛾. If
                                                                 tomaton and 𝑠 ∈ 𝑆 ∪ 𝐹. We call the state 𝑠 reachable if
𝑎𝑘 ≠ », then 𝑢 is a prefix of some word of the language
                                                                 𝛿 ∗ (𝑠0 , 𝑤) = 𝑠 for some 𝑤 ∈ Σ∗ ⋅ {𝜆, »}. A state that is not
𝐿(𝑀). If 𝑎𝑘 = », then 𝛾 ⇒∗𝐺𝑀 𝜆 (otherwise the grammar
                                                                 reachable is called unreachable.
𝐺𝑀 would generate a word outside «Σ∗ ») and 𝑢 ∈ 𝐿(𝑀)».
   Monotony. Suppose the automaton 𝑀 ′ moves from                Theorem 4.2. An equvalent reducing automaton with
the state 𝐼 («) via the prefix 𝑢𝑣 of the word 𝑤» to the state    only reachable states can be constructed to any reducing
RED (𝑛) ∈ 𝐹 ′ , |𝑣| = |𝑛|, and that the word 𝑢 consists of       automaton.
the symbols 𝑎1 , …, 𝑎𝑘 ∈ Σ and the word 𝑣 consists of the
symbols 𝑎𝑘+1 , …, 𝑎𝑙−1 ∈ Σ, 𝑎𝑙 ∈ Σ ∪ {»}. As in the proof of     Proof. A reducing automaton can be viewed as a finite-
prefix correctness, we can now select the items listed in        state machine with an alphabet Σ ∪ {»}, a set of states
𝑆 ∪ 𝐹 ∪ {RED }, an initial state 𝑠0 , a set of final states 𝐹, and a   output function, marks in the input word whether and
transition function 𝛿. Thus, we can use the construction               where to accept, reject or reduce it (and how to reduce it)
of a finite-state machine containing only reachable states             in the same way as the reducing automaton 𝑀. We then
from the theory of finite-state machines.                              construct its reduct 𝐴′ (with the same behaviour as 𝐴)
                                                                       and finally convert it back to the reducing automaton 𝑀.
   Let 𝑀1            =       (Σ, «, », 𝑆1 , 𝑠1 , 𝐹1 , 𝑓1 ) and 𝑀2 =       The Moore machine 𝐴 = (𝑆𝐴 , 𝑠𝐴 , Σ𝐴 , Γ𝐴 , 𝛿𝐴 , 𝜇𝐴 ),
(Σ, «, », 𝑆2 , 𝑠2 , 𝐹2 , 𝑓2 ) be reducing automata with                where 𝑆𝐴 is a set of states, 𝑠𝐴 is an initial state, Σ𝐴 is
the same input alphabet Σ. We define the relation                      an input alphabet, Γ𝐴 is an output alphabet, 𝛿𝐴 is a tran-
∼ ⊆ 𝑆1 × 𝑆2 in the following way: 𝑠 ∼ 𝑠 ′ , iff for each               sition function, and 𝜇𝐴 is an output function, is defined
word 𝑤 ∈ Σ∗ ⋅ {𝜆, »} from 𝛿1∗ (𝑠, 𝑤) ∈ 𝐹1 or 𝛿2∗ (𝑠 ′ , 𝑤) ∈ 𝐹2 it     in the following way:
follows that 𝛿1∗ (𝑠, 𝑤) = 𝛿2∗ (𝑠 ′ , 𝑤). If 𝑀1 = 𝑀2 , then the
relation ∼ is the equivalence on the set of transition                                    𝑆𝐴 = 𝑆 ∪ 𝐹 ∪ {RED }
states 𝑆1 of the automaton 𝑀1 . We call the transition                                     𝑠𝐴 = 𝑠 0
states, that are in the relation ∼, stage-equivalent. We
                                                                                          Σ𝐴 = Σ ∪ {»}
call reducing automata 𝑀1 and 𝑀2 stage-equivalent, if
their initial states are stage-equivalent (𝑠1 ∼ 𝑠2 ). Two                                 Γ𝐴 = 𝐹 ∪ {RED } ∪ {𝑠0 }
stage-equivalent reducing automata either both accept,                                    𝛿𝐴 = 𝛿
both reject, or both reduce any word, in all three cases
                                                                                                 𝑠 ,       if 𝑠 ∈ 𝑆
at the same place in the working list, and in the case of                              𝜇𝐴 (𝑠) = { 0
reduction also according to the same reduction sequence.                                         𝑠,        if 𝑠 ∈ 𝐹 ∪ {RED }
Thus, any two stage-equivalent reducing automata are                      For the Moore machine 𝐴 defined in this way, we con-
obviously strongly equivalent and vice versa. Since the                struct its reduct 𝐴′ = (𝑆𝐴′ , 𝑠𝐴′ , Σ𝐴′ , Γ𝐴′ , 𝛿𝐴′ , 𝜇𝐴′ ) by the
stage equivalence and strong equivalence name the same                 construction given, for example, in [5].
phenomenon, we henceforth use only the term strong                        Finally, we move from the reduct 𝐴′ back to the re-
equivalence.                                                           ducing automaton 𝑀 ′ = (Σ′ , «, », 𝑆 ′ , 𝑠0′ , 𝐹 ′ , 𝑓 ′ ) defined as
   Let 𝑀 be any reducing automaton. An automaton 𝑀 is                  follows:
state minimal if (i) all its states are reachable, and (ii) no
its different transition states are equivalent. A reducing                            Σ′ = Σ𝐴′ ⧵ {»} = Σ𝐴 ⧵ {»} = Σ
automaton 𝑀 ′ is called a reduct of a reducing automaton                              𝑆 ′ = {𝑠 ∣ 𝜇𝐴′ (𝑠) = 𝑠0 }
𝑀 if (i) 𝑀 ′ is strongly equivalent to 𝑀, and (ii) 𝑀 ′ is state
minimal.                                                                              𝑠0′ = 𝑠𝐴′ = 𝑠𝐴 = 𝑠0
                                                                                      𝐹 ′ = {𝑠 ′ ∈ 𝐹 ∣ ∃𝑠 ∈ 𝑆𝐴′ ∶ 𝑠 ′ = 𝜇𝐴′ (𝑠)}
Theorem 4.3. A reduct can be constructed to any reducing
automaton.                                                                                       𝑠′,       if 𝑠 ′ = 𝛿𝐴′ (𝑠, 𝑎) ∈ 𝑆 ′
                                                                                𝑓 ′ (𝑠, 𝑎) = {
                                                                                                 𝜇𝐴′ (𝑠 ), if 𝑠 ′ = 𝛿𝐴′ (𝑠, 𝑎) ∉ 𝑆 ′
                                                                                                       ′
Proof. Let 𝑀 = (Σ, «, », 𝑆, 𝑠0 , 𝐹 , 𝑓 ) be a reducing automa-
ton. We show how to construct its reduct 𝑀 ′ .                            Now it is not hard to see that the constructed reducing
   We convert the problem of constructing a reduct of                  automaton 𝑀 ′ is a reduct of the automaton 𝑀.
a reducing automaton to the problem of constructing                       The strong equivalence of the reducing automata 𝑀
a reduct of a Moore machine, which is already solved                   and 𝑀 ′ follows from the following facts: 1. The set 𝐹 ′
in the theory of finite-state machines and finite-state                contains exactly all reachable final or reducing states of
transducers.                                                           the reducing automaton 𝑀. 2. The initial state 𝑠𝐴′ of the
   We can look at a reducing automaton as a finite-state               Moore machine 𝐴′ is behaviorally equivalent to the initial
machine extended with the ability to reduce a working                  state 𝑠𝐴 of the Moore machine 𝐴. Thus, for any word
list. The reducing automaton 𝑀 computes a transition                   𝑤 ∈ Σ∗ ⋅ {𝜆, »} and the reachable reducing state RED (𝑛)
function over a word contained in a working list to de-                of the reducing automaton 𝑀, the following equivalence
termine whether and where to accept, reject, or reduce                 holds:
a word, and in the case of reduction, how to reduce it.
                                                                                              𝛿 ′∗ (𝑠0′ , 𝑤) = RED (𝑛) ⟺
Moore machine is also, in principle, a finite-state ma-
                                                                                            ∗
                                                                                      𝜇𝐴′ (𝛿𝐴′ (𝑠𝐴′ , 𝑤)) = RED (𝑛) ⟺
chine augmented by marking the input word. It also
computes a transition function over an input word and                                   𝜇𝐴 (𝛿𝐴∗ (𝑠𝐴 , 𝑤)) = RED (𝑛) ⟺
continuously marks the input word based on its value.                                          𝛿 ∗ (𝑠0 , 𝑤) = RED (𝑛)
The reduct 𝐴′ of Moore machine 𝐴 is a state-minimal
machine that marks any word in the same way as the                     The same equivalences holds if we replace the reducing
original machine 𝐴. So the idea is to design a Moore                   state RED (𝑛) with the accepting or rejecting states ACC or
machine 𝐴 for a reducing automaton 𝑀 which, by its                     ERR .
   The state-minimality of the automaton 𝑀 ′ follows di- References
rectly from the state-minimality of Moore machine 𝐴′
and from the way we obtained the automaton 𝑀 ′ from [1] M. Procházka, Redukční automaty a syntaktické chy-
the machine 𝐴′ . The reducing automaton 𝑀 ′ is therefore         by, Ph.D. thesis, Univerzita Karlova v Praze, fakulta
a reduct of the reducing automaton 𝑀.                            Matematicko-fyzikální, Praha, 2012.
                                                             [2] M. Procházka, M. Plátek, Redukční automaty, mono-
   We say that reducing automata 𝑀 and 𝑀 ′ with the              tonie a redukovanost, ITAT, 2002.
same alphabet Σ are isomorphic if there is a one to one [3] P. Jančar, F. Mráz, M. Plátek, J. Vogel, Restarting
mapping ℎ,                                                       automata, Lecture Notes in Computer Science 965
                    ℎ ∶ 𝑆 ∪ 𝐹 ⟶ 𝑆 ′ ∪ 𝐹 ′,                       (1995) 283–292.
                                                             [4] M. Chytil, Automaty a gramatiky, SNTL Praha,
such that
                                                                 Praha, 1984.
  (i) ℎ(𝑠) = 𝑠 for any 𝑠 ∈ 𝐹;                                [5] E. F. Moore, Gedanken-experiments on sequential
                                                                 machines, Annals of Mathematics studies 34 (1956)
 (ii) 𝛿(𝑠, 𝑎) = 𝑠 ′ ⟺ 𝛿 ′ (ℎ(𝑠), 𝑎) = ℎ(𝑠 ′ ) for any 𝑠 ∈ 𝑆,     129–153.
      𝑎 ∈ Σ ∪ {»}, and 𝑠 ′ ∈ 𝑆 ∪ 𝐹.

Theorem 4.4. Any two reducts of the same reducing au-
tomaton are isomorphic.

  This theorem can obviously be proved by modifying
the proof of the same theorem for Moore machines.

Corollary 4.1. No reducing automaton has fewer states
than its reduct.

   As the reduct is strongly equivalent to the reducing
automaton for which it is constructed, the construction
retains both monotony and prefix-correctness. So, for
any monotone reducing automaton we can construct an
equivalent monotone reducing automaton, which is both
prefix-correct and state-minimal.


5. Conclusion
We introduced the reducing automaton and described
two of its normalizations. Further, we plan to pro-
pose a construction of an equivalent mon-red-automaton
with only repeatable reductions. A reduction is repeat-
able if «𝑥𝑢𝑦𝑣𝑧» ⇒𝑀 𝑥𝑦𝑧 implies «𝑥𝑢 𝑛+1 𝑦𝑣 𝑛+1 𝑧» ⇒𝑀
«𝑥𝑢 𝑛 𝑦𝑣 𝑛 𝑧» for each 𝑛 ∈ ℕ0 . Further, we would like to
distinguish repeatable reductions reducing regular con-
texts (either 𝑢 or 𝑣 is empty) and linear contexts (neither
𝑢 nor 𝑣 is empty) and normalize a monotone reducing
automaton so that it reduces regular contexts from the
right.


Acknowledgments
This paper was created as part of the Parsing and Syn-
tactic Analysis seminar led by František Mráz and Martin
Plátek at the Faculty of Mathematics and Physics of Charles
University in Prague.