=Paper= {{Paper |id=Vol-1231/long17 |storemode=property |title=Structural complexity of multi-valued partial functions computed by nondeterministic pushdown automata |pdfUrl=https://ceur-ws.org/Vol-1231/long17.pdf |volume=Vol-1231 |dblpUrl=https://dblp.org/rec/conf/ictcs/Yamakami14 }} ==Structural complexity of multi-valued partial functions computed by nondeterministic pushdown automata== https://ceur-ws.org/Vol-1231/long17.pdf
    Structural complexity of multi-valued partial
      functions computed by nondeterministic
                pushdown automata
                           (Extended Abstract)

                               Tomoyuki Yamakami

              Department of Information Science, University of Fukui
                     3-9-1 Bunkyo, Fukui 910-8507, Japan



      Abstract. This paper continues a systematic and comprehensive study
      on the structural properties of CFL functions, which are in general multi-
      valued partial functions computed by one-way one-head nondeterministic
      pushdown automata equipped with write-only output tapes (or push-
      down transducers), where CFL refers to a relevance to context-free lan-
      guages. The CFL functions tend to behave quite differently from their
      corresponding context-free languages. We extensively discuss contain-
      ments, separations, and refinements among various classes of functions
      obtained from the CFL functions by applying Boolean operations, func-
      tional composition, many-one relativization, and Turing relativization.
      In particular, Turing relativization helps construct a hierarchy over the
      class of CFL functions. We also analyze the computational complexity
      of optimization functions, which are to find optimal values of CFL func-
      tions, and discuss their relationships to the associated languages.

      Keywords: multi-valued partial function, oracle, Boolean operation, re-
      finement, many-one relativization, Turing relativization, CFL hierarchy,
      optimization, pushdown automaton, context-free language


1   Much Ado about Functions

In a traditional field of formal languages and automata, we have dealt primarily
with languages because of their practical applications to, for example, a parsing
analysis of programming languages. Most fundamental languages listed in many
undergraduate textbooks are, unarguably, regular (or rational) languages and
context-free (or algebraic) languages.
    Opposed to the recognition of languages, translation of words, for exam-
ple, requires a mechanism of transforming input words to output words. Aho
et al. [1] studied machines that produce words on output tapes while reading
symbols on an input tape. Mappings on strings (or word relations) that are real-
ized by such machines are known as transductions. Since languages are regarded,
from an integrated viewpoint, as Boolean-valued (i.e., {0, 1}-valued) total func-
tions, it seems more essential to study the behaviors of those functions. This


                                       225
T.Yamakami. Structural complexity of multi-valued partial functions

  task is, however, quite challenging, because these functions often demand quite
  different concepts, technical tools, and proof arguments, compared to those for
  languages. When underlying machines are particularly nondeterministic, they
  may produce numerous distinct output values (including the case of no output
  values). Mappings realized by such machines become, in general, multi-valued
  partial functions transforming each admissible string to a certain finite (possibly
  empty) set of strings.
     Based on a model of polynomial-time nondeterministic Turing machine, com-
  putational complexity theory has long discussed the structural complexity of var-
  ious NP function classes, including NPMV, NPSV, and NPSVt (where MV and
  SV respectively stand for “multi-valued” and “single-valued” and the subscript
  “t” does for “total”). See, e.g., a survey [14].
      Within a scope of formal languages and automata, there is also rich litera-
  ture concerning the behaviors of nondeterministic finite automata equipped with
  write-only output tapes (known as rational transducers) and properties of as-
  sociated multi-valued partial functions (known also as rational transductions).
  Significant efforts have been made over the years to understand the functional-
  ity of such functions. A well-known field of functions include “CFL functions,”
  which were formally discussed in 1963 by Evey [4] and Fisher [6] and general
  properties have been since then discussed in, e.g., [3, 10]. CFL functions are gen-
  erally computed by one-way one-head nondeterministic pushdown automata (or
  npda’s, in short) equipped with write-only output tapes. For example, the function
  P ALsub (w) = {x ∈ {0, 1}∗ | ∃u, v [w = uxv], x = xR } for every w ∈ {0, 1}∗ is a
  CFL function, where xR is x in reverse. As subclasses of CFL functions, three
  fundamental function classes CFLMV, CFLSV, and CFLSVt were recognized
  explicitly in [19] and further explored in [20].
      In recent literature, fascinating structural properties of CFL functions have
  been extensively investigated. Konstantinidis et al. [10] took an algebraic ap-
  proach toward the characterization of CFL functions. In relation to cryptogra-
  phy, it was shown that there exists a pseudorandom generator in CFLSVt that
  “fools” every language in a non-uniform (or an advised) version of REG [19].
  Another function class CFLMV(2) in [20] contains pseudorandom generators
  against a non-uniform analogue of CFL. The behaviors of functions in those func-
  tion classes seem to look quite different from what we have known for context-free
  languages. For instance, the single-valued total function class CFLSV can be seen
  as a functional extension of the language family CFL ∩ co-CFL rather than CFL
  [20]. In stark contrast with a tidy theory of NP functions, circumstances that
  surround CFL functions differ significantly because of mechanical constraints
  (e.g., a use of stacks, one-way moves of tape heads, and λ-moves) that har-
  ness the behaviors of underlying npda’s with output tapes. One such example is
  concerning a notion of refinement [13] (or uniformization [10]). Unlike language
  families, a containment between two multi-valued partial functions is customar-
  ily replaced by refinement. Konstantinidis et al. [10] posed a basic question of
  whether every function in CFLMV has a refinement in CFLSV. This question


                                         226
T.Yamakami. Structural complexity of multi-valued partial functions

  was lately solved negatively [22] and this result clearly contrasts a situation that
  a similar relationship is not known to hold between NPMV and NPSV.
      Amazingly, there still remains a vast range of structural properties that await
  for further investigation. Thus, we wish to continue a coherent and systematic
  study on the structural behaviors of the aforementioned CFL functions. This
  paper aims at exploring fundamental relationships (such as, containment, sep-
  aration, and refinement) among the above function classes and their natural
  extensions via four typical operations: (i) Boolean operations, (ii) functional
  composition, (iii) many-one relativization, and (iv) Turing relativization. The
  last two operations are a natural generalization of many-one and Turing CFL-
  reducibilities among languages [21]. We use the Turing relativization to introduce
  a hierarchy of function classes ΣkCFL MV, ΠkCFL MV, and ΣkCFL SV for each level
  k ≥ 1, in which the first Σ-level classes coincide with CFLMV and CFLSV,
  respectively. We show that all functions in this hierarchy have linear space-
  complexity. With regard to refinement, we show that, if the CFL hierarchy of
                                                          CFL
  [21] collapses to the kth level, every function in Σk+1     MV has a refinement in
    CFL
  Σk+1 SV for every index k ≥ 2.
      Every nondeterministic machine with an output tape can be naturally seen as
  a process of generating a set of feasible “solutions” of each instance. Among those
  solutions, it is useful to study the complexity of finding “optimal” solutions. This
  gives rise to optimization functions. Earlier, Krentel [11] discussed properties of
  OptP that is composed of polynomial-time optimization functions. Here, we are
  focused on similar functions induced by npda’s with output tapes. We denote
  by OptCFL a class of those optimization CFL functions. This function class is
  proven to be located between CFLSVt and Σ4CFL SVt . Moreover, we show the
  class separation between CFLSVt and OptCFL.
      To see a role of functions during a process of recognizing languages, Köbler
  and Thierauf [9] introduced a //-advice operator by generalizing the Karp-Lipton
  /-advice operator, and they argued the computational complexity of languages
  in P//F and NP//F induced by any given function class F. Likewise, we discuss
  the complexity of REG//F and CFL//F when F are various subclasses of CFL
  functions, particularly CFLSVt and OptCFL.
      All omitted or abridged proofs, because of the page limit, will appear shortly
  in a complete version of this paper.

  2    A Starting Point
  Formal Languages. Let N be the set of all natural numbers (i.e., nonnegative
  integers) and set N+ = N − {0}. Throughout this paper, we use the term “ poly-
  nomial” to mean polynomials on N with nonnegative coefficients. In particular, a
  linear polynomial is of the form ax + b with a, b ∈ N. The notation A − B for two
  sets A and B indicates the set difference {x | x ∈ A, x 6∈ B}. Given any set A,
  P(A) denotes the power set of A. A set Σ k (resp., Σ ≤k ), where kS∈ N, consists
  only of strings of length k (resp., at most k). Here, we set Σ ∗ = k∈N Σ k . The
  empty string is always denoted λ. Given any language A over Σ, its complement
  is Σ ∗ − A, which is also denoted by A as long as Σ is clear from the context.


                                         227
T.Yamakami. Structural complexity of multi-valued partial functions

        We adopt a track notation [xy] from [15]. For two symbols σ and τ , [στ] expresses
  a new symbol. For two strings x = x1 x2 · · · xn and y = y1 y2 · · · yn of length n,
  [ xy] denotes a string [ xy11 ][ xy22 ] · · · [ xynn ]. Whenever |x| =
                                                                       6 |y|, we follow a convention
                                                                               m
  introduced in [15]: if |x| < |y|, then [ xy] actually means [ x#y ], where m = |y| − |x|
  and # is a designated new symbol. Similarly, when |x| > |y|, the notation [ xy]
  expresses [ y#xm ] with m = |x| − |y|.
        As our basic computation model, we use one-way one-head nondeterministic
  pushdown automata (or npda’s, in short) allowing λ-moves (or λ-transitions)
  of their input-tape heads. The notations REG and CFL stand for the families
  of all regular languages and of all context-free languages, respectively. For each
  number k ∈ N+ , the k-conjunctive closure of CFL, denoted by CFL(k), is defined
           Tk
  to be { i=1 Ai | A1 , A2 , . . . , Ak ∈ CFL} (see, e.g., [19]).
        Given any language A (used as an oracle), CFLA                  T (or CFLT (A)) expresses a
  collection of all languages recognized by npda’s equipped with write-only query
  tapes with which the npda’s make non-adaptive oracle queries to A, provided
  that all computation paths of the npda’s must terminate in O(n) steps no
  matter what oracle is used [21]. We use S                   the notation CFLCT (or CFLT (C)) for
  language family C to denote the union A∈C CFLA                        T . Its deterministic version
                              C
  is expressed as DCFLT . The CFL hierarchy {∆k , ΣkCFL , ΠkCFL | k ∈ N+ }
                                                                      CFL

  is composed of classes ∆CFL         1          = DCFL, Σ1CFL = CFL, ΠkCFL = co-ΣkCFL ,
      CFL               CFL                      CFL
  ∆k+1 = DCFLT (Πk ), and Σk+1                            = CFLT (ΠkCFL ) for each index k ≥ 1 [21].

  Functions and Refinement. Our terminology associated with multi-valued
  partial functions closely follows the standard terminology in computational com-
  plexity theory. Throughout this paper, we adopt the following convention: the
  generic term “function” always refers to “multi-valued partial function,” pro-
  vided that single-valued functions are viewed as a special case of multi-valued
  functions and, moreover, partial functions include total functions. We are in-
  terested in multi-valued partial functions mapping1 Σ ∗ to P(Γ ∗ ) for certain
  alphabets Σ and Γ . When f is single-valued, we often write f (x) = y instead
  of y ∈ f (x). Associated with f , dom(f ) denotes the domain of f , defined to be
  {x ∈ Σ ∗ | f (x) 6= Ø}. If x 6∈ dom(x), then f (x) is said to be undefined. The
  range ran(f ) of f is a set {y ∈ Γ ∗ | f −1 (y) 6= Ø}.
      For any language A, the characteristic function for A, denoted by χA , is a
  function defined as χA (x) = 1 if x ∈ A and χA (x) = 0 otherwise. We also use a
  quasi-characteristic function ηA , which is defined as ηA (x) = 1 for any string x
  in A and ηA (x) is not defined for all the other strings x.
      Concerning all function classes discussed in this paper, it is natural to concen-
  trate only on functions whose outcomes are bounded in size by fixed polynomials.
  More precisely, a multi-valued partial function f : Σ ∗ → P(Γ ∗ ) is called poly-
  nomially bounded (resp., linearly bounded) if there exists a polynomial (resp., a
  linear polynomial) p such that, for any two strings x ∈ dom(f ) and y ∈ Γ ∗ , if
   1
       To describe a multi-valued partial function f , the expression “f : Σ ∗ → Γ ∗ ” is cus-
       tomarily used in the literature. However, the current expression “f : Σ ∗ → P(Γ ∗ )”
       matches a fact that the outcome of f on each input string in Σ ∗ is a subset of Γ ∗ .



                                                228
T.Yamakami. Structural complexity of multi-valued partial functions

  y ∈ f (x) then |y| ≤ p(|x|) holds. In this paper, we understand that all function
  classes are made of polynomially-bounded functions. Given two alphabets Σ and
  Γ , a function f : Σ ∗ → P(Γ ∗ ) is called length preserving if, for any x ∈ Σ ∗ and
  y ∈ Γ ∗ , y ∈ f (x) implies |x| = |y|.
       Whenever we refer to a write-only tape, we always assume that (i) initially,
  all cells of the tape are blank, (ii) a tape head starts at the so-called start cell,
  (iii) the tape head steps forward whenever it writes down any non-blank symbol,
  and (iv) the tape head can stay still only in a blank cell. An output (outcome
  or output string) along a computation path is a string produced on the output
  tape after the computation path is terminated. We call an output string valid
  (or legitimate) if it is produced along a certain accepting computation path.
      To describe npda’s, we take the following specific definition. For any given
  function f : Σ ∗ → P(Γ ∗ ), an npda N equipped with a one-way read-only in-
  put tape and a write-only output tape that computes f must have the form
  (Q, Σ, {|c, $}, Θ, Γ, δ, q0 , Z0 , Qacc , Qrej ), where Q is a set of inner states, Θ is a
  stack alphabet, q0 (∈ Q) is the initial state, Z0 (∈ Θ) is the stack’s bottom
  marker, and δ : (Q − Qhalt ) × (Σ̌ ∪ {λ}) × Θ → P(Q × Θ∗ × (Γ ∪ {λ})) is a
  transition function, where Qhalt = Qacc ∪ Qrej , Σ̌ = Σ ∪ {|c, $}, and c| and $ are
  respectively left and right endmarkers. It is important to note that, in accor-
  dance with the basic setting of [21], we consider only npda’s whose computation
  paths are all terminate (i.e., reach halting states) in O(n) steps,2 where n refers
  to input size. We refer to this specific condition regarding execution time as the
  termination condition.
       A function class CFLMV is composed of all (multi-valued partial) functions
  f , each of which maps Σ ∗ to P(Γ ∗ ) for certain alphabets Σ and Γ and there
  exists an npda N with a one-way read-only input tape and a write-only output
  tape such that, for every input x ∈ Σ ∗ , f (x) is a set of all valid outcomes of
  N on the input x. The termination condition imposed on our npda’s obviously
  leads to an anticipated containment CFLMV ⊆ NPMV. Another class CFLSV is
  a subclass of CFLMV consisting of single-valued partial functions. In addition,
  CFLMVt and CFLSVt are respectively restrictions of CFLMV and CFLSV onto
  total functions. Those function classes were discussed earlier in [19].
      An important concept associated with multi-valued partial functions is re-
  finement [13]. This concept (denoted by vref ) is more suitable to use than set
  containment (⊆). Given two multi-valued partial functions f and g, we say that
  f is a refinement of g, denoted by g vref f , if (1) dom(f ) = dom(g) and (2)
  for every x, f (x) ⊆ g(x) (as a set inclusion). We also say that g is refined by f .
  Given two sets F and G of functions, G vref F if every function g in G can be
  refined by a certain function f in F. When f is additionally single-valued, we
  call f a single-valued refinement of g.


   2
       If no execution time bound is imposed, a function computed by an npda that non-
       deterministically produces every binary string on its output tape for each input
       becomes a valid CFL function; however, this function is no longer an NP function.



                                           229
T.Yamakami. Structural complexity of multi-valued partial functions

  3    Basic Operations for Function Classes
  Let us discuss our theme of the structural complexity of various classes of (multi-
  valued partial) functions by exploring fundamental relationships among those
  function classes. In the course of our discussion, we will unearth an exquisite
  nature of CFL functions, which looks quite different from that of NP functions.
       We begin with demonstrating basic features of CFL functions. First, let us
  present close relationships between CFL functions and context-free languages.
  Notice that, for any function f in CFLMV, both dom(f ) and ran(f ) belong
  to CFL. It is useful to recall from [21] a notion of \-extension. Assuming that
  \ 6∈ Σ, a \-extension of a given string x ∈ Σ ∗ is a string x̃ over Σ ∪ {\} satisfying
  the following requirement: x is obtained directly from x̃ simply by removing all
  occurrences of \ in x̃. For example, if x = 01101, then x̃ may be 01\1\01 or
  \0110\\1. The next lemma resembles Nivat’s representation theorem for rational
  transductions (see, e.g., [10, Theorem 2]).

  Lemma 1. For any function f ∈ CFLMV, there exist a language A ∈ CFL
  and a linear polynomial p such that, for every pair x and y, y ∈ f (x) iff (i)
  [ x̃ỹ ] ∈ A, (ii) |y| ≤ p(|x|), and (iii) |x̃| = |ỹ| for certain strings x̃ and ỹ, which
  are \-extensions of x and y, respectively.

     An immediate application of Lemma 1 leads to a functional version of the
  well-known pumping lemma [2].

  Lemma 2. (functional pumping lemma for CFLMV) Let Σ and Γ be any two
  alphabets and let f : Σ ∗ → P(Γ ∗ ) be any function in CFLMV. There exist three
  numbers m ∈ N+ and c, d ∈ N that satisfy the following condition. Any string
  w ∈ Σ ∗ with |w| ≥ m and any string s ∈ f (w) are decomposed into w = uvxyz
  and s = abpqr such that (i) |vxy| ≤ m, (ii) |vybq| ≥ 1, (iii) |bq| ≤ cm + d, and
  (iv) abi pq i r ∈ f (uv i xy i z) for any number i ∈ N. In the case where f is further
  length preserving, the following condition also holds: (v) |v| = |b| and |y| = |q|.
  Moreover, (i)–(ii) can be replaced by (i’) |bq| ≥ 1.

      Boolean operations over languages in CFL have been extensively discussed
  in the past literature (e.g., [16, 21]). Similarly, it is possible to consider Boolean
  operations that are directly applicable to functions. In particular, we are focused
  on three typical Boolean operations: union, intersection, and complement. Let us
  define the first two operations. Given two function classes F1 and F2 , let F1 ∧ F2
  (resp., F1 ∨ F2 ) denote a class of all functions f defined as f (x) = f1 (x) ∩ f2 (x)
  (set intersection) (resp., f (x) = f1 (x) ∪ f2 (x), set union) over all inputs x, where
  f1 ∈ F1 and f2 ∈ F2 . Expanding CFLMV(2) in Section 1, we inductively define
  a k-conjunctive function class CFLMV(k) as follows: CFLMV(1) = CFLMV
  and CFLMV(k + 1) = CFLMV(k) ∧ CFLMV for any index k ∈ N+ . Likewise,
  CFLSV(k) is defined using CFLSV instead of CFLMV.

  Proposition 3. Let k, m ≥ 1.
   1. CFLMV(max{k, m}) ⊆ CFLMV(k) ∨ CFLMV(m) ⊆ CFLMV(km).


                                            230
T.Yamakami. Structural complexity of multi-valued partial functions

   2. CFLMV(max{k, m}) ⊆ CFLMV(k) ∧ CFLMV(m) ⊆ CFLMV(k + m).
   3. CFLSV(k) $ CFLSV(k + 1).

      Note that Proposition 3(3) follows indirectly from a result in [12].
      Fenner et al. [5] considered “complements” of NP functions. Likewise, we can
  discuss complements of CFL functions. Let F be any family of functions whose
  output sizes are upper-bounded by certain linear polynomials. A function f is in
  co-F if there are a linear polynomial p, another function g ∈ F, and a constant
  n0 ∈ N such that, for any pair (x, y) with |x| ≥ n0 , y ∈ f (x) iff both |y| ≤ p(|x|)
  and y 6∈ g(x) holds. This condition implies that f (x) = Σ ≤a|x|+b − g(x) (set
  difference) for all x ∈ Σ ≥n0 . The finite portion Σ