=Paper=
{{Paper
|id=Vol-1231/long17
|storemode=property
|title=Structural complexity of multi-valued partial functions computed by nondeterministic pushdown automata
|pdfUrl=https://ceur-ws.org/Vol-1231/long17.pdf
|volume=Vol-1231
|dblpUrl=https://dblp.org/rec/conf/ictcs/Yamakami14
}}
==Structural complexity of multi-valued partial functions computed by nondeterministic pushdown automata==
Structural complexity of multi-valued partial
functions computed by nondeterministic
pushdown automata
(Extended Abstract)
Tomoyuki Yamakami
Department of Information Science, University of Fukui
3-9-1 Bunkyo, Fukui 910-8507, Japan
Abstract. This paper continues a systematic and comprehensive study
on the structural properties of CFL functions, which are in general multi-
valued partial functions computed by one-way one-head nondeterministic
pushdown automata equipped with write-only output tapes (or push-
down transducers), where CFL refers to a relevance to context-free lan-
guages. The CFL functions tend to behave quite differently from their
corresponding context-free languages. We extensively discuss contain-
ments, separations, and refinements among various classes of functions
obtained from the CFL functions by applying Boolean operations, func-
tional composition, many-one relativization, and Turing relativization.
In particular, Turing relativization helps construct a hierarchy over the
class of CFL functions. We also analyze the computational complexity
of optimization functions, which are to find optimal values of CFL func-
tions, and discuss their relationships to the associated languages.
Keywords: multi-valued partial function, oracle, Boolean operation, re-
finement, many-one relativization, Turing relativization, CFL hierarchy,
optimization, pushdown automaton, context-free language
1 Much Ado about Functions
In a traditional field of formal languages and automata, we have dealt primarily
with languages because of their practical applications to, for example, a parsing
analysis of programming languages. Most fundamental languages listed in many
undergraduate textbooks are, unarguably, regular (or rational) languages and
context-free (or algebraic) languages.
Opposed to the recognition of languages, translation of words, for exam-
ple, requires a mechanism of transforming input words to output words. Aho
et al. [1] studied machines that produce words on output tapes while reading
symbols on an input tape. Mappings on strings (or word relations) that are real-
ized by such machines are known as transductions. Since languages are regarded,
from an integrated viewpoint, as Boolean-valued (i.e., {0, 1}-valued) total func-
tions, it seems more essential to study the behaviors of those functions. This
225
T.Yamakami. Structural complexity of multi-valued partial functions
task is, however, quite challenging, because these functions often demand quite
different concepts, technical tools, and proof arguments, compared to those for
languages. When underlying machines are particularly nondeterministic, they
may produce numerous distinct output values (including the case of no output
values). Mappings realized by such machines become, in general, multi-valued
partial functions transforming each admissible string to a certain finite (possibly
empty) set of strings.
Based on a model of polynomial-time nondeterministic Turing machine, com-
putational complexity theory has long discussed the structural complexity of var-
ious NP function classes, including NPMV, NPSV, and NPSVt (where MV and
SV respectively stand for “multi-valued” and “single-valued” and the subscript
“t” does for “total”). See, e.g., a survey [14].
Within a scope of formal languages and automata, there is also rich litera-
ture concerning the behaviors of nondeterministic finite automata equipped with
write-only output tapes (known as rational transducers) and properties of as-
sociated multi-valued partial functions (known also as rational transductions).
Significant efforts have been made over the years to understand the functional-
ity of such functions. A well-known field of functions include “CFL functions,”
which were formally discussed in 1963 by Evey [4] and Fisher [6] and general
properties have been since then discussed in, e.g., [3, 10]. CFL functions are gen-
erally computed by one-way one-head nondeterministic pushdown automata (or
npda’s, in short) equipped with write-only output tapes. For example, the function
P ALsub (w) = {x ∈ {0, 1}∗ | ∃u, v [w = uxv], x = xR } for every w ∈ {0, 1}∗ is a
CFL function, where xR is x in reverse. As subclasses of CFL functions, three
fundamental function classes CFLMV, CFLSV, and CFLSVt were recognized
explicitly in [19] and further explored in [20].
In recent literature, fascinating structural properties of CFL functions have
been extensively investigated. Konstantinidis et al. [10] took an algebraic ap-
proach toward the characterization of CFL functions. In relation to cryptogra-
phy, it was shown that there exists a pseudorandom generator in CFLSVt that
“fools” every language in a non-uniform (or an advised) version of REG [19].
Another function class CFLMV(2) in [20] contains pseudorandom generators
against a non-uniform analogue of CFL. The behaviors of functions in those func-
tion classes seem to look quite different from what we have known for context-free
languages. For instance, the single-valued total function class CFLSV can be seen
as a functional extension of the language family CFL ∩ co-CFL rather than CFL
[20]. In stark contrast with a tidy theory of NP functions, circumstances that
surround CFL functions differ significantly because of mechanical constraints
(e.g., a use of stacks, one-way moves of tape heads, and λ-moves) that har-
ness the behaviors of underlying npda’s with output tapes. One such example is
concerning a notion of refinement [13] (or uniformization [10]). Unlike language
families, a containment between two multi-valued partial functions is customar-
ily replaced by refinement. Konstantinidis et al. [10] posed a basic question of
whether every function in CFLMV has a refinement in CFLSV. This question
226
T.Yamakami. Structural complexity of multi-valued partial functions
was lately solved negatively [22] and this result clearly contrasts a situation that
a similar relationship is not known to hold between NPMV and NPSV.
Amazingly, there still remains a vast range of structural properties that await
for further investigation. Thus, we wish to continue a coherent and systematic
study on the structural behaviors of the aforementioned CFL functions. This
paper aims at exploring fundamental relationships (such as, containment, sep-
aration, and refinement) among the above function classes and their natural
extensions via four typical operations: (i) Boolean operations, (ii) functional
composition, (iii) many-one relativization, and (iv) Turing relativization. The
last two operations are a natural generalization of many-one and Turing CFL-
reducibilities among languages [21]. We use the Turing relativization to introduce
a hierarchy of function classes ΣkCFL MV, ΠkCFL MV, and ΣkCFL SV for each level
k ≥ 1, in which the first Σ-level classes coincide with CFLMV and CFLSV,
respectively. We show that all functions in this hierarchy have linear space-
complexity. With regard to refinement, we show that, if the CFL hierarchy of
CFL
[21] collapses to the kth level, every function in Σk+1 MV has a refinement in
CFL
Σk+1 SV for every index k ≥ 2.
Every nondeterministic machine with an output tape can be naturally seen as
a process of generating a set of feasible “solutions” of each instance. Among those
solutions, it is useful to study the complexity of finding “optimal” solutions. This
gives rise to optimization functions. Earlier, Krentel [11] discussed properties of
OptP that is composed of polynomial-time optimization functions. Here, we are
focused on similar functions induced by npda’s with output tapes. We denote
by OptCFL a class of those optimization CFL functions. This function class is
proven to be located between CFLSVt and Σ4CFL SVt . Moreover, we show the
class separation between CFLSVt and OptCFL.
To see a role of functions during a process of recognizing languages, Köbler
and Thierauf [9] introduced a //-advice operator by generalizing the Karp-Lipton
/-advice operator, and they argued the computational complexity of languages
in P//F and NP//F induced by any given function class F. Likewise, we discuss
the complexity of REG//F and CFL//F when F are various subclasses of CFL
functions, particularly CFLSVt and OptCFL.
All omitted or abridged proofs, because of the page limit, will appear shortly
in a complete version of this paper.
2 A Starting Point
Formal Languages. Let N be the set of all natural numbers (i.e., nonnegative
integers) and set N+ = N − {0}. Throughout this paper, we use the term “ poly-
nomial” to mean polynomials on N with nonnegative coefficients. In particular, a
linear polynomial is of the form ax + b with a, b ∈ N. The notation A − B for two
sets A and B indicates the set difference {x | x ∈ A, x 6∈ B}. Given any set A,
P(A) denotes the power set of A. A set Σ k (resp., Σ ≤k ), where kS∈ N, consists
only of strings of length k (resp., at most k). Here, we set Σ ∗ = k∈N Σ k . The
empty string is always denoted λ. Given any language A over Σ, its complement
is Σ ∗ − A, which is also denoted by A as long as Σ is clear from the context.
227
T.Yamakami. Structural complexity of multi-valued partial functions
We adopt a track notation [xy] from [15]. For two symbols σ and τ , [στ] expresses
a new symbol. For two strings x = x1 x2 · · · xn and y = y1 y2 · · · yn of length n,
[ xy] denotes a string [ xy11 ][ xy22 ] · · · [ xynn ]. Whenever |x| =
6 |y|, we follow a convention
m
introduced in [15]: if |x| < |y|, then [ xy] actually means [ x#y ], where m = |y| − |x|
and # is a designated new symbol. Similarly, when |x| > |y|, the notation [ xy]
expresses [ y#xm ] with m = |x| − |y|.
As our basic computation model, we use one-way one-head nondeterministic
pushdown automata (or npda’s, in short) allowing λ-moves (or λ-transitions)
of their input-tape heads. The notations REG and CFL stand for the families
of all regular languages and of all context-free languages, respectively. For each
number k ∈ N+ , the k-conjunctive closure of CFL, denoted by CFL(k), is defined
Tk
to be { i=1 Ai | A1 , A2 , . . . , Ak ∈ CFL} (see, e.g., [19]).
Given any language A (used as an oracle), CFLA T (or CFLT (A)) expresses a
collection of all languages recognized by npda’s equipped with write-only query
tapes with which the npda’s make non-adaptive oracle queries to A, provided
that all computation paths of the npda’s must terminate in O(n) steps no
matter what oracle is used [21]. We use S the notation CFLCT (or CFLT (C)) for
language family C to denote the union A∈C CFLA T . Its deterministic version
C
is expressed as DCFLT . The CFL hierarchy {∆k , ΣkCFL , ΠkCFL | k ∈ N+ }
CFL
is composed of classes ∆CFL 1 = DCFL, Σ1CFL = CFL, ΠkCFL = co-ΣkCFL ,
CFL CFL CFL
∆k+1 = DCFLT (Πk ), and Σk+1 = CFLT (ΠkCFL ) for each index k ≥ 1 [21].
Functions and Refinement. Our terminology associated with multi-valued
partial functions closely follows the standard terminology in computational com-
plexity theory. Throughout this paper, we adopt the following convention: the
generic term “function” always refers to “multi-valued partial function,” pro-
vided that single-valued functions are viewed as a special case of multi-valued
functions and, moreover, partial functions include total functions. We are in-
terested in multi-valued partial functions mapping1 Σ ∗ to P(Γ ∗ ) for certain
alphabets Σ and Γ . When f is single-valued, we often write f (x) = y instead
of y ∈ f (x). Associated with f , dom(f ) denotes the domain of f , defined to be
{x ∈ Σ ∗ | f (x) 6= Ø}. If x 6∈ dom(x), then f (x) is said to be undefined. The
range ran(f ) of f is a set {y ∈ Γ ∗ | f −1 (y) 6= Ø}.
For any language A, the characteristic function for A, denoted by χA , is a
function defined as χA (x) = 1 if x ∈ A and χA (x) = 0 otherwise. We also use a
quasi-characteristic function ηA , which is defined as ηA (x) = 1 for any string x
in A and ηA (x) is not defined for all the other strings x.
Concerning all function classes discussed in this paper, it is natural to concen-
trate only on functions whose outcomes are bounded in size by fixed polynomials.
More precisely, a multi-valued partial function f : Σ ∗ → P(Γ ∗ ) is called poly-
nomially bounded (resp., linearly bounded) if there exists a polynomial (resp., a
linear polynomial) p such that, for any two strings x ∈ dom(f ) and y ∈ Γ ∗ , if
1
To describe a multi-valued partial function f , the expression “f : Σ ∗ → Γ ∗ ” is cus-
tomarily used in the literature. However, the current expression “f : Σ ∗ → P(Γ ∗ )”
matches a fact that the outcome of f on each input string in Σ ∗ is a subset of Γ ∗ .
228
T.Yamakami. Structural complexity of multi-valued partial functions
y ∈ f (x) then |y| ≤ p(|x|) holds. In this paper, we understand that all function
classes are made of polynomially-bounded functions. Given two alphabets Σ and
Γ , a function f : Σ ∗ → P(Γ ∗ ) is called length preserving if, for any x ∈ Σ ∗ and
y ∈ Γ ∗ , y ∈ f (x) implies |x| = |y|.
Whenever we refer to a write-only tape, we always assume that (i) initially,
all cells of the tape are blank, (ii) a tape head starts at the so-called start cell,
(iii) the tape head steps forward whenever it writes down any non-blank symbol,
and (iv) the tape head can stay still only in a blank cell. An output (outcome
or output string) along a computation path is a string produced on the output
tape after the computation path is terminated. We call an output string valid
(or legitimate) if it is produced along a certain accepting computation path.
To describe npda’s, we take the following specific definition. For any given
function f : Σ ∗ → P(Γ ∗ ), an npda N equipped with a one-way read-only in-
put tape and a write-only output tape that computes f must have the form
(Q, Σ, {|c, $}, Θ, Γ, δ, q0 , Z0 , Qacc , Qrej ), where Q is a set of inner states, Θ is a
stack alphabet, q0 (∈ Q) is the initial state, Z0 (∈ Θ) is the stack’s bottom
marker, and δ : (Q − Qhalt ) × (Σ̌ ∪ {λ}) × Θ → P(Q × Θ∗ × (Γ ∪ {λ})) is a
transition function, where Qhalt = Qacc ∪ Qrej , Σ̌ = Σ ∪ {|c, $}, and c| and $ are
respectively left and right endmarkers. It is important to note that, in accor-
dance with the basic setting of [21], we consider only npda’s whose computation
paths are all terminate (i.e., reach halting states) in O(n) steps,2 where n refers
to input size. We refer to this specific condition regarding execution time as the
termination condition.
A function class CFLMV is composed of all (multi-valued partial) functions
f , each of which maps Σ ∗ to P(Γ ∗ ) for certain alphabets Σ and Γ and there
exists an npda N with a one-way read-only input tape and a write-only output
tape such that, for every input x ∈ Σ ∗ , f (x) is a set of all valid outcomes of
N on the input x. The termination condition imposed on our npda’s obviously
leads to an anticipated containment CFLMV ⊆ NPMV. Another class CFLSV is
a subclass of CFLMV consisting of single-valued partial functions. In addition,
CFLMVt and CFLSVt are respectively restrictions of CFLMV and CFLSV onto
total functions. Those function classes were discussed earlier in [19].
An important concept associated with multi-valued partial functions is re-
finement [13]. This concept (denoted by vref ) is more suitable to use than set
containment (⊆). Given two multi-valued partial functions f and g, we say that
f is a refinement of g, denoted by g vref f , if (1) dom(f ) = dom(g) and (2)
for every x, f (x) ⊆ g(x) (as a set inclusion). We also say that g is refined by f .
Given two sets F and G of functions, G vref F if every function g in G can be
refined by a certain function f in F. When f is additionally single-valued, we
call f a single-valued refinement of g.
2
If no execution time bound is imposed, a function computed by an npda that non-
deterministically produces every binary string on its output tape for each input
becomes a valid CFL function; however, this function is no longer an NP function.
229
T.Yamakami. Structural complexity of multi-valued partial functions
3 Basic Operations for Function Classes
Let us discuss our theme of the structural complexity of various classes of (multi-
valued partial) functions by exploring fundamental relationships among those
function classes. In the course of our discussion, we will unearth an exquisite
nature of CFL functions, which looks quite different from that of NP functions.
We begin with demonstrating basic features of CFL functions. First, let us
present close relationships between CFL functions and context-free languages.
Notice that, for any function f in CFLMV, both dom(f ) and ran(f ) belong
to CFL. It is useful to recall from [21] a notion of \-extension. Assuming that
\ 6∈ Σ, a \-extension of a given string x ∈ Σ ∗ is a string x̃ over Σ ∪ {\} satisfying
the following requirement: x is obtained directly from x̃ simply by removing all
occurrences of \ in x̃. For example, if x = 01101, then x̃ may be 01\1\01 or
\0110\\1. The next lemma resembles Nivat’s representation theorem for rational
transductions (see, e.g., [10, Theorem 2]).
Lemma 1. For any function f ∈ CFLMV, there exist a language A ∈ CFL
and a linear polynomial p such that, for every pair x and y, y ∈ f (x) iff (i)
[ x̃ỹ ] ∈ A, (ii) |y| ≤ p(|x|), and (iii) |x̃| = |ỹ| for certain strings x̃ and ỹ, which
are \-extensions of x and y, respectively.
An immediate application of Lemma 1 leads to a functional version of the
well-known pumping lemma [2].
Lemma 2. (functional pumping lemma for CFLMV) Let Σ and Γ be any two
alphabets and let f : Σ ∗ → P(Γ ∗ ) be any function in CFLMV. There exist three
numbers m ∈ N+ and c, d ∈ N that satisfy the following condition. Any string
w ∈ Σ ∗ with |w| ≥ m and any string s ∈ f (w) are decomposed into w = uvxyz
and s = abpqr such that (i) |vxy| ≤ m, (ii) |vybq| ≥ 1, (iii) |bq| ≤ cm + d, and
(iv) abi pq i r ∈ f (uv i xy i z) for any number i ∈ N. In the case where f is further
length preserving, the following condition also holds: (v) |v| = |b| and |y| = |q|.
Moreover, (i)–(ii) can be replaced by (i’) |bq| ≥ 1.
Boolean operations over languages in CFL have been extensively discussed
in the past literature (e.g., [16, 21]). Similarly, it is possible to consider Boolean
operations that are directly applicable to functions. In particular, we are focused
on three typical Boolean operations: union, intersection, and complement. Let us
define the first two operations. Given two function classes F1 and F2 , let F1 ∧ F2
(resp., F1 ∨ F2 ) denote a class of all functions f defined as f (x) = f1 (x) ∩ f2 (x)
(set intersection) (resp., f (x) = f1 (x) ∪ f2 (x), set union) over all inputs x, where
f1 ∈ F1 and f2 ∈ F2 . Expanding CFLMV(2) in Section 1, we inductively define
a k-conjunctive function class CFLMV(k) as follows: CFLMV(1) = CFLMV
and CFLMV(k + 1) = CFLMV(k) ∧ CFLMV for any index k ∈ N+ . Likewise,
CFLSV(k) is defined using CFLSV instead of CFLMV.
Proposition 3. Let k, m ≥ 1.
1. CFLMV(max{k, m}) ⊆ CFLMV(k) ∨ CFLMV(m) ⊆ CFLMV(km).
230
T.Yamakami. Structural complexity of multi-valued partial functions
2. CFLMV(max{k, m}) ⊆ CFLMV(k) ∧ CFLMV(m) ⊆ CFLMV(k + m).
3. CFLSV(k) $ CFLSV(k + 1).
Note that Proposition 3(3) follows indirectly from a result in [12].
Fenner et al. [5] considered “complements” of NP functions. Likewise, we can
discuss complements of CFL functions. Let F be any family of functions whose
output sizes are upper-bounded by certain linear polynomials. A function f is in
co-F if there are a linear polynomial p, another function g ∈ F, and a constant
n0 ∈ N such that, for any pair (x, y) with |x| ≥ n0 , y ∈ f (x) iff both |y| ≤ p(|x|)
and y 6∈ g(x) holds. This condition implies that f (x) = Σ ≤a|x|+b − g(x) (set
difference) for all x ∈ Σ ≥n0 . The finite portion Σ