<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>It May Be in the Structure, Not the Combinations: Graph Metrics as an Alternative to Statistical Measures in Corpus-Linguistic Research</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Anna Shadrova</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Humboldt University of Berlin Berlin</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <fpage>245</fpage>
      <lpage>278</lpage>
      <abstract>
        <p>The following contribution summarizes a number of problems associated with a methodological focus on statistical measures in corpus linguistics, specifically in the subfield of lexical and lexicosyntactic analysis: the practical impossibility of collecting suficient amounts of data to fully account for all of the factors that are known to interact with linguistic output; the reliance on mathematical assumptions that are generally not met by language data; and the epistemological limitations of considering corpora language samples from a presumed superpopulation versus the evolutionary and non-ergodic nature of language. It then proposes graph metrics as a viable alternative to statistical measures. Rather than comparing groups of factors, graph metrics capture relational aspects of the whole dataset, and unlike inferential statistics, they do not project to a presumed external totality or population. Neither do they rely on assumptions of randomness, independence, stationarity, or ergodicity. They thus avoid many of the concessions to the linguistic model that are inherent to probabilistic models of the lexicon, which in turn may result in an epistemologically sounder operationalization and quantification of corpus data. However, the high level of abstraction inherent in graphs, and especially in applications of</p>
      </abstract>
      <kwd-group>
        <kwd>In</kwd>
        <kwd>Tara Andrews</kwd>
        <kwd>Franziska Diehr</kwd>
        <kwd>Thomas Efer</kwd>
        <kwd>Andreas Kuczera and Joris van Zundert (eds</kwd>
        <kwd>)</kwd>
        <kwd>Graph Technologies in the Humanities - Proceedings 2020</kwd>
        <kwd>published at http</kwd>
        <kwd>//ceur-ws</kwd>
        <kwd>org</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>graph theory, comes with its own caveats. Drawing on the example of
Kobalt, a mid-sized corpus of texts written by learners of German, two
approaches to the application of graph metrics in corpus-linguistic
research are outlined and demonstrated in this paper, including a detailed
discussion of the steps necessary for validation and linguistic
embedding.
1</p>
    </sec>
    <sec id="sec-2">
      <title>Statistics in Quantitative Corpus Linguistics</title>
      <p>
        Quantitative corpus linguistics has been a prolific field of research since the
early 1990s. A number of large, general-purpose corpora, such as the British
National Corpus
        <xref ref-type="bibr" rid="ref8">(BNC World, 2007)</xref>
        and the German Reference Corpus
DeReKo (Leibniz-Institut für Deutsche Sprache, 2019), have been curated
and refined for decades, while many more specific corpora (focusing, for
example, on spoken, historical, task-based, and/or learner language), are
continuously being compiled and developed.
      </p>
      <p>While a wide-range of phenomena are analyzed in linguistics, a
particularly prevalent strand of research in corpus linguistics addresses the
cooccurrence of words and the formation of meaningful larger units that do
not appear to follow higher-order syntactic rules, such as collocations (words
that habitually co-occur despite potentially available alternatives, like strong
cofee vs. powerful cofee ), idioms (word bundles with elements of meaning
that cannot be derived from the words themselves, like hit the road), and
constructions (semi-syntactic elements that are partially lexically specified and
tend to be expandable by analogy, like He worked his way up, She studied her
way to the top, and They sang their way into our hearts).</p>
      <p>
        The major methodological focus of this work has been on frequentist
statistics over words, lexemes (base word forms, such as to be for is, are, were, etc.)
and relatively coarse syntactic categories (part-of-speech tags, such as
preposition, or dependency labels, such as accusative object) that rely on automatic
extraction. The term ‘frequentist’ here refers to the ‘traditional’
understanding of statistics, whereby probability is defined as the convergence to the
limit of an expected relative frequency in an infinite series of experiments. 1
1Bayesian models have not played a role in corpus-linguistic analysis so far. Bayesian
statistics defines probability in a diefrent manner, namely as a range of reasonable belief based
on prior experience. This increases the suitability to the modeling of dynamic systems and
interacting factors. Whether or not this also allows for an improved quantification of
corpus data remains to be seen. Although highly relevant to the methodological development
of corpus linguistics, this question will be set aside for the remainder of this paper in
orFor example, if I toss a fair coin an infinite number of times, the relative
frequency of it landing on either side will approximate 0.5, which is also the
probability value. Corpus linguistics operates within precisely such a
framework, relying on various types of quantitative analysis such as productivity
measures, which look at word distributions and quantify the openness of
slots to accept new members
        <xref ref-type="bibr" rid="ref3 ref95">(Zeldes, 2013; Baayen, 2002)</xref>
        , and lexical
association measures, which investigate conditional probability and
transformations thereof
        <xref ref-type="bibr" rid="ref24 ref25 ref3 ref35 ref36 ref38 ref54 ref57 ref90 ref91">(Baayen, 2002; Gries and Stefanowitsch, 2004; Stefanowitsch
and Gries, 2003, 2005; Gries, 2013; Evert, 2005; Evert et al., 2017)</xref>
        . In recent
years, mixed eefct modeling has also begun to attract attention as a more
sophisticated technique for controlling the many interacting factors in
language data
        <xref ref-type="bibr" rid="ref37 ref56 ref87">(Linck and Cunnings, 2015; Speelman et al., 2018; Gries, 2019)</xref>
        .
      </p>
      <p>In the wake of these conceptualizations of corpora as plausibly
representative samples of language, a dynamic interplay has developed between
amassing larger amounts of data to allow for statistical analysis and an increasing
reliance on automatic modes of data extraction and classification. There
are, however, a number of issues with viewing language through the lens
of stochastics, which can be broken down into three general categories: a)
practical – controlling for all interacting factors tends to greatly limit sample
size as well as linguistic depth; b) mathematical – the lexicon is likely not a
stochastic system; and c) epistemological – corpora frequently do not
qualify as samples that allow inferences to a population, especially those that have
been compiled in elicited or quasi-experimental settings.
1.1</p>
      <sec id="sec-2-1">
        <title>Practical Problems</title>
        <p>A major reason for the reliance of corpus linguistics on surface or
surfacenear forms lies in the dificulty and high cost of in-depth annotation.
Linguistic categories are generally ambiguous, fuzzy-edged, and often only
implicitly represented in surface forms.</p>
        <p>For example, while the grammatical roles of subject and direct object in the
sentence The boy broke the vase are easily extractable – especially in English,
where the position in left adjacency to the finite verb is reserved for the
grammatical subject – the semantic, pragmatic, textual, and intertextual
implications can be much harder to pinpoint. Consider, for example, the sentence
The vase broke, in which the former direct object to the verb break is now the
grammatical subject, while the process of breaking still belongs to the same
object semantically.2
der to allow for a more in-depth discussion of the pitfalls of frequentist approaches and the
potential of graph-based modeling as a structural alternative.</p>
        <p>2This is an example of a so-called unaccusative verb. Features and problems of
categor</p>
        <p>
          Even aspects of limited ambiguity, such as the morphological features of a
word (for example grammatical gender or types of inflection) or the syntactic
structure of a sentence, cannot always be parsed with high accuracy in
languages with flexible word order and a high degree of inflection
          <xref ref-type="bibr" rid="ref85">( Seddah et al.,
2013)</xref>
          . This is particularly true of non-canonical language, i.e. language that
does not follow the explicit and implicit rules of written formal language, for
example spoken or learner language
          <xref ref-type="bibr" rid="ref1 ref14 ref5 ref52 ref74">(Krivanek and Meurers, 2011; Ott and
Ziai, 2010; Choi et al., 2015; Bechet et al., 2014)</xref>
          .
        </p>
        <p>
          In a research setting that works with rich morphology and/or syntactic
lfexibility, untrained, and non-canonical data (as is the case in the analysis
of German learner language), automatic classification and parser
performance generally do not sufice for research purposes. This is even more so
the case for analyses that go beyond surface-near forms, where the accuracy
of automatic parsing of semantic or pragmatic information is generally not
very high. For example,
          <xref ref-type="bibr" rid="ref69">Morey et al. (2018)</xref>
          compare the performance of
various parsers for rhetorical structures, i.e. functional descriptions of discourse
blocks such as example, elaboration, or concession, and show that the most
eefctive parsers only reach accuracy levels of around 50%. This is due to the
fact that meaning is not simply encoded in the individual words themselves,
but emerges from their combination, composition, and contextual
embedding.
        </p>
        <p>Annotation of deeper level information generally requires large amounts
of tedious manual or, at best, semi-automatic annotation. Rather than
being a simple labeling task, most linguistically interesting annotation is a
complex categorization process. It requires the development of guidelines for
ambiguous cases, measurements of inter-annotator agreement, and the
iterative revision of previous annotations. With this amount of manual input,
deep linguistic annotation is generally not feasible for large corpora.</p>
        <p>
          At the same time, a number of recent and ongoing studies
          <xref ref-type="bibr" rid="ref41 ref86">(Hirschmann,
2015; Lüdeling et al., 2017; Shadrova, 2020; Lüdeling et al., 2021)</xref>
          have
shown that the simultaneous consideration of deeper levels of analysis and
several annotation layers in the same study can provide considerable insight
into deep linguistic aspects of corpus data. With analyses of this kind, it is
generally only possible to assure an adequate quality for small to mid-sized
corpora, i.e. several hundred to several thousand texts at most. This
potential is further limited by the fact that many corpora require intensive
preprocessing in the form of multiple tokenization, alignment of token layers
and parallel text, etc. Particularly where spoken or signed data is involved,
transcription and normalization are very resource-intensive, and eefctively
ization are discussed in depth in
          <xref ref-type="bibr" rid="ref54">Kuno and Takami (2004)</xref>
          .
limit the data that is available for analysis to only a few dozen texts. It appears
that for lexical analysis, this upper bound often lies below the lower bound
for the secure application of statistical measures: eefct sizes are too small to
be convincing, and with the high degree of individual and stratified variation,
controlling for all factors splits the data into even smaller subsets, or creates
spurious significance (overfitting) through the inclusion of too many factors
in the model.
        </p>
        <p>There is thus an inevitable trade-of between the depth of linguistic
analysis and corpus size. In other words, large corpora generally underexplore
the potentials of linguistic analysis. Statistical measures were introduced
into corpus linguistics in order to capture gradual eefcts and to distinguish
between true diefrences and random fluctuation – but in reality, a strong
focus on these metrics greatly limits the analytical capacity of linguistic
research. However, as I will argue in the following sections, the problems with
such an approach run deeper.
1.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Mathematical Problems</title>
        <p>
          There are a number of debates centered around statistics in linguistics which
deserve to be mentioned here, but cannot be discussed in detail due to spatial
constraints:
• There appears to be a general lack of understanding of the underlying
distribution of words in corpora. It has been assumed that words are
Zipf-distributed, but recent research suggests that this might not be the
case, and/or that the Zipf distribution is itself an artifact
          <xref ref-type="bibr" rid="ref2 ref80 ref94">(Williams et al.,
2015; Piantadosi, 2014; Aitchison et al., 2016)</xref>
          ;
• Treating text as randomly sampled, i.e. as the outcome of a series of
random experiments, is a simplification ad absurdum. Structured by
its very nature, “language is never, ever, ever random”
          <xref ref-type="bibr" rid="ref49">(Kilgarrif , 2005)</xref>
          ;
• Significance testing is problematic for a number of reasons, but
particularly so in language, since essentially all assumptions about the relation
between population and sample are mathematically unmet
          <xref ref-type="bibr" rid="ref11 ref50 ref83 ref84">(Schmid
and Küchenhof , 2013; Koplenig, 2017)</xref>
          .
        </p>
        <p>Beyond these already grave concerns, there are two even deeper problems
concerning aspects that are intrinsic not only to corpus compilation, but to
the nature of language itself.</p>
        <p>
          First, frequentist statistics is rooted in a concept of probability that
derives overall probabilities from previous observations – it predicts the future
from the past. The bridge between the two is provided by the central limit
theorem, which states that in suficiently large samples, regardless of the
underlying distribution, relative frequencies will reach limits and those limits
can be idealized to probabilities. This is expressed in the assumption that
the overall system will reach expected values if the experiment is repeated ad
infinitum: if a fair coin is tossed a suficient number of times, its idealized
property of being 0.5 heads and 0.5 tails will become defined over time, even
if heads or tails cluster in parts of the series. However, for the central limit
theorem to hold true, the underlying system must be stochastic – it must
have probabilities – which means it must be stationary and ergodic, with
stationarity referring to the property of a system to have stable and unchanging
probabilities. If, for example, the coin is damaged after a few tosses, the
outcome may be skewed and the system is no longer stationary. Language is
obviously not stationary, since it evolves significantly over time. If this was
the whole extent of the problem, then it would sufice to define stationary
subsets of language, e.g. corpora spanning only a decade or less – but in fact,
there are reasons to believe that stationarity is not reached in thematically
diverse corpora even over short periods of time
          <xref ref-type="bibr" rid="ref80">(Piantadosi, 2014)</xref>
          . Moreover,
there are other language-intrinsic features that create discrepancies to the
model, including cognitive, discourse, and inter-speaker dynamics, and,
perhaps most problematically, productivity.
        </p>
        <p>
          Since language is at least in part a cognitive function, it underlies
perceptual biases and cognitive influences. One major factor in this is priming. It
is well-known that speakers are very easy to prime on all linguistic levels –
syntactically
          <xref ref-type="bibr" rid="ref35 ref44 ref57 ref81 ref90 ref91">(Pickering and Branigan, 1998; Gries, 2005; Loebell and Bock,
2003)</xref>
          , lexically
          <xref ref-type="bibr" rid="ref42 ref48 ref76">(Hoey, 2012; Jones and Estes, 2012)</xref>
          , semantically
          <xref ref-type="bibr" rid="ref59 ref65">(Lucas,
2000; McNamara, 2005)</xref>
          , pragmatically
          <xref ref-type="bibr" rid="ref11 ref83 ref84">(Bott and Chemla, 2013)</xref>
          ,
phonetically, and/or phonologically
          <xref ref-type="bibr" rid="ref46 ref60 ref76">(Luce et al., 2000; James and Burke, 2000)</xref>
          . This
holds true for priming from any source: experimental or incidental cues;
selfpriming from elements used by the speaker; and interlocutor priming. Since
primed activations do not fully persist, but subside over time, they facilitate
another feature of natural language that is called burstiness. It refers to the
frequent re-occurrence of elements in dialogue or small parts of a text within
a short period of time, during which their probability to occur leaps to much
higher levels, only to swiftly collapse again to a rate that approximates zero.
Another contributing factor to burstiness is text structure – certain parts of
a story or report mention certain things, which are not picked up again.
        </p>
        <p>
          Priming also contributes to inter-speaker convergence or alignment, a
phenomenon in which the participants in a dialogue adapt to one another in
their linguistic expression. Alignment has been shown to exist in contexts as
diverse as abstract frames of reference (e.g. up/down vs. north/south in map
description tasks, as investigate
          <xref ref-type="bibr" rid="ref19">d by Steels and Loetzsch (2008</xref>
          )) and highly
specific and unconscious processes such as articulatory movement
          <xref ref-type="bibr" rid="ref77">( Pardo,
2006)</xref>
          , underlining the fact that probabilities may increase or decrease in a
specific dialogue not by virtue of any features that are intrinsic to the
linguistic elements themselves, but rather through the intentional or
unintentional influence of the speaker.
        </p>
        <p>A more global factor influencing the frequency of occurrence of linguistic
elements is the ebb and flow of discourse interaction, by which debates gain
momentum and then die down. This can play out within days, hours, or in
the space of a single document, and is inextricably linked to the issue of
intention: speakers do not utter certain words to fix imbalances in the probability
distribution, but rather consciously choose to speak in specific ways and/or
about a specific topic, which in turn increases or decreases the probability
for any given word to occur.</p>
        <p>Lastly, perhaps the strangest eefct is produced by productivity (the
process of coining new words incidentally), and creativity (the intentional
creation of new words). Nouns, verbs, and adjectives are open classes,
meaning new lexemes can be introduced to the system at any time. However, in
stochastic terms, this is the equivalent of rolling dice that keep changing the
number of their sides – with every new word, the relative frequencies for
all words change. Far from being a marginal phenomenon, productivity
is highly prominent in all spoken and written language, and is intrinsically
problematic for the concept of stationarity in language.</p>
        <p>
          The second major issue with respect to the mapping of the linguistic to
the statistical model is ergodicity or path-independence. It is also the second
necessary condition for the upholding of the central limit theorem. In an
ergodic system, no matter which path I take through the system, relative
frequencies for all elements can approximate limits and the overall system will
reach expected values. For example, by rolling a fair die a suficient number
of times, noting down the values, and then averaging over them, I will find
an approximation of the expected value 3.5. No matter which path I take,
whether I roll a 6 seven times in a row and then a 1 seven times in a row,
or any other combination – with suficient repetitions, the system will
converge. In a non-ergodic system, this is not the case: if I roll the die once and
define the final result as “1” for 1, 2, or 3, and “6” for 4, 5, or 6, the
expected value is still 3.5. But this value can never actually be reached, as the path
taken through the system determines the outcome. There is some research
on the mathematics of non-ergodicity in cognitively oriented research fields,
including cognitive neuroscience
          <xref ref-type="bibr" rid="ref29 ref66 ref75">(Medaglia et al., 2011; Franco et al., 2007;
Papo, 2013)</xref>
          and social and developmental theory
          <xref ref-type="bibr" rid="ref55 ref68">(Molenaar, 2008; Lerner,
2012)</xref>
          . For language, however, the problem is rarely discussed, although it
is now beginning to attract more scholarly attention (Lowie an
          <xref ref-type="bibr" rid="ref19">d Verspoor,
2018</xref>
          ;
          <xref ref-type="bibr" rid="ref19">Dębowski, 2018</xref>
          ).
        </p>
        <p>Without ergodicity and/or stationarity, the central limit theorem fails.
This is not a minor inconvenience, but equivalent to the collapse of the single
bridge that spans the space between two tall mountains – without the central
limit theorem, mathematically, there is no connection between prior
observations and a presumed population. There is no reason to assume that words
counted in one corpus hold any precise, probabilistic information about
what will be found in another. In a changing system, the past has limited
capacity to predict the future. While one may still choose to trust corpus
data as empirical evidence, there is little justification to rely on concepts like
p-values or eefct size to safeguard against chance results.
1.3</p>
      </sec>
      <sec id="sec-2-3">
        <title>Epistemological Problems</title>
        <p>The epistemology of statistics over words is unsatisfying for reasons that go
beyond mathematical concerns. Suggesting that the production of words
can be modeled stochastically, i.e. as a distribution of probabilities, entails at
least two philosophical extensions:
1. There is a stochastic system, perhaps in a latent superpopulation3 (as
it is sometimes conceptualized in the social sciences), that produces
unchanging and stochastically deterministic output;
2. Linguistic data collection is capable of capturing samples of this
output.</p>
        <p>
          Both of these extensions are, in fact, rather strange. The first suggests that
while we cannot know when certain things will be said, or which words will
co-occur; the conversations we have are eefctively pre-determined in the
stochastic system. This implies that for all new word forms that are
productively generated, and for all new processes and items in the world that did
not exist until recently and are now named (such as to google or Tesla), the
stochastic system has always held a quantitatively predefined slot. This idea is
utterly alien to linguistics, and is unlikely to be entertained by anyone in the
ifeld. On the contrary: usage-based linguistics metaphorically draws on
systems theory by suggesting that language is a complex, adaptive (i.e. evolving)
system – one that is neither stationary, nor ergodic
          <xref ref-type="bibr" rid="ref22 ref43 ref6 ref64 ref88">(Beckner et al., 2009; Ellis,
2016; Holland et al., 2005; Massip-Bonet, 2013; Steels, 2000, among others)</xref>
          .
In complex adaptive systems, cross-system statistical modeling is pointless,
3The concept of superpopulation refers to a model in which a real population is seen as
a sample of a latent infinite population. For example, one might think of the distribution of
career choices in the population of a country as a sample from a stochastic process. This is
used to allow for inferential statistics where data is eefctively not reproducible (since there is
only one of each country at a given time), but carries the philosophical problem of implying
determinism in dynamic and possibly unique systems.
since things keep changing. Parts of the system may of course be compared,
and compared quantitatively, but this requires an intelligent way to
distinguish between random fluctuation and structural shift.
        </p>
        <p>
          The second extension touches on the problem of the representativeness of
corpus data, and specifically of corpus data that is compiled in what is
sometimes called a quasi-experimental design. For example, in studying learner
language, one may try to control for certain variables such as topic, task,
setting, degree of formality, etc., which typically results in small to
midsized corpora like the learner corpora Kobalt
          <xref ref-type="bibr" rid="ref96">(Zinsmeister et al., 2012)</xref>
          , Falko
(Reznicek et al., 2010; Lü
          <xref ref-type="bibr" rid="ref19">deling et al., 2008</xref>
          ), and the International Corpus
of Learner English
          <xref ref-type="bibr" rid="ref34">(Granger et al., 2009)</xref>
          , or task-based dialogue corpora
such as the Berlin Map Task Corpus (BeMaTaC,
          <xref ref-type="bibr" rid="ref83">Sauer and Lüdeling (2013)</xref>
          ).
Both oefr an excellent way to collect data, which promises to yield crisp and
highly relevant results. However, by performing statistical analysis on this
data, or more precisely by borrowing methods from inferential statistics, via
reliance on the central limit theorem, I do not simply measure what I find
in the data, but rather infer a presumed superpopulation that works as a
stochastic system.
        </p>
        <p>This approach is epistemologically risky in two ways. First, unlike
newspaper archives or historical sources, it does not involve naturally occurring
data that has been collected from existing language – it works with language
that has been intentionally created and that may never have existed without
the researcher’s intervention. This means that the data collection created
the presumed superpopulation from which certainty or eefct size is then
derived, rendering the argument circular. Second, if the amount of data turns
out to be too small for certain types of analysis (as it typically does for
collocations, see section 1.2), one may be tempted to collect more data of the same
kind. Doing so is not the same as taking samples, however. Rather, it means
expanding language in a way in which it probably would not have developed
on its own. Continuing with this method until data sizes of two higher
orders of magnitude are reached – for example by collecting tens of thousands
of texts written by learners of German instead of a few hundred – would:
a) skew the proportions of all documented data of learner German
significantly; b) interfere in the writing development of hundreds of thousands of
learners of German; and c) create a population, rather than sampling from
one.</p>
        <p>Consider for a moment the following analogy: if a naturalist went into
the woods to count black and blue birds belonging to a particular species,
the birds would have existed without their involvement. Provided that the
proper methodology is chosen and applied well, reliable insights into the
features of this population are to be expected. However, if the naturalist began
breeding the birds in order to gain control over data collection, and
accidentally bred green birds never before observed in the wild, no amount of new
data would allow them to infer from their laboratory population to the
original population, as the existence of these birds has permanently changed
the entire species. In other words, in an attempt to create representative
amounts of data (birds), our naturalist will have created irresolvable
pathdependency in their research. The population that could have been inferred
to from the initial sample in the wild is now no longer representative of the
whole species, and thus of little to no use for comparison or significance
testing.</p>
        <p>At first glance, this may seem like a far-fetched and purely theoretical
problem, but the reality of learner language is that most of it is not spoken in the
context of the argumentative essays that are typically found in learner
corpora. Instead, it is found in: a) immigration ofices and areas with high
density immigrant populations; b) middle schools; and c) tourist destinations –
none of which are prone to eliciting essay-length argumentative texts on
controversial topics in a formal written setting. This means that the compilation
of learner corpora as they exist today already interferes in learner language
and influences projections of a latent superpopulation.</p>
        <p>This is not to say that there are no quantifiable patterns or measurable
diefrences between language cohorts. However, it is important to map the
demands of the mathematical model to the subject matter, and it appears
as though frequentist statistical interpretations of lexical distributions oefr
a rather unlinguistic perspective. The same is not necessarily true of other
types of linguistic distributions. More abstract syntactic features like word
order, cases, part-of-speech distributions, determiner systems, and so on, are
much more stable historically, and converge quickly even in data of limited
size. They also exhibit lower variance than word distributions and are greatly
influenced by changes in subcategories, as well as in any of the
complementary categories. For example, phonological changes may influence the
obligatoriness of articles, thereby also influencing the case system, as is the case in
diachronic language change. Using diefrent words and inventing new ones,
on the other hand, does not have the same eefct. It is therefore quite possible
that probabilistic analysis of syntactic aspects in corpora is mathematically
and conceptually valid. For largely lexically oriented analysis, this does not
appear to be the case.</p>
        <p>It should be noted here that criticizing the application of statistical
methods on lexical aspects of corpora is not merely a matter of methodological
taste or belief. If words do not have probabilities, probability-based
measures are meaningless in the same way that it is meaningless to measure the
temperature of a philosophical debate (even if it is heated) or the width of
the history of Europe (even if it is wide-ranged). Validity matters in
scholarly research, and that includes both the internal validity of the
mathematical model, and the mapping of mathematical to subject-specific concepts
and empirical observations.
2</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Graph-Based Modeling in Linguistics</title>
      <p>
        With an abundance of problems around statistical measures in corpus-based
lexical and lexicosyntactic analysis, and with the practical consideration that
reliable measures of smaller data are necessary, graph metrics would appear
to possess intrinsic appeal to corpus linguistics. Yet they are practically
unused. In fact, graph theory in general is underrepresented in
linguistics, despite some early work centered mainly around Markov chain
modeling
        <xref ref-type="bibr" rid="ref12 ref33 ref47">(Goodman, 1961; Jelinek et al., 1975; Brainerd and Chang, 1982)</xref>
        .
While computational linguistics does include graph-based models, they are
largely implemented in engineering ways (i.e. for underlying databases and
algorithms) rather than for the analysis of language itself. In core linguistic
research, graphs are mainly used for the visualization of complete analyses,
most notably in syntax trees, ontologies, and taxonomies. However, graphs
can also be measured in diefrent ways.
      </p>
      <p>
        The first example shows the modeling of lexical information and its
measurement through the metric of Louvain modularity
        <xref ref-type="bibr" rid="ref7">(Blondel et al., 2008)</xref>
        .
The data used for this analysis is taken from the Kobalt corpus
        <xref ref-type="bibr" rid="ref96">(Zinsmeister
et al., 2012)</xref>
        , which consists of 151 topic-controlled argumentative essays
written by learners of German from China and Belarus, and 20 control texts
written by German native speakers. From these texts, graphs based on
corrected syntactic dependency parses were extracted. Verb lexemes and noun,
adjective, and adverb lexemes that depend on those verbs as arguments
(different kinds of objects, subjects, and predicates) were modeled as nodes, and
edges were modeled from the existing dependence (co-occurrence in the
respective syntactic environment of the verb). The full analysis is discussed
in detail in
        <xref ref-type="bibr" rid="ref86">Shadrova (2020)</xref>
        and includes other layers of linguistic analysis,
as well as an in-depth internal validation through a range of sampling
techniques and hyperparameter settings. Figure 1 provides an example of a small
subgraph. The full graph of the Kobalt native speaker subcorpus is
visualized in Figure 2.4
      </p>
      <p>
        In conducting this analysis, the main research question was whether
graphs of this type show diefrent degrees of structuredness for diefrent
stages of foreign language acquisition in learners, as well as between learners
and native speakers. The underlying concept of coselectional constraint
or idiomaticity describes the tendency of native speakers to constrain their
choice in word combinations to a relatively small set out of a potentially
large combinatorial space. For example, the verb chosen for describing the
action of cleaning teeth in English is brush, whereas the German equivalent
is putzen (‘to clean’). While clean teeth would be understood and is
semantically acceptable, the combination is highly unlikely to occur in a text written
by an English native speaker (unless it was describing the act of having one’s
teeth professionally cleaned by a dentist). This particular example would
typically be learned early in both unguided learning and in language classes.
However, there are vast amounts of subtle constraints of this kind, and they
are known to be very dificult to master even at an advanced stage of second
language acquisition
        <xref ref-type="bibr" rid="ref44 ref48 ref76 ref78">(Howarth, 1998; Pawley and Syder, 1983; Paquot and
Granger, 2012, and many others)</xref>
        .
      </p>
      <p>The central hypotheses of the present study are as follows:
1. Graphs are more structured in more vs. less advanced learners;
2. Graphs are more structured in native speakers than learners;
3. Learners show a u-shaped learning development in their coselectional
structuredness.5
4Since these graphs are generally to large and detailed to be legible in print, further
visualizations, alongside the data and scripts for analysis, are available in a separate Zenodo
repository (doi: 10.5281/zenodo.3584091).</p>
      <p>
        5The linguistic background of this is discussed in detail in
        <xref ref-type="bibr" rid="ref86">Shadrova (2020)</xref>
        . Very briefly
put, this hypothesis is rooted in the theoretical premise that learners typically undergo a
process of randomization of structures. First, they learn in chunks during early acquisition
(everyone learns more or less the same in early language classes), and then they acquire and
      </p>
      <p>
        The measure used to capture these eefcts is Louvain modularity
        <xref ref-type="bibr" rid="ref7">( Blondel
et al., 2008)</xref>
        , one of a number of community detection algorithms6 deriving
a value in the range of [-1,1] for the modularity of a graph. A more modular
graph contains strongly structured (i.e. interconnected) communities that
are less connected to the rest of the graph. A less modular graph contains
recombine more vocabulary to succeed in more complex communicative situations. At this
second phase, their language cannot yet be fully restricted in ways aligned with the target
language for lack of experience and influence of the learners’ native language. This phase of
randomization is followed by semantic diefrentiation and lexical and semantic specialization
(people learn more specific language over time). This decrease in accuracy at intermediate
stages commonly described as a u-shaped learning trajectory, and can be found wherever a
learning process is guided by both rules and exemplar-based exceptions.
      </p>
      <p>6It appears that Louvain modularity prefers certain community sizes and constellations
over others, raising the question of whether other algorithms may be better suited for
corpus-linguistic analysis. This issue has not been considered in the analysis presented and
remains to be addressed in future research.
many nodes that are more randomly connected to other parts of the graph.
A graph of negative modularity contains fewer edges between nodes than
would be expected by chance, which does not appear to be the case in lexical
graphs generally. Importantly, modularity is not an artifact of graph size. As
exemplified in Figure 3, graphs of the same size in terms of nodes and edges
can be more or less structured, and thus possess higher or lower modularity.</p>
      <p>An analysis of a range of Kobalt subcorpora, each represented by five
samples in Figure 4, shows that hypotheses 1 and 2 are met by the data, and
hypothesis 3 is met in the Belarusian, but not in the Chinese subcorpus. A
sliding window analysis of the same data is presented in Figure 5. Windows
of 15 texts were used for each data point. Window 1 is created from texts
115 arranged by test scores, window 2 from texts 2-16 and so on. This analysis
shows that, beyond the grouped samples, a clear trajectory emerges over test
scores, which suggests that test scores and graph structures indeed correlate
in meaningful ways. If they did not, the trajectory would be expected to be
much more erratic.</p>
      <p>For comparison, a statistical analysis of the same data based on lexical
association measures was also performed. However, despite cohort
homogeneity and the limitation to an identical task/topic, there are very few
similar or identical collocational pairs across subcorpora. Absolute occurrences
quickly dwindle into very low numbers (below 10), which means that a high
and low rate of co-occurrence would falsely be identified from the same
order of magnitude. While 6 is of course three times as many as 2, the absolute
diefrence is not very telling in terms of the underlying diefrence in
coselectional association. In addition, the combinatorial power of lexical
material as it occurs even in small corpora is counter-intuitively large and poses
a hindrance to statistical interpretation. In fact, the number of verb and
accusative object lexemes as they are used in a small subcorpus of Kobalt
(21 texts, 148 unique verbs, 304 unique nouns in accusative object
position) results in a potential of 44,992 combinations. Even if one were to
assume that only 10% of those are semantically possible, and that only 10% of
the total 726 realizations of verbs and adjective objects in the subcorpus are
freely combined in the first place, this results in 72 draws from 449 elements.
The combinatorial potential of this is higher by several orders of magnitude
than the estimated number of atoms in the universe (3.352 · 1089 vs. 1078
– 1082). Constructing a scalar measure of coselectional constraint
statistically or stochastically would require a definition of thresholds for relative
frequencies at which the occurrence of word pairs would be considered within
or outside the range of expectation. However, drawing any specific
combination from this vast combinatorial space is extremely unlikely. Overall,
results from the statistical analysis remain inconclusive and dificult to
interpret. This is discussed in detail in Shadrova (2020, chapter 4).</p>
      <p>In contrast, the graph-based analysis shows clear trajectories and eefcts in
line with hypotheses 1 and 2, as well as a linguistically meaningful divergence
from hypothesis 3.
2.2</p>
      <sec id="sec-3-1">
        <title>Example 2: Lexicosyntactic Graphs and Grammar as a Graph</title>
        <p>The second example is not yet quantified and serves to demonstrate the
underlying modeling problem – a suggestion as to its quantification appears at
the end of this section.</p>
        <p>
          In the previous analysis, syntax is involved as a hierarchical lexical filter:
only those words that occur in certain syntactic slots of the verb are
considered in the structural analysis. Unlike positional models of collocation,
which consider words in a window of n tokens around the target word, a
syntactic model of this kind is able to eefctively filter for the relevant
collocations in languages with flexible word order such as German. However, in the
theoretical framework of usage-based linguistics that many corpus linguists
ascribe to, a strict division of lexicon and syntax is not usually presumed.
Instead, a number of models suggest the existence of a continuum, partial
overlap, and/or interdependence of lexical and syntactic elements. In fact,
some strands of grammar theory are largely focused on the very interface of
lexicon and syntax – these include construction grammar, which is entirely
based on the idea of the inseparability of lexicon and syntax
          <xref ref-type="bibr" rid="ref16 ref31 ref32 ref9">(Goldberg, 2005,
2013; Sag et al., 2012; Croft, 2001; Boas, 2013)</xref>
          , and modern approaches to
valency and dependency grammar, according to which grammar is generated
from specified features of lexical items
          <xref ref-type="bibr" rid="ref1 ref26 ref40 ref74">( Ágel and Fischer, 2010; Herbst, 2014;
Faulhaber, 2011)</xref>
          .7
7The latter is also partially true of some merging theories such as head-driven phrase
        </p>
        <p>While the idea of the syntax-lexicon continuum is appealing at first glance,
it quickly runs into conceptual problems and is dificult to operationalize.
For one thing, it requires clarity over whether the compared elements
constitute categories/variables or exemplars. Since many words are exemplars (like
on, after, if ), but many are also categories (like to be with its paradigm are,
were, is, etc.), this is not trivial. Second, syntax and higher-order functions
structure text in such a way that once a category is decided, many others fall
into place. At the same time, there is plenty of space for inter- and
intraindividual variation.</p>
        <p>
          So far, corpus studies have tended to look at the coselectional patterns of
individual lexemes or types of lexemes, or individual verb-argument
structures or classes thereof. For example,
          <xref ref-type="bibr" rid="ref20">Dux (2016)</xref>
          inspects the argument
structure patterns of selected verbs from the semantic fields of changing and
stealing, whereas
          <xref ref-type="bibr" rid="ref26">Faulhaber (2011)</xref>
          compares the argument structure
patterns of 88 diefrent verbs. Both conclude that semantics alone cannot
predict syntactic patterns with a high level of accuracy, and that lexical
idiosyncrasies need to be taken into consideration. A similar observation is made by
          <xref ref-type="bibr" rid="ref95">Zeldes (2013)</xref>
          concerning the productivity of verbs in their argument
selection. The unifying thread in these studies is that grammar is informed by the
preferences or selections of individual lexical items – something that is not
considered possible in more traditional approaches to syntax, where lexical
items are merged into syntactic patterns in a way in which syntax imposes
rules on the lexicon, but not vice versa.
        </p>
        <p>Technically, the studies mentioned do not model syntax and lexicon in the
same space – rather, they discuss the combinatorics of two sets of elements,
where the sets are selected by semantic or syntactic rules, similar to the
filtering function of syntax in the previous analysis.</p>
        <p>
          The mapping of two sets of elements, however, can be summarized and
made explicit as one system in a graph. An example is provided in Figures
6 and 7. In these graphs, lexemes and syntactic functions such as
grammatical subject (SUBJ) and direct or accusative object (OBJA) are modeled as
nodes, and the syntactic dependency between all syntactic functions is
encoded explicitly. At the same time, forces of association between lexemes
and syntactic functions are also represented in the graph. Thus, a
connection between what would be rows of a table of factor combinations are made
explicit in the graph structure. The visualization is produced by a so-called
force algorithm, a physics simulation in which higher frequency of
occurstructure grammar (HPSG,
          <xref ref-type="bibr" rid="ref82">Pollard and Sag (1994)</xref>
          ). However, syntax still has major
independence in these schools of thought, and lexical items can only select, but not generate,
syntactic rules. For a critical discussion of the various approaches to this issue and an
attempt to unify them, see
          <xref ref-type="bibr" rid="ref70">Müller (2013)</xref>
          and
          <xref ref-type="bibr" rid="ref71">Müller and Wechsler (2014)</xref>
          .
rence of a lexeme in a syntactic slot, or of one syntactic slot depending on
another, pull the two respective nodes into proximity.
        </p>
        <p>A closer look at the two figures reveals that the graph for intermediate
Belarusian learners of German in Figure 7 shows much more proximal
positions (i.e. shared lexemes) for subjects and accusative objects compared to the
native speaker graph in Figure 6, while objects appear to cluster in a more
coherent group in the native speaker graph. From a linguistic perspective, this
is interesting in two ways.</p>
        <p>First, as has already been noted, syntactic category distributions tend to
converge quickly in corpus data, and in many cases do not diefr much
between diefrent cohorts. Despite some minor pattern deviations, when
the same data is analyzed for distributions of syntactic categories, it does not
yield any substantial diefrences according to native language group.
However, through the combination of syntactic categories and lexical items,
substantial diefrences in the graphs of the two cohorts do emerge. This suggests
that graph structure is capable of capturing linguistically meaningful eefcts
that are not encoded in the individual co-occurrences of lexemes in the
syntactic structures themselves, but only in their interrelation: it appears that
lexicosyntax lives in the cross-systemic structure, and not in the
combinations of words.</p>
        <p>Second, the specific diefrences between the two graphs correspond to
higher-level linguistic concepts: a higher interchangeability of subject vs.
object lexemes may indicate passivization, or it might point towards diefrences
in anaphoricity (i.e. the types and structure of referentiality to previously
introduced discourse referents). This is particularly interesting because
Russian and Belarusian are partially pro-drop languages, meaning that subjects
can be left out or go unrealized in many contexts, which raises the
plausibility for diefrent encodings of the relation between subjects and objects in
the minds of native speakers of those languages.</p>
        <p>Figure 8, which shows the same type of graph for intermediate Chinese
learners of German in Kobalt, underscores this point. Here, subjects and
predicates are more closely aligned, which may be rooted in typological aspects
of Mandarin Chinese; a theme-rheme language in which a sentence topic
is presented and then commented upon, and which can thus be plausibly
mapped to a subject-copula-predicate construction in German. However,
since these analyses are post-hoc, and the graphs involved are
opportunistically derived from previously analyzed data, more research is needed to verify
their genuine usefulness in the study of lexicosyntax.</p>
        <p>This includes the need for quantification. Even if the visualization
suggests certain eefcts, these can only be verified through a comparison with
other graphs from the same as well as other cohorts. Assessing diefrences of
this kind visually can be tedious and unreliable. Furthermore, a
quantification would allow for an analysis of variance (i.e. more or less similar graphs
of the same cohort), while a visual assessment can only extract obviously
diverging patterns.</p>
        <p>
          The visualization encodes structural information that is also present
quantitatively in the graph itself, and may be used for external quantification.
One possible way to approach this would be through non-negative matrix
factorization (NMF,
          <xref ref-type="bibr" rid="ref30">Gillis (2014)</xref>
          ), which is used in information extraction
techniques such as topic modeling
          <xref ref-type="bibr" rid="ref13 ref18 ref53">(de Paulo Faleiros and de Andrade Lopes,
2016; Chen et al., 2019; Kuang et al., 2015)</xref>
          . NMF is a dimensionality
reduction algorithm that takes a matrix and deterministically reduces it to a unique
vector. Vectors can then be compared via cosine distance, as is done in certain
applications of computational linguistics such as word embeddings
          <xref ref-type="bibr" rid="ref23 ref67 ref79">(Mikolov et al., 2013; Pennington et al., 2014; Ethayarajh, 2019, among others)</xref>
          .
        </p>
        <p>While conceptually and computationally well-implemented, the central
question arising here is what should be encoded in the matrix to ensure
linguistic validity. One approach would be to span a matrix for all nodes times
all nodes (i.e. both lexemes and syntactic functions). But since words can
only occur in syntactic slots, this would leave most cells empty, perhaps
overestimating similarity. The other would be to abstract from the concrete
lexemes and span the matrix for syntactic nodes only, entering only the number
of shared lexemes in each field. However, this implies a double
dimensionality reduction and leaves out not only the exemplar information, but also
structural information that is relevant to the graph visualization and relates
to linguistically meaningful aspects of the graph structure, such as the issue
of how node clusters are distributed in relation to other node clusters with
which they do not share information.</p>
        <p>This goes to show that computing just any graph metric will not sufice
for the development of a linguistically interpretable method of graph
measurement. As for all methodological development, in-depth validation,
replication, and extension to new data are of crucial importance - but perhaps
more so is the embedding into the theoretical frameworks of linguistic study,
and the triangulation with results obtained through other methods.
3</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conceptual Validation and Future Research</title>
      <p>Graph-based modeling could bring new perspectives into quantitative
corpus linguistics. Its major advantages over frequentist statistical modeling are
that it: (a) captures relational information directly, without the internal
triangulation of various measures; and b) does not infer to concepts such as
stationary and ergodic stochastic systems or a stable and existing
superpopulation. However, it is going to take extensive further research to fully
assess the potential of graph metrics. This is particularly true since unlike in
mathematical graph theory or network analysis, the words of a language are
both representative of linguistic categories and relevantly individual. That
is to say, each item diefrs in important aspects from all other items, which
means that, generally speaking, a full abstraction over all words will not be
well-aligned with the linguistic model. Detailed theoretical modeling and
empirical conceptual validation are therefore necessary to ensure that graph
metrics do indeed measure linguistic aspects and not spurious information
of a hyperstructure that is only marginally important in linguistic terms.</p>
      <p>
        An example of such an eefct is the measurement of the in- and out-degree
of lexemes in corpora, a measurement that consistently detects so-called
small-world-efects . These are graph structures in which some nodes have
many in- or outgoing edges, while most others have only few, and have been
measured consistently through a range of published research across many
corpora
        <xref ref-type="bibr" rid="ref15 ref18 ref27">(Choudhury and Mukherjee, 2009; Ferrer i Cancho and Solé, 2001;
Wachs-Lopes and Rodrigues, 2016, among others)</xref>
        . While one might be
tempted to view this as a graph-based discovery, the fact of the matter is
that quantitative linguistics has always known that lexemes follow a Zipf,
long-tailed, or power-law distribution, also called a distribution with large
numbers of rare events (LNRE). In large corpora, a small number of words
are very frequent – mostly functional words like prepositions or pronouns
– and typically up to half of the lexemes are so-called hapax legomena or dis
legomena, words that occur only once or twice in the corpus. Arranged in a
graph, this would trivially imply degrees of one or two for most words,
leaving an impression of high centrality or connectivity of individual lexemes.
However, to corpus linguistics, this is merely a visualization of an already
well-established fact and not a structural discovery. In order to avoid the
dead-ends of epiphenomenal observations, a clear theoretical foundation as
well as a structured approach to the validation of graph-based research in
linguistics is crucial.
      </p>
      <p>With the development of suficient computing power, graph-based
analysis has become more advanced and central not only in traditional STEM
subjects, but also in the humanities and social sciences. Accordingly, there is
an ongoing development of new metrics. Some of these metrics are derived
from graph theory directly, which means that they abstract more strongly
from individual nodes and focus on the more abstract properties of node
and edge classes. Isographs, i.e. structurally identical (sub-)graphs that may
diefr significantly in their node contents, are a particularly relevant issue in
this context. Their existence may or may not have implications for the
underlying theoretical model, and should be considered in any application of
graph metrics to a subject-specific research question.</p>
      <p>In order to make the most eficient use of graph metrics in linguistics and
other fields of study, it is important to consider implications of this kind not
only mathematically, but also epistemologically by asking questions like the
following:
1. Is the theoretical model well-represented by the mathematical model?
2. Are there aspects to the mathematical model that may interfere with
subject-specific theoretical underpinnings and/or interpretations of
the data?
Much more validation and research is required to identify concerns in this
regard, including: the application of measures to more data; the detailed
modeling and mapping of theoretical to mathematical concepts; the calibration
and triangulation of metrics; and the development of evaluative frameworks
for all of the above.
4</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>Graph metrics are very new to the field of corpus linguistics, especially in
core-lingustic (rather than psycholinguistic or computational) research. The
analyses discussed in this paper suggest that graph-based modeling and
quantification may provide an alternative to operationalizations in the more
traditionally applied framework of frequentist statistics. They may even provide
solutions to some of the common practical problems of corpus linguistics,
since by encoding information at a higher density, graphs allow for the
quantification of small data. By abstracting from individual entities such as
lexemes they also allow for structural assessment without the triangulation and
comparison of a large number of words or word pairs, which implies fewer
artifacts from text length, a problem that is notorious with all corpus research.</p>
      <p>
        With advances in computing power and the ease provided by graph
infrastructures such as neo4j, the corpus search engine graphANNIS
        <xref ref-type="bibr" rid="ref51">(Krause,
2019)</xref>
        , the community API in Python, the GUI-based graph analysis
program Gephi
        <xref ref-type="bibr" rid="ref4">(Bastian et al., 2009)</xref>
        , and the igraph package in R
        <xref ref-type="bibr" rid="ref17">(Csardi and
Nepusz, 2006)</xref>
        , graph metrics are becoming more usable outside of
traditional computationally oriented subjects. However, this should not tempt
linguists and other researchers to blindly compute graph metrics on data
without further consideration of the underlying model. Much more
research is required to reliably map the concepts of graph theory and network
analysis in a way that is fully compatible with linguistic concepts, and to
ensure that the application of graph metrics does not produce undesirable
effects like the ones associated with the use of frequentist statistics in
corpusbased studies of the lexicon and lexicosyntax. The most immediate
desideratum for future research into the graph-based analysis of corpus data thus
lies in the fields of theoretical and quantitative modeling, validation, and
replication.
5
      </p>
    </sec>
    <sec id="sec-6">
      <title>A Note on Software</title>
      <p>
        Kobalt was preprocessed with TreeTagger8 and Malt Parser
        <xref ref-type="bibr" rid="ref72">(Nivre et al.,
2006)</xref>
        based on
        <xref ref-type="bibr" rid="ref28">Foth et al. (2006)</xref>
        ’s dependency grammar of German, which
was slightly adjusted for the purposes of this analysis. Further details can
be found in
        <xref ref-type="bibr" rid="ref86">Shadrova (2020)</xref>
        . For graph extraction, analysis, and
visualization, R
        <xref ref-type="bibr" rid="ref94">(R Core Team, 2015)</xref>
        and RStudio (RStudio Team, 2015) with
packages reshape2 (Wickham, 2007),
        <xref ref-type="bibr" rid="ref19">dplyr (Wickham et al., 2018</xref>
        ),
jsonlite
        <xref ref-type="bibr" rid="ref73">(Ooms, 2014)</xref>
        , and ggplot2
        <xref ref-type="bibr" rid="ref92">(Wickham, 2016)</xref>
        were employed. Graphs
have been visualized with D3.js
        <xref ref-type="bibr" rid="ref10">(Bostock et al., 2011)</xref>
        , Python matplotlib
        <xref ref-type="bibr" rid="ref45">(Hunter, 2007)</xref>
        , an
        <xref ref-type="bibr" rid="ref19">d networkX (Hagberg et al., 2008</xref>
        ). Modularity was
computed with Python’s community API developed by Thomas Aynaud
(https://python-louvain.readthedocs.io/en/latest/api.html). Figure 1 was created
with Gephi
        <xref ref-type="bibr" rid="ref4">(Bastian et al., 2009)</xref>
        .
      </p>
      <p>8https://cis.uni-muenchen.de/~schmid/tools/TreeTagger/</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgements</title>
      <p>I would like to thank Joris van Zundert for his very kind and constructive
comments on a previous version of this essay, which have greatly helped to
improve its focus and clarity. All remaining errors are of course my own.</p>
      <p>Leibniz-Institut für Deutsche Sprache (2019). Deutsches
Referenzkorpus/Archiv der Korpora geschriebener Gegenwartssprache 2019-I
(Release vom 18.03.2019). http://www.ids-mannheim.de/DeReKo.</p>
      <p>R Core Team (2015). R: A Language and Environment for Statistical
Computing. R Foundation for Statistical Computing, Vienna, https://www.</p>
      <p>R-project.org/.</p>
      <p>Reznicek, M., Walter, M., Schmidt, K., Lüdeling, A., et al. (2010). Das
Falko-Handbuch: Korpusaufbau und Annotationen. Institut für deutsche
Sprache und Linguistik, Humboldt-Universität zu Berlin.</p>
      <p>RStudio Team (2015). Rstudio: Integrated Development Environment for R.</p>
      <p>RStudio, Inc., Boston, MA, http://www.rstudio.com/.</p>
      <p>Sag, I. A., Boas, H. C., and Kay, P. (2012). Introducing Sign-Based
Construction Grammar. In Boas, H. C. and Sag, I. A., editors, Sign-Based
Construction Grammar, pages 1–30. CSLI Publications.</p>
      <p>Wachs-Lopes, G. A. and Rodrigues, P. S. (2016). Analyzing Natural Human
Language from the Point of View of Dynamic of a Complex Network.
Expert Systems with Applications, 45:8–22, DOI: 10.1016/j.eswa.2015.09.020.
Wickham, H. (2007). Reshaping Data with the reshape Package. Journal of
Statistical Software, 21(12):1–20, http://www.jstatsoft.org/v21/i12/.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Ágel</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Fischer</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>50 Jahre Valenztheorie und Dependenzgrammatik</article-title>
          .
          <source>Zeitschrift für germanistische Linguistik</source>
          ,
          <volume>38</volume>
          (
          <issue>2</issue>
          ):
          <fpage>249</fpage>
          -
          <lpage>290</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Aitchison</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corradi</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Latham</surname>
            ,
            <given-names>P. E.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Zipf's Law Arises Naturally When There Are Underlying, Unobserved Variables</article-title>
          .
          <source>PLoS computational biology</source>
          ,
          <volume>12</volume>
          (
          <issue>12</issue>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Baayen</surname>
            ,
            <given-names>R. H.</given-names>
          </string-name>
          (
          <year>2002</year>
          ).
          <source>Word Frequency Distributions</source>
          , volume
          <volume>18</volume>
          . Springer, Dordrecht, Netherlands.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Bastian</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heymann</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Jacomy</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2009</year>
          ).
          <article-title>Gephi: An Open Source Software for Exploring and Manipulating Networks</article-title>
          . http://www.aaai.org/ ocs/index.php/ICWSM/09/paper/view/154.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Bechet</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nasr</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Favre</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>Adapting Dependency Parsing to Spontaneous Speech for Open Domain Spoken Language Understanding</article-title>
          .
          <source>In Fifteenth Annual Conference of the International Speech Communication Association.</source>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Beckner</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blythe</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bybee</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Christiansen</surname>
            ,
            <given-names>M. H.</given-names>
          </string-name>
          , et al. (
          <year>2009</year>
          ).
          <article-title>Language Is a Complex Adaptive System: Position Paper</article-title>
          .
          <source>Language Learning</source>
          ,
          <volume>59</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>26</lpage>
          , DOI: 10.1111/j.1467-
          <fpage>9922</fpage>
          .
          <year>2009</year>
          .
          <volume>00533</volume>
          .x.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Blondel</surname>
            ,
            <given-names>V. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guillaume</surname>
            ,
            <given-names>J.-L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lambiotte</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Lefebvre</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Fast Unfolding of Communities in Large Networks</article-title>
          .
          <source>Journal of Statistical Mechanics: Theory and Experiment</source>
          ,
          <year>2008</year>
          (
          <volume>10</volume>
          ):
          <fpage>P10008</fpage>
          , DOI: 10.1088/
          <fpage>1742</fpage>
          -
          <lpage>5468</lpage>
          /
          <year>2008</year>
          /10/p10008.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          BNC World (
          <year>2007</year>
          ).
          <article-title>The British National Corpus</article-title>
          . http://www.natcorp.ox.ac. uk/.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Boas</surname>
            ,
            <given-names>H. C.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <source>Cognitive Construction Grammar</source>
          . Oxford University Press, DOI: 10.1093/oxfordhb/9780195396683.013.0013.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Bostock</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ogievetsky</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Heer</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>D3 Data-driven Documents</article-title>
          .
          <source>IEEE Transactions on Visualization and Computer Graphics</source>
          ,
          <volume>17</volume>
          (
          <issue>12</issue>
          ):
          <fpage>2301</fpage>
          -
          <lpage>2309</lpage>
          , DOI: 10.1109/TVCG.
          <year>2011</year>
          .
          <volume>185</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Bott</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Chemla</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Pragmatic Priming and the Search for Alternatives</article-title>
          .
          <source>In Proceedings of the Annual Meeting of the Cognitive Science Society</source>
          , volume
          <volume>35</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Brainerd</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Chang</surname>
            ,
            <given-names>S. M.</given-names>
          </string-name>
          (
          <year>1982</year>
          ).
          <article-title>Number of Occurrences in Twostate Markov Chains, with an Application in Linguistics</article-title>
          .
          <source>The Canadian Journal of Statistics/La Revue Canadienne de Statistique</source>
          , pages
          <fpage>225</fpage>
          -
          <lpage>231</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            , H., Liu,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ye</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2019</year>
          ).
          <article-title>Experimental Explorations on Short Text Topic Mining between Lda and Nmf Based Schemes</article-title>
          .
          <source>Knowledge-Based Systems</source>
          ,
          <volume>163</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>13</lpage>
          , DOI: 10.1016/j.knosys.
          <year>2018</year>
          .
          <volume>08</volume>
          .011.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Choi</surname>
            ,
            <given-names>J. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tetreault</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Stent</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2015</year>
          ). It Depends:
          <article-title>Dependency Parser Comparison Using a Web-based Evaluation Tool</article-title>
          .
          <source>In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)</source>
          , pages
          <fpage>387</fpage>
          -
          <lpage>396</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Choudhury</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Mukherjee</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2009</year>
          ).
          <article-title>The Structure and Dynamics of Linguistic Networks</article-title>
          .
          <source>In Dynamics on and of Complex Networks</source>
          , pages
          <fpage>145</fpage>
          -
          <lpage>166</lpage>
          . Birkhäuser, Boston, MA.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Croft</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          (
          <year>2001</year>
          ).
          <article-title>Radical Construction Grammar: Syntactic Theory in Typological Perspective</article-title>
          . Oxford University Press, DOI: 10.1093/acprof:oso/9780198299554.001.0001.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Csardi</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Nepusz</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>The igraph Software Package for Complex Network Research</article-title>
          . InterJournal, Complex Systems:
          <volume>1695</volume>
          , http:// igraph.org.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>de Paulo Faleiros</surname>
          </string-name>
          , T. and
          <string-name>
            <surname>de Andrade Lopes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>On the Equivalence between Algorithms for Non-negative Matrix Factorization and Latent Dirichlet Allocation</article-title>
          .
          <source>In ESANN.</source>
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Dębowski</surname>
            ,
            <given-names>Ł.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Is Natural Language a Perigraphic Process? The Theorem about Facts and Words Revisited</article-title>
          . Entropy,
          <volume>20</volume>
          (
          <issue>2</issue>
          ):
          <fpage>85</fpage>
          , DOI: 10.3390/e20020085.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Dux</surname>
            ,
            <given-names>R. J.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>A Usage-based Approach to Verb Classes in English and German</article-title>
          .
          <source>PhD thesis</source>
          , https://tdl-ir.tdl.org/handle/2152/ETD-UT-2011
          <string-name>
            <surname>-</surname>
          </string-name>
          05-3114.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Eckes</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Fremdsprache (onDaF): Theoretische Grundlagen, Konstruktion und Validierung</article-title>
          . In Grotjahn, R., editor,
          <source>C-Test: Contributions from Current Research</source>
          , volume
          <volume>18</volume>
          <source>of Language Testing and Evaluation</source>
          , pages
          <fpage>125</fpage>
          -
          <lpage>192</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>Ellis</surname>
            ,
            <given-names>N. C.</given-names>
          </string-name>
          (
          <year>2016</year>
          ). Salience, Cognition,
          <string-name>
            <given-names>Language</given-names>
            <surname>Complexity</surname>
          </string-name>
          , and
          <article-title>Complex Adaptive Systems</article-title>
          . Studies in Second Language Acquisition,
          <volume>38</volume>
          (
          <issue>2</issue>
          ):
          <fpage>341</fpage>
          -
          <lpage>351</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Ethayarajh</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          (
          <year>2019</year>
          ).
          <article-title>How Contextual are Contextualized Word Representations? Comparing the Geometry of BERT, ELMo,</article-title>
          and GPT-2
          <string-name>
            <surname>Embeddings</surname>
          </string-name>
          . arXiv, page
          <year>1909</year>
          .
          <volume>00512</volume>
          , https://arxiv.org/abs/
          <year>1909</year>
          .00512.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>Evert</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <source>The Statistics of Word Cooccurrences: Word Pairs and Collocations. Dissertation</source>
          , Institut für maschinelle Sprachverarbeitung, Universität Stuttgart.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <surname>Evert</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Uhrig</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bartsch</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Proisl</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>E-view-afilationa Large-scale Evaluation Study of Association Measures for Collocation Identification</article-title>
          .
          <source>Proceedings of eLex 2017-Electronic lexicography in the 21st century: Lexicography from Scratch</source>
          , pages
          <fpage>531</fpage>
          -
          <lpage>549</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <surname>Faulhaber</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Verb Valency Patterns: A Challenge for Semantics-based Accounts</article-title>
          , volume
          <volume>71</volume>
          . De Gruyter.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <surname>Ferrer i Cancho</surname>
          </string-name>
          , R. and
          <string-name>
            <surname>Solé</surname>
            ,
            <given-names>R. V.</given-names>
          </string-name>
          (
          <year>2001</year>
          ).
          <source>The Small World of Human Language. Proceedings of the Royal Society of London. Series B: Biological Sciences</source>
          ,
          <volume>268</volume>
          (
          <issue>1482</issue>
          ):
          <fpage>2261</fpage>
          -
          <lpage>2265</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <string-name>
            <surname>Foth</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>By</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Menzel</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>Guiding a Constraint Dependency Parser with Supertags</article-title>
          .
          <source>In Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL - ACL '06</source>
          .
          <article-title>Association for Computational Linguistics</article-title>
          , DOI: 10.3115/1220175.1220212.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          <string-name>
            <surname>Franco</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rolls</surname>
            ,
            <given-names>E. T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aggelopoulos</surname>
            ,
            <given-names>N. C.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Jerez</surname>
            ,
            <given-names>J. M.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>Neuronal Selectivity, Population Sparseness, and Ergodicity in the Inferior Temporal Visual Cortex</article-title>
          .
          <source>Biological cybernetics</source>
          ,
          <volume>96</volume>
          (
          <issue>6</issue>
          ):
          <fpage>547</fpage>
          -
          <lpage>560</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          <string-name>
            <surname>Gillis</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>The Why and How of Nonnegative Matrix Factorization</article-title>
          . Regularization, optimization, kernels, and support vector machines,
          <volume>12</volume>
          (
          <issue>257</issue>
          ):
          <fpage>257</fpage>
          -
          <lpage>291</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          <string-name>
            <surname>Goldberg</surname>
            ,
            <given-names>A. E.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <source>Constructions at Generalization in Language. Oxford</source>
          <volume>10</volume>
          .1093/acprof:oso/9780199268511.001.0001.
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          <string-name>
            <surname>Goldberg</surname>
            ,
            <given-names>A. E.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <source>Constructionist Approaches</source>
          . Oxford University Press, DOI: 10.1093/oxfordhb/9780195396683.013.0002.
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          <string-name>
            <surname>Goodman</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          (
          <year>1961</year>
          ).
          <article-title>Graphs for Linguistics</article-title>
          .
          <source>American Mathematical Society</source>
          , DOI: 10.1090/psapm/012/9982.
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          <string-name>
            <surname>Granger</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dagneaux</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meunier</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Paquot</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2009</year>
          ).
          <source>International Corpus of Learner English</source>
          . Presses universitaires de Louvain,
          <string-name>
            <surname>Louvain-</surname>
          </string-name>
          la-Neuve.
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          <string-name>
            <surname>Gries</surname>
            ,
            <given-names>S. T.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>Syntactic Priming: A Corpus-based Approach</article-title>
          .
          <source>Journal of psycholinguistic research</source>
          ,
          <volume>34</volume>
          (
          <issue>4</issue>
          ):
          <fpage>365</fpage>
          -
          <lpage>399</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          <string-name>
            <surname>Gries</surname>
            ,
            <given-names>S. T.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <fpage>50</fpage>
          -something Years of Work on Collocations.
          <source>International Journal of Corpus Linguistics</source>
          ,
          <volume>18</volume>
          (
          <issue>1</issue>
          ):
          <fpage>137</fpage>
          -
          <lpage>166</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          <string-name>
            <surname>Gries</surname>
            ,
            <given-names>S. T.</given-names>
          </string-name>
          (
          <year>2019</year>
          ). 15 Years of Collostructions.
          <source>International Journal of Corpus Linguistics</source>
          ,
          <volume>24</volume>
          (
          <issue>3</issue>
          ):
          <fpage>385</fpage>
          -
          <lpage>412</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          <string-name>
            <surname>Gries</surname>
            ,
            <given-names>S. T.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Stefanowitsch</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2004</year>
          ).
          <article-title>Extending Collostructional Analysis: A Corpus-based Perspective Onalternations'</article-title>
          .
          <source>International journal of corpus linguistics</source>
          ,
          <volume>9</volume>
          (
          <issue>1</issue>
          ):
          <fpage>97</fpage>
          -
          <lpage>129</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          <string-name>
            <surname>Hagberg</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Swart</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Chult</surname>
            ,
            <given-names>D. S.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Exploring Network Structure, Dynamics,</article-title>
          and
          <article-title>Function Using Networkx</article-title>
          .
          <source>In Proceedings of the 7th Python in Science Conference</source>
          , pages
          <fpage>11</fpage>
          -
          <lpage>15</lpage>
          . https://www.osti.gov/biblio/ 960616.
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          <string-name>
            <surname>Herbst</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>The Valency Approach to Argument Structure Constructions</article-title>
          .
          <source>In Constructions Collocations Patterns, number 282 in Trends in Linguistics. Studies and Monographs [TiLSM]</source>
          , pages
          <fpage>167</fpage>
          -
          <lpage>216</lpage>
          . De Gruyter, Mouton, DOI: 10.1515/9783110356854.167.
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          <string-name>
            <surname>Hirschmann</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Modifikatoren im Deutschen: Ihre Klassifizierung und varietätenspezifische Verwendung</article-title>
          . Stauefnburg, Tübingen.
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          <string-name>
            <surname>Hoey</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>Lexical Priming: A New Theory of Words and Language</article-title>
          . Routledge, London, DOI: 10.4324/9780203327630.
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          <string-name>
            <surname>Holland</surname>
            ,
            <given-names>J. H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gong</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Minett</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ke</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , et al. (
          <year>2005</year>
          ).
          <article-title>Language Acquisition as a Complex Adaptive System. Language acquisition, change and emergence</article-title>
          , pages
          <fpage>411</fpage>
          -
          <lpage>435</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          <string-name>
            <surname>Howarth</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>1998</year>
          ).
          <article-title>Phraseology and Second Language Proficiency</article-title>
          .
          <source>Applied Linguistics</source>
          ,
          <volume>19</volume>
          (
          <issue>1</issue>
          ):
          <fpage>24</fpage>
          -
          <lpage>44</lpage>
          , DOI: 10.1093/applin/19.1.24.
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          <string-name>
            <surname>Hunter</surname>
            ,
            <given-names>J. D.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <source>Matplotlib: A 2D Graphics Environment. Computing in Science &amp; Engineering</source>
          ,
          <volume>9</volume>
          (
          <issue>3</issue>
          ):
          <fpage>90</fpage>
          -
          <lpage>95</lpage>
          , DOI: 10.1109/
          <string-name>
            <surname>MCSE</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <volume>55</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          <string-name>
            <surname>James</surname>
            ,
            <given-names>L. E.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Burke</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          (
          <year>2000</year>
          ).
          <article-title>Phonological Priming Eefcts on Word Retrieval and Tip-of-the-Tongue Experiences in Young and Older Adults</article-title>
          .
          <source>Journal of Experimental Psychology: Learning, Memory, and Cognition</source>
          ,
          <volume>26</volume>
          (
          <issue>6</issue>
          ):
          <fpage>1378</fpage>
          -
          <lpage>1391</lpage>
          , DOI: 10.1037/
          <fpage>0278</fpage>
          -
          <lpage>7393</lpage>
          .
          <year>26</year>
          .6.1378.
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          <string-name>
            <surname>Jelinek</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bahl</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Mercer</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>1975</year>
          ).
          <article-title>Design of a Linguistic Statistical Decoder for the Recognition of Continuous Speech</article-title>
          .
          <source>IEEE Transactions on Information Theory</source>
          ,
          <volume>21</volume>
          (
          <issue>3</issue>
          ):
          <fpage>250</fpage>
          -
          <lpage>256</lpage>
          , DOI: 10.1109/tit.
          <year>1975</year>
          .
          <volume>1055384</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          <string-name>
            <surname>Jones</surname>
            ,
            <given-names>L. L.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Estes</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>Lexical Priming: Associative, Semantic, and Thematic Influences on Word Recognition</article-title>
          . In Adelman, J. S., editor,
          <source>Visual Word Recognition: Meaning and Context, Individuals and Development</source>
          , volume
          <volume>2</volume>
          of Current Issues in
          <source>the Psychology of Language</source>
          , pages
          <fpage>44</fpage>
          -
          <lpage>72</lpage>
          . Psychology Press, London, DOI: 10.4324/9780203106976.
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          <string-name>
            <surname>Kilgarrif</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2005</year>
          ). Language Is Never, Ever, Ever, Random.
          <source>Corpus Linguistics and Linguistic Theory</source>
          ,
          <volume>1</volume>
          (
          <issue>2</issue>
          ):
          <fpage>263</fpage>
          -
          <lpage>276</lpage>
          , DOI: 10.1515/cllt.
          <year>2005</year>
          .
          <volume>1</volume>
          .2.263.
        </mixed-citation>
      </ref>
      <ref id="ref50">
        <mixed-citation>
          <string-name>
            <surname>Koplenig</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Against Statistical Significance Testing in Corpus Linguistics</article-title>
          .
          <source>Corpus Linguistics and Linguistic Theory</source>
          ,
          <volume>15</volume>
          :
          <fpage>321</fpage>
          -
          <lpage>346</lpage>
          , DOI: 10.1515/cllt-2016-0036.
        </mixed-citation>
      </ref>
      <ref id="ref51">
        <mixed-citation>
          <string-name>
            <surname>Krause</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2019</year>
          ).
          <article-title>ANNIS: A Graph-based Query System for Deeply Annotated Text Corpora</article-title>
          .
          <source>PhD Thesis</source>
          , Humboldt-Universität zu Berlin,
          <string-name>
            <surname>Mathematisch-Naturwissenschaftliche</surname>
            <given-names>Fakultät</given-names>
          </string-name>
          , DOI: 10.18452/19659.
        </mixed-citation>
      </ref>
      <ref id="ref52">
        <mixed-citation>
          <string-name>
            <surname>Krivanek</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Meurers</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Comparing Rule-based and Datadriven Dependency Parsing of Learner Language</article-title>
          . In Gerdes,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Hajičová</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            , and
            <surname>Wanner</surname>
          </string-name>
          , L., editors,
          <source>Computational Dependency Theory</source>
          , pages
          <fpage>207</fpage>
          -
          <lpage>225</lpage>
          . IOS press.
        </mixed-citation>
      </ref>
      <ref id="ref53">
        <mixed-citation>
          <string-name>
            <surname>Kuang</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Choo</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Park</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Nonnegative Matrix Factorization for Interactive Topic Modeling and Document Clustering</article-title>
          .
          <source>In Partitional Clustering Algorithms</source>
          , pages
          <fpage>215</fpage>
          -
          <lpage>243</lpage>
          . Springer, Cham.
        </mixed-citation>
      </ref>
      <ref id="ref54">
        <mixed-citation>
          <string-name>
            <surname>Kuno</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Takami</surname>
          </string-name>
          , K.-i. (
          <year>2004</year>
          ).
          <article-title>Functional Constraints in Grammar: On the Unergative-unaccusative Distinction</article-title>
          . John Benjamins, https://www. jbe-platform.com/content/books/9789027295217.
        </mixed-citation>
      </ref>
      <ref id="ref55">
        <mixed-citation>
          <string-name>
            <surname>Lerner</surname>
            ,
            <given-names>R. M.</given-names>
          </string-name>
          (
          <year>2012</year>
          ). Developmental Science: Past, Present, and Future.
          <source>International Journal of Developmental Science</source>
          ,
          <volume>6</volume>
          (
          <issue>1</issue>
          -2):
          <fpage>29</fpage>
          -
          <lpage>36</lpage>
          , DOI: 10.3233/DEV-2012-12102.
        </mixed-citation>
      </ref>
      <ref id="ref56">
        <mixed-citation>
          <string-name>
            <surname>Linck</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Cunnings</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>The Utility and Application of Mixed-Eefcts Models in Second Language Research</article-title>
          .
          <source>Language Learning</source>
          ,
          <volume>65</volume>
          (
          <issue>S1</issue>
          ):
          <fpage>185</fpage>
          -
          <lpage>207</lpage>
          , DOI: 10.1111/lang.12117.
        </mixed-citation>
      </ref>
      <ref id="ref57">
        <mixed-citation>
          <string-name>
            <surname>Loebell</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <article-title>and</article-title>
          <string-name>
            <surname>Bock</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          (
          <year>2003</year>
          ).
          <source>Structural Priming across Languages. Linguistics</source>
          ,
          <volume>41</volume>
          (
          <issue>5</issue>
          ):
          <fpage>791</fpage>
          -
          <lpage>824</lpage>
          , DOI: 10.1515/ling.
          <year>2003</year>
          .
          <volume>026</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref58">
        <mixed-citation>
          <string-name>
            <surname>Lowie</surname>
            ,
            <given-names>W. M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Verspoor</surname>
            ,
            <given-names>M. H.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Individual Diefrences and the Ergodicity Problem</article-title>
          .
          <source>Language Learning</source>
          ,
          <volume>69</volume>
          :
          <fpage>184</fpage>
          -
          <lpage>206</lpage>
          , DOI: 10.1111/lang.12324.
        </mixed-citation>
      </ref>
      <ref id="ref59">
        <mixed-citation>
          <string-name>
            <surname>Lucas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2000</year>
          ).
          <article-title>Semantic Priming without Association: A Meta-Analytic Review</article-title>
          .
          <source>Psychonomic Bulletin &amp; Review</source>
          ,
          <volume>7</volume>
          (
          <issue>4</issue>
          ):
          <fpage>618</fpage>
          -
          <lpage>630</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref60">
        <mixed-citation>
          <string-name>
            <surname>Luce</surname>
            ,
            <given-names>P. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goldinger</surname>
            ,
            <given-names>S. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Auer</surname>
            ,
            <given-names>E. T.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Vitevitch</surname>
            ,
            <given-names>M. S.</given-names>
          </string-name>
          (
          <year>2000</year>
          ).
          <article-title>Phonetic Priming, Neighborhood Activation, and PARSYN</article-title>
          . Perception &amp; psychophysics,
          <volume>62</volume>
          (
          <issue>3</issue>
          ):
          <fpage>615</fpage>
          -
          <lpage>625</lpage>
          , DOI: 10.3758/bf03212113.
        </mixed-citation>
      </ref>
      <ref id="ref61">
        <mixed-citation>
          <string-name>
            <surname>Lüdeling</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hirschmann</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Shadrova</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Linguistic Models, Acquisition Theories, and Learner Corpora: Morphological Productivity in SLA Research Exemplified by Complex Verbs in German</article-title>
          .
          <source>Language Learning</source>
          ,
          <volume>67</volume>
          (
          <issue>S1</issue>
          ):
          <fpage>96</fpage>
          -
          <lpage>129</lpage>
          , DOI: 10.1111/lang.12231.
        </mixed-citation>
      </ref>
      <ref id="ref62">
        <mixed-citation>
          <string-name>
            <surname>Lüdeling</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Doolittle</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hirschmann</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schmidt</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Walter</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Das Lernerkorpus Falko</article-title>
          .
          <source>Deutsch als Fremdsprache</source>
          ,
          <volume>45</volume>
          (
          <issue>2</issue>
          ):
          <fpage>67</fpage>
          , DOI: 10.37307/j.2198-
          <lpage>2430</lpage>
          .
          <year>2008</year>
          .
          <volume>02</volume>
          .02.
        </mixed-citation>
      </ref>
      <ref id="ref63">
        <mixed-citation>
          <string-name>
            <surname>Lüdeling</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hirschmann</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shadrova</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Wan</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2021</year>
          ). Deutsch in Europa: Sprachpolitisch, grammatisch, methodisch,
          <source>chapter Tiefe Analyse von Lernerkorpora</source>
          , pages
          <fpage>235</fpage>
          -
          <lpage>284</lpage>
          . De Gruyter, DOI: doi:10.1515/
          <fpage>9783110731514</fpage>
          -
          <lpage>013</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref64">
        <mixed-citation>
          <string-name>
            <surname>Massip-Bonet</surname>
            ,
            <given-names>À.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Language as a Complex Adaptive System: Towards an Integrative Linguistics</article-title>
          , pages
          <fpage>35</fpage>
          -
          <lpage>60</lpage>
          . Springer, Berlin/Heidelberg, DOI: 10.1007/978-3-
          <fpage>642</fpage>
          -32817-
          <issue>6</issue>
          _
          <fpage>4</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref65">
        <mixed-citation>
          <string-name>
            <surname>McNamara</surname>
            ,
            <given-names>T. P.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>Semantic Priming: Perspectives from Memory and Word Recognition</article-title>
          . Psychology Press, DOI: 10.4324/9780203338001.
        </mixed-citation>
      </ref>
      <ref id="ref66">
        <mixed-citation>
          <string-name>
            <surname>Medaglia</surname>
            ,
            <given-names>J. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ramanathan</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Venkatesan</surname>
          </string-name>
          , U. M., and
          <string-name>
            <surname>Hillary</surname>
            ,
            <given-names>F. G.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>The Challenge of Non-ergodicity in Network Neuroscience</article-title>
          .
          <source>Network: Computation in Neural Systems</source>
          ,
          <volume>22</volume>
          (
          <issue>1-4</issue>
          ):
          <fpage>148</fpage>
          -
          <lpage>153</lpage>
          , DOI: 10.3109/09638237.
          <year>2011</year>
          .
          <volume>639604</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref67">
        <mixed-citation>
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sutskever</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
            ,
            <given-names>G. S.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Distributed Representations of Words and Phrases and Their Compositionality</article-title>
          .
          <source>In Advances in neural information processing systems</source>
          ,
          <source>NIPS'13</source>
          , pages
          <fpage>3111</fpage>
          -
          <lpage>3119</lpage>
          ,
          <string-name>
            <surname>Red</surname>
            <given-names>Hook</given-names>
          </string-name>
          , NY. Curran Associates Inc.
        </mixed-citation>
      </ref>
      <ref id="ref68">
        <mixed-citation>
          <string-name>
            <surname>Molenaar</surname>
            ,
            <given-names>P. C. M.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>On the Implications of the Classical Ergodic Theorems: Analysis of Developmental Processes Has to Focus on Intraindividual Variation</article-title>
          . Developmental Psychobiology:
          <source>The Journal of the International Society for Developmental Psychobiology</source>
          ,
          <volume>50</volume>
          (
          <issue>1</issue>
          ):
          <fpage>60</fpage>
          -
          <lpage>69</lpage>
          , DOI: 10.1002/dev.20262.
        </mixed-citation>
      </ref>
      <ref id="ref69">
        <mixed-citation>
          <string-name>
            <surname>Morey</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Muller</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Asher</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>A Dependency Perspective on Rst Discourse Parsing and Evaluation</article-title>
          .
          <source>Computational Linguistics</source>
          ,
          <volume>44</volume>
          (
          <issue>2</issue>
          ):
          <fpage>197</fpage>
          -
          <lpage>235</lpage>
          , DOI: 10.1162/coli_a_
          <fpage>00314</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref70">
        <mixed-citation>
          <string-name>
            <surname>Müller</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <source>Unifying Everything. Language</source>
          ,
          <volume>89</volume>
          (
          <issue>4</issue>
          ):
          <fpage>920</fpage>
          -
          <lpage>950</lpage>
          , https: //hpsg.hu-berlin.de/~stefan/Pub/unifying-everything.html.
        </mixed-citation>
      </ref>
      <ref id="ref71">
        <mixed-citation>
          <string-name>
            <surname>Müller</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Wechsler</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2014</year>
          ). Lexical Approaches to Argument Structure.
          <source>Theoretical Linguistics</source>
          ,
          <volume>40</volume>
          (
          <issue>1-2</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>76</lpage>
          , DOI: 10.1515/tl-2014-0001.
        </mixed-citation>
      </ref>
      <ref id="ref72">
        <mixed-citation>
          <string-name>
            <surname>Nivre</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hall</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Nilsson</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>MaltParser: A Data-Driven ParserGenerator for Dependency Parsing</article-title>
          .
          <source>In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC'06)</source>
          , Genoa, Italy.
          <source>European Language Resources Association (ELRA)</source>
          , http: //www.lrec-conf.org/proceedings/lrec2006/pdf/162_pdf.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref73">
        <mixed-citation>
          <string-name>
            <surname>Ooms</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>The jsonlite Package: A Practical and Consistent Mapping Between JSON Data and</article-title>
          R Objects. https://arxiv.org/abs/1403.2805.
        </mixed-citation>
      </ref>
      <ref id="ref74">
        <mixed-citation>
          <string-name>
            <surname>Ott</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Ziai</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Evaluating Dependency Parsing Performance on German Learner Language</article-title>
          .
          <source>In Proceedings of the Ninth International Workshop on Treebanks and Linguistic Theories</source>
          , volume
          <volume>9</volume>
          , pages
          <fpage>175</fpage>
          -
          <lpage>186</lpage>
          . NEALT Tartu.
        </mixed-citation>
      </ref>
      <ref id="ref75">
        <mixed-citation>
          <string-name>
            <surname>Papo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Why Should Cognitive Neuroscientists Study the Brain's Resting State? Frontiers in Human Neuroscience</article-title>
          ,
          <volume>7</volume>
          :
          <fpage>45</fpage>
          , DOI: 10.3389/fnhum.
          <year>2013</year>
          .
          <volume>00045</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref76">
        <mixed-citation>
          <string-name>
            <surname>Paquot</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Granger</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>Formulaic Language in Learner Corpora</article-title>
          .
          <source>Annual Review of Applied Linguistics</source>
          ,
          <volume>32</volume>
          :
          <fpage>130</fpage>
          -
          <lpage>149</lpage>
          , DOI: 10.1017/S0267190512000098.
        </mixed-citation>
      </ref>
      <ref id="ref77">
        <mixed-citation>
          <string-name>
            <surname>Pardo</surname>
            ,
            <given-names>J. S.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>On Phonetic Convergence during Conversational Interaction</article-title>
          .
          <source>The Journal of the Acoustical Society of America</source>
          ,
          <volume>119</volume>
          (
          <issue>4</issue>
          ):
          <fpage>2382</fpage>
          -
          <lpage>2393</lpage>
          , DOI: 10.1121/1.2178720.
        </mixed-citation>
      </ref>
      <ref id="ref78">
        <mixed-citation>
          <string-name>
            <surname>Pawley</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Syder</surname>
            ,
            <given-names>F. H.</given-names>
          </string-name>
          (
          <year>1983</year>
          ).
          <article-title>Two Puzzles for Linguistic Theory: Nativelike Selection and Nativelike Fluency</article-title>
          . In Richards,
          <string-name>
            <given-names>J. C.</given-names>
            and
            <surname>Schmidt</surname>
          </string-name>
          , R. W., editors,
          <source>Language and Communication, Language and Communication</source>
          , pages
          <fpage>191</fpage>
          -
          <lpage>226</lpage>
          . Routledge, London, DOI: 10.4324/9781315836027.
        </mixed-citation>
      </ref>
      <ref id="ref79">
        <mixed-citation>
          <string-name>
            <surname>Pennington</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Socher</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Manning</surname>
            ,
            <given-names>C. D.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>GloVe: Global Vectors for Word Representation</article-title>
          .
          <source>In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)</source>
          , pages
          <fpage>1532</fpage>
          -
          <lpage>1543</lpage>
          .
          <article-title>Association for Computational Linguistics</article-title>
          , DOI: 10.3115/v1/d14-
          <fpage>1162</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref80">
        <mixed-citation>
          <string-name>
            <surname>Piantadosi</surname>
            ,
            <given-names>S. T.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>Zipf's Word Frequency Law in Natural Language: A Critical Review</article-title>
          and
          <string-name>
            <given-names>Future</given-names>
            <surname>Directions</surname>
          </string-name>
          .
          <source>Psychonomic Bulletin &amp; Review</source>
          ,
          <volume>21</volume>
          (
          <issue>5</issue>
          ):
          <fpage>1112</fpage>
          -
          <lpage>1130</lpage>
          , DOI: 10.3758/s13423-014-0585-6.
        </mixed-citation>
      </ref>
      <ref id="ref81">
        <mixed-citation>
          <string-name>
            <surname>Pickering</surname>
            ,
            <given-names>M. J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Branigan</surname>
            ,
            <given-names>H. P.</given-names>
          </string-name>
          (
          <year>1998</year>
          ).
          <article-title>The Representation of Verbs: Evidence from Syntactic Priming in Language Production</article-title>
          .
          <source>Journal of Memory and language</source>
          ,
          <volume>39</volume>
          (
          <issue>4</issue>
          ):
          <fpage>633</fpage>
          -
          <lpage>651</lpage>
          , DOI: 10.1006/jmla.
          <year>1998</year>
          .
          <volume>2592</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref82">
        <mixed-citation>
          <string-name>
            <surname>Pollard</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Sag</surname>
            ,
            <given-names>I. A.</given-names>
          </string-name>
          (
          <year>1994</year>
          ).
          <article-title>Head-driven Phrase Structure Grammar</article-title>
          . University of Chicago Press.
        </mixed-citation>
      </ref>
      <ref id="ref83">
        <mixed-citation>
          <string-name>
            <surname>Sauer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Lüdeling</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>BeMaTaC: A Flexible Multilayer Spoken Dialogue Corpus for Contrastive SLA Analyses</article-title>
          .
          <source>In ICAME</source>
          , volume
          <volume>34</volume>
          , pages
          <fpage>46</fpage>
          -
          <lpage>47</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref84">
        <mixed-citation>
          <string-name>
            <surname>Schmid</surname>
          </string-name>
          , H.-J. and
          <string-name>
            <surname>Küchenhof</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Collostructional Analysis and Other Ways of Measuring Lexicogrammatical Attraction: Theoretical Premises, Practical Problems</article-title>
          and
          <string-name>
            <given-names>Cognitive</given-names>
            <surname>Underpinnings</surname>
          </string-name>
          .
          <source>Cognitive Linguistics</source>
          ,
          <volume>24</volume>
          (
          <issue>3</issue>
          ):
          <fpage>531</fpage>
          -
          <lpage>577</lpage>
          , DOI: 10.1515/cog-2013-0018.
        </mixed-citation>
      </ref>
      <ref id="ref85">
        <mixed-citation>
          <string-name>
            <surname>Seddah</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tsarfaty</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kübler</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Candito</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , et al. (
          <year>2013</year>
          ).
          <article-title>Overview of the SPMRL 2013 Shared Task: A Cross-Framework Evaluation of Parsing Morphologically Rich Languages</article-title>
          .
          <source>In Proceedings of the Fourth Workshop on Statistical Parsing of Morphologically-Rich Languages</source>
          , pages
          <fpage>146</fpage>
          -
          <lpage>182</lpage>
          , Seattle, WA. Association for Computational Linguistics, https: //aclanthology.org/W13-4917.
        </mixed-citation>
      </ref>
      <ref id="ref86">
        <mixed-citation>
          <string-name>
            <surname>Shadrova</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2020</year>
          ).
          <article-title>Measuring Coselectional Constraint in Learner Corpora: A Graph-based Approach</article-title>
          . Dissertation, Humboldt-Universität zu Berlin.
        </mixed-citation>
      </ref>
      <ref id="ref87">
        <mixed-citation>
          <string-name>
            <surname>Speelman</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heylen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Geeraerts</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Mixed-Efects Regression Models in Linguistics</article-title>
          . Springer, Cham.
        </mixed-citation>
      </ref>
      <ref id="ref88">
        <mixed-citation>
          <string-name>
            <surname>Steels</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          (
          <year>2000</year>
          ).
          <article-title>Language As a Complex Adaptive System</article-title>
          .
          <source>In Parallel Problem Solving from Nature PPSN VI</source>
          , pages
          <fpage>17</fpage>
          -
          <lpage>26</lpage>
          , Berlin/Heidelberg. Springer, DOI: 10.1007/3-540-45356-
          <issue>3</issue>
          _
          <fpage>2</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref89">
        <mixed-citation>
          <string-name>
            <surname>Steels</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Loetzsch</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Perspective alignment in spatial language</article-title>
          . arXiv, (
          <volume>0605012</volume>
          ), https://arxiv.org/abs/cs/0605012.
        </mixed-citation>
      </ref>
      <ref id="ref90">
        <mixed-citation>
          <string-name>
            <surname>Stefanowitsch</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Gries</surname>
            ,
            <given-names>S. T.</given-names>
          </string-name>
          (
          <year>2003</year>
          ).
          <article-title>Collostructions: Investigating the Interaction of Words and Constructions</article-title>
          .
          <source>International Journal of Corpus Linguistics</source>
          ,
          <volume>8</volume>
          (
          <issue>2</issue>
          ):
          <fpage>209</fpage>
          -
          <lpage>243</lpage>
          , DOI: 10.1075/ijcl.8.2.03ste.
        </mixed-citation>
      </ref>
      <ref id="ref91">
        <mixed-citation>
          <string-name>
            <surname>Stefanowitsch</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Gries</surname>
            ,
            <given-names>S. T.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <source>Covarying Collexemes. Corpus linguistics and linguistic theory</source>
          ,
          <volume>1</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>43</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref92">
        <mixed-citation>
          <string-name>
            <surname>Wickham</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>ggplot2: Elegant Graphics for Data Analysis</article-title>
          . Springer, New York, NY, http://ggplot2.org.
        </mixed-citation>
      </ref>
      <ref id="ref93">
        <mixed-citation>
          <string-name>
            <surname>Wickham</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>François</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Henry</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Müller</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>dplyr: A Grammar of Data Manipulation</article-title>
          , https://CRAN.R-project.org/package=dplyr.
          <source>R package version 0.7.6.</source>
        </mixed-citation>
      </ref>
      <ref id="ref94">
        <mixed-citation>
          <string-name>
            <surname>Williams</surname>
            ,
            <given-names>J. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lessard</surname>
            ,
            <given-names>P. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Desu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Clark</surname>
            ,
            <given-names>E. M.</given-names>
          </string-name>
          , et al. (
          <year>2015</year>
          ).
          <article-title>Zipf's Law Holds for Phrases, Not Words</article-title>
          .
          <source>Scientific Reports</source>
          ,
          <volume>5</volume>
          (
          <issue>1</issue>
          ):
          <fpage>12209</fpage>
          , DOI: 10.1038/srep12209.
        </mixed-citation>
      </ref>
      <ref id="ref95">
        <mixed-citation>
          <string-name>
            <surname>Zeldes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <source>Productive Argument Selection: Is Lexical Semantics Enough? Corpus Linguistics and Linguistic Theory</source>
          ,
          <volume>9</volume>
          (
          <issue>2</issue>
          ):
          <fpage>263</fpage>
          -
          <lpage>291</lpage>
          , DOI: 10.1515/cllt-2013-0006.
        </mixed-citation>
      </ref>
      <ref id="ref96">
        <mixed-citation>
          <string-name>
            <surname>Zinsmeister</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reznicek</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brede</surname>
            ,
            <given-names>J. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosén</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , et al. (
          <year>2012</year>
          ).
          <article-title>Das Wissenschaftliche Netzwerk „Kobalt-DaF“</article-title>
          .
          <source>Zeitschrift für germanistische Linguistik</source>
          ,
          <volume>40</volume>
          (
          <issue>3</issue>
          ):
          <fpage>457</fpage>
          -
          <lpage>458</lpage>
          , DOI: 10.1515/zgl-2012-0030.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>