<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>LEA - Linguistic Exercises with Annotation tools</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Fabian Barteld</string-name>
          <email>fabian.barteld@uni-hamburg.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Johanna Flick</string-name>
          <email>flick@uni-duesseldorf.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute of German Studies, Heinrich Heine University Du ̈ sseldorf</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institute of German Studies, Universita ̈t Hamburg</institution>
        </aff>
      </contrib-group>
      <fpage>11</fpage>
      <lpage>16</lpage>
      <abstract>
        <p>In this paper we present LEA (Linguistic Exercises with Annotation tools). LEA is a new didactic concept helping students to become familiar with corpus linguistic methods and annotation tools. The main idea behind LEA is that classical linguistic exercises are being solved with annotation tools. We will present the advantages of this method (e.g. didactic benefits, automatic correction) and describe two already existing LEA e-learning packages: partof-speech annotation using tab-separatedvalue files with spreadsheet software and syntactic analysis with Synpathy.1</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Corpus linguistics as a method has become more
and more important for linguistic research in the
last decades
        <xref ref-type="bibr" rid="ref13 ref3">(Gries, 2009; Bender and Good, 2010)</xref>
        .
However, creating or working with digitally
annotated data can be a difficult task for beginners. This
is especially true within the philologies where most
students lack the computational expertise needed
for corpus linguistics
        <xref ref-type="bibr" rid="ref6">(Bubenhofer, 2011)</xref>
        . In
addition, there is often not much time to teach the
related methods within these disciplines. This leads
to the situation that students have to deal with a lot
of practical issues when doing their first own
empirical research: “Working empirically is a complex
task and acquiring the necessary technical
expertise in order to use the tools can lead to frustration”
        <xref ref-type="bibr" rid="ref6">(Bubenhofer, 2011, p. 148)</xref>
        2.
      </p>
      <p>With LEA (Linguistic Exercises with
Annotation tools) we introduce an e-learning approach that
helps solving this problem. On the one hand, LEA
1Both authors contributed equally to the paper.</p>
      <p>2Own translation of: “Empirisches Arbeiten ist aufwa¨ndig
und wenn gleichzeitig noch die technischen Fertigkeiten
erlangt werden mu¨ssen, um die Werkzeuge anwenden zu ko¨nnen,
kann dies zu Frustgefu¨hlen fu¨hren.”
is a collection of ready-to-use exercises that can be
incorporated into classes when linguistic categories
like part of speech (PoS) are being introduced. On
the other hand, LEA is more than just an e-learning
package. It is a didactic methodology which helps
students to become familiar with corpus
linguistic methods and tools, since the exercises come in
the shape of annotation tasks: Students solve
traditional linguistic exercises like PoS categorization
or syntactic analysis and at the same time learn how
to use annotation tools.</p>
      <p>In the first part of the paper, we explain the
concept behind LEA. Then, this concept is further
illustrated with the description of two already
existing LEA packages (PoS classification and
syntactic analysis) which have been designed for first
semester introductory courses to German
linguistics as part of German philology. In the last part,
we present the experiences we made from applying
LEA in teaching.</p>
      <p>LEA is available at https://korpuslab.
github.io/lea.
2</p>
    </sec>
    <sec id="sec-2">
      <title>LEA – The concept</title>
      <p>
        LEA combines traditional linguistic exercises with
annotation tools, simplified annotation guidelines,
and tools for correction and evaluation. The
concept can be best explained in terms of the “Trojan
Horse metaphor” except that there is something
good coming out from the horse’s inside, i.e. the
computational know how: While doing exercises
students learn about corpus linguistic methods and
ways of creating sustainable annotated data, since
they are basically solving an annotation task. The
data that is annotated consists of invented sentences
as they are typically used in class for illustrating
and practicing specific linguistic concepts. The
students simply use annotation tools to “write down”
their solution for the exercise. Therefore, LEA
differs from other didactic approaches where either
existing corpus resources are used
        <xref ref-type="bibr" rid="ref1">(Beißwenger
and Storrer, 2011)</xref>
        or a corpus resource is created
by students
        <xref ref-type="bibr" rid="ref22">(Zeldes, 2017)</xref>
        .
      </p>
      <p>Using annotation tools for exercises has several
advantages over the classical way of using pen and
paper. First of all, it exposes students to software
and concepts from corpus linguistics before they
actually try to employ corpus linguistic methods.
The students can benefit from this new knowledge
in their later research and do not have to struggle
with the annotation tools and file formats when
acquiring statistical methods and other tools that are
necessary for corpus linguistic research. Secondly,
as the students’ answers are created digitally in a
certain format, it is easy for the lecturer to
evaluate the answers and prepare the discussion in class.
Thirdly, LEA supports media change: Instead of
solving exercises with pen and paper the students
use special computer programs. The new ways of
visualization can support a better understanding of
linguistic concepts. In addition, the exercises come
with annotation guidelines so that the concepts that
the students have learned and which they are
practicing are being presented from a new perspective.
This also supports the understanding process.</p>
      <p>Each LEA package has the same structure. It
consists of the following parts: (i) the exercise, (ii)
a manual describing how to work with the
exercise plus examples and annotation guidelines, (iii)
a manual for the annotation tool including
information on how to obtain it, (iv) the sample solution,
and (v) a tool for automatic correction and
evaluation of the answers. The whole package comes as
hypertext allowing to navigate the manuals and get
the files for the exercises.</p>
      <p>With the help of the manuals students learn to
install and use the software needed for the
exercises. Note that this step does not necessarily need
to be supervised in class since the manuals are
built as self-learning resources. Especially courses
with a full schedule and lack of time for technical
issues benefit from this approach. As mentioned
above, LEA aims at linguistics as part of
philologies where students as well as instructors often
lack computational expertise. Therefore, the
manuals lead users with screenshots through the steps
that are necessary to solve the exercise. They also
contain additional background information which
gives students the opportunity to learn more about
the tools and the data format. More experienced
learners can skip what they already know.</p>
      <p>LEA packages can be used straight away as they
come but they can also be adapted to the needs
of the course. How to create new exercises is
explained in the manuals. Due to its modular
structure, that is to say modules describing an
annotation tool are separated from modules containing the
exercise, it is easy to add a new exercise: The
exercise and its description can simply be combined
with already existing technical parts (mainly the
description of the annotation tool).</p>
      <p>Up to now, there are two LEA packages, one
for part of speech and one for phrase types and
syntactic functions.
3</p>
    </sec>
    <sec id="sec-3">
      <title>The part-of-speech exercise</title>
      <p>Every student of linguistics is being confronted
with PoS exercises at some point of her academical
studies, usually in the first year. A classical task
would be assigning categories like noun, verb or
preposition to specific word forms, e.g. the tokens
in the sentence The cat sits under the table. Since
not every classification is as easy as this example a
lot of practice is necessary.</p>
      <p>The LEA PoS package provides a new form
for this traditional task: Instead of annotating the
words with pen and paper the exercise comes in
a tab-separated-value (tsv) file that can be edited
with common spreadsheet applications like Libre
Office Calc3. We chose this as annotation tool for
the exercise because we assume that students are
usually already familiar with spreadsheet software
which makes it easier for them to fulfill the
annotation task. Furthermore, the students learn how
to work with a typical file format for tabular data,
a format that is often used for data in corpus
linguistics4. When solving the exercise they learn
about various aspects connected to this file format,
e.g. handling different kinds of text delimiters like
tab or comma, and choosing the right encoding
when opening text files which contain non-ASCII
characters like German umlauts.</p>
      <p>
        When classifying the examples, the students are
asked to use a tagset. For the created exercise, we
use a simplified version of the Stuttgart-Tu¨bingen
Tagset (STTS)
        <xref ref-type="bibr" rid="ref18">(Schiller et al., 1999)</xref>
        , which is a
defacto standard for PoS tagging for German
        <xref ref-type="bibr" rid="ref23">(Zinsmeister et al., 2014)</xref>
        . It is for example used in one
of the annotation layers in the German Reference
      </p>
      <sec id="sec-3-1">
        <title>3https://www.libreoffice.org/</title>
        <p>4Compare the various variants of the CoNLL formats for
example the CoNLL-U format used by the universal
dependencies project (http://universaldependencies.
org/format.html).</p>
        <p>
          Corpus (DeReKO)
          <xref ref-type="bibr" rid="ref2">(Belica et al., 2009)</xref>
          , which is an
important resource for the study of contemporary
German.
        </p>
        <p>On a more conceptual level, the method
introduces the concept of tagsets and annotation
guidelines to the students. While the introduction of
PoS in linguistics often emphasizes the
theoretical aspects of the categories (e.g. morphological
vs. syntactic criteria), annotation guidelines focus
on distinguishing the categories in real texts. These
different approaches complement each other and
help the students to understand the different parts
of speech.</p>
        <p>
          The students use a slightly modified version of
the original STTS as a tagset: Some categories,
e.g. different kinds of particles, were converted into
broader classes, since they are not being
distinguished in most of the introduction literature to
German linguistics
          <xref ref-type="bibr" rid="ref15 ref16 ref23 ref8">(Linke et al., 2004; Meibauer
et al., 2007; Busch and Stenschke, 2014)</xref>
          or
syntax
          <xref ref-type="bibr" rid="ref11 ref17">(Du¨rscheid, 2012; Pittner and Berman, 2015)</xref>
          .
Some tags like for instance “FM” for
foreignlanguage material were omitted entirely. On the
other hand, we extended STTS with categories that
are usually distinguished in courses for German
linguistics; we split for example the category “article”
(“ART”) into “definite article” (“ARTDEF”) and
“indefinite article” (“ARTINDEF”).
        </p>
        <p>The exercise sentences and the categorization
of the tokens itself have been created on the basis
of the above mentioned introductions to German
syntax. As such, the exercise should be usable
in a wide range of courses for German linguistics.
However, changing the exercise to the needs of a
specific course is as easy as creating a new tsv-file
and – if necessary – adapting the description of
the tagset which is presented as a html-file created
from Markdown, a simple markup language5.</p>
        <p>Solving the exercises digitally by creating
tsvfiles allows to facilitate the correction and
evaluation of the students’ solutions by automatically
comparing them to the sample solution. In order
to make this possible for lecturers who themselves
do not have a background in corpus linguistics,
we created a simple tool. It is written in Java and
should therefore run on all common platforms. The
tool reads in a sample solution and a folder
containing the students’ solutions. It then compares
each solution to the sample solution and creates
5https://daringfireball.net/projects/
markdown/syntax
a version where deviations from the sample
solution are marked and the correct answer is
provided for each of the students’ solution. To help
the lecturer getting an insight on frequent mistakes,
i.e. problematic categories, the tool also outputs
helpful statistics (e.g. mean number of errors, most
frequent mistakes and a confusion matrix) as is
illustrated in Fig. 1.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>The syntax exercise</title>
      <p>The syntax exercise focuses on phrase types
(e.g. nominal phrase or prepositional phrase) and
syntactic functions (subject, object etc.) which
represent the basic inventory of a wide rage of
linguistic theories. As annotation tool for the exercise we
chose Synpathy6, a tool for manual syntactic
annotation. Even though Synpathy is rather old and not
under active development anymore, there are three
reasons why we preferred it to other programs:</p>
      <p>
        Firstly, current tools that allow syntactic
annotation, e.g. Arborator
        <xref ref-type="bibr" rid="ref12">(Gerdes, 2013)</xref>
        and WebAnno
        <xref ref-type="bibr" rid="ref21">(Yimam et al., 2016)</xref>
        , use dependencies, a
theoretical framework which only plays a secondary
role in German linguistics. Since Synpathy allows
to annotate constituents with labeled and crossing
edges it is suitable for the kind of syntactic analysis
that we find in most of the established introduction
literature
        <xref ref-type="bibr" rid="ref11 ref17">(Du¨rscheid, 2012; Pittner and Berman,
2015, among others)</xref>
        . The presented syntax
exercise does not aim at presenting analyses based on a
specific theoretical framework like dependency or
phrase-structure but to introduce general categories
like phrase types and syntactic functions. It is also
possible to include other categories like topological
fields using Synpathy, cf. the annotation guidelines
for Tu¨ba-D/Z
        <xref ref-type="bibr" rid="ref20">(Telljohann et al., 2015)</xref>
        where
topological fields are annotated.
      </p>
      <p>
        Secondly, Synpathy is relatively easy to use
especially when compared to general purpose
annotation tools like GATE
        <xref ref-type="bibr" rid="ref9">(Cunningham et al., 2011)</xref>
        and
Atomic
        <xref ref-type="bibr" rid="ref10">(Druskat et al., 2014)</xref>
        which could also be
used to annotate constituency trees but are harder
to learn due to their general nature and therefore
are not suitable for the purpose of LEA.
      </p>
      <p>Thirdly, Synpathy saves the annotation in the
Tiger-XML-format7, which is well supported by
other tools. There are for example importers for</p>
      <sec id="sec-4-1">
        <title>6https://tla.mpi.nl/tools/tla-tools/</title>
        <p>older-tools/synpathy/</p>
        <p>
          7http://www.ims.uni-stuttgart.
de/forschung/ressourcen/werkzeuge/
TIGERSearch/doc/html/TigerXML.html
the corpus search tool ANNIS
          <xref ref-type="bibr" rid="ref14">(Krause and Zeldes,
2016)</xref>
          . Thus, students can use Synpathy to create
syntactically annotated data for their own studies
that can be searched and visualized with ANNIS.8
        </p>
        <p>We want to highlight a didactic aspect of using
Synpathy to solve the syntax exercises in
particular: Students often have problems
distinguishing between parts of speech, syntactic functions
and phrase types. By using Synpathy, they are
confronted with a visualization of an analysis that
clearly distinguishes the three types of categories,
cf. Fig. 2 which illustrates how the sentence Er
feierte gestern auf dem Kiez seinen runden
Geburtstag. (Engl.: ‘He celebrated his big birthday on the
Kiez yesterday’) would be annotated. While the
phrase types are represented as labels of oval nodes,
the syntactic functions are edge labels that are
visualized in rectangles. The PoS tags are shown
below the tokens. This helps students to
differentiate between those category types. Note that
the analyis in Fig. 2 is based on the simplified
version of STTS created for the exercise. AP stands
for Adposition. This tag corresponds to the more
specific tags APPR for Preposition and APPO for
Postposition in STTS. The two other tags from
STTS that are subordinated under AP – APPRART
for preposition-article contractions and APZR for
the right part of circumpositions – do not have a
counterpart in our simplified tagset as these special
cases do not appear in the exercise.</p>
        <p>For the automatic correction, we extended
Synpathy with a function to compare a set of
TigerXML files (the students’ solutions) with the
currently opened file (the sample solution).9 The
com8However, getting the data into ANNIS is not trivial and
would have to be taught in a later course.</p>
        <p>9This extended version of Synpathy can be found at
https://github.com/fab-bar/Synpathy.
parison of the trees is done using the algorithm
given in Brants and Skut (1998). The results are
presented in tabular form, giving an overview over
wrongly analyzed sentences.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Experiences with LEA</title>
      <p>In order to assess the didactic value and the
usability of the two LEA packages we conducted
surveys in three introductory courses for German
linguistics during summer term 2016 and again in
three courses during winter term 2016/2017 using
LimeSurvey10. The feedback was used for
improvements of the packages.</p>
      <p>The overall reaction to LEA was positive and
the packages are still in use in the current summer
term. The survey indicates that working with the
annotation guidelines contributed to a better
understanding of the linguistic concepts as described in
the previous sections: when asked whether STTS
was helpful for understanding parts of speech, all
of the students agreed.</p>
      <p>Furthermore, the testing phase of LEA revealed
some problems regarding the usage of the
annotation tools: with the first version many students were
not able to open the exercise and solve it properly.
For example, the Tiger-XML files used by
Synpathy were opened in the browser when the students
followed the link to the file in the manual. Many
students did not know how to save the file to disk
and open it with Synpathy. To circumvent this
problem, we packed the exercise file into a zip-archive
which does not open in the browser. A different
type of problem appeared when the
tab-separatedvalue files were opened with Excel directly and not
via the option to import the data “from text” since
this leads Excel to expect “;” as separator resulting
10https://www.limesurvey.org/de/
in a corrupted file (the first column contains the tab
used as separator). Even though the students were
able to solve the exercise in this case, the automatic
evaluation did not work.</p>
      <p>These observations confirm the lack of
technical know how as described by Bubenhofer (2011)
and at the same time stress the need for didactic
concepts like LEA.
6</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion and further work</title>
      <p>With LEA we present both a collection of
elearning packages and an underlying didactic
concept which helps students to become familiar with
annotation tools and methods from corpus
linguistics. In this paper, we described two exercises for
syntax related categories that introduce tsv-files for
parts of speech and Synpathy as an annotation tool
for phrase types and syntactic functions. They are
currently used in introductory courses for German
linguistics at Universita¨t Hamburg and Heinrich
Heine University Du¨sseldorf and under active
development in the sense that we integrate solutions
to problems that are encountered.</p>
      <p>
        In the future, we are planning to extend LEA
with new exercises. While the current packages
focus on syntax, exercises for a wider range of
topics are possible, e.g. annotation of semantic roles
with SALTO
        <xref ref-type="bibr" rid="ref7">(Burchardt et al., 2006)</xref>
        , spoken
language with EXMARaLDA
        <xref ref-type="bibr" rid="ref19 ref2">(Schmidt and Wo¨rner,
2009)</xref>
        or phonetic analysis with Praat
        <xref ref-type="bibr" rid="ref4">(Boersma
and Weenink, 2013)</xref>
        .
      </p>
      <p>While these programs – similar to spreadsheet
applications and Synpathy used for the presented
exercises – are all specialized standalone tools,
many currently developed tools are web-based,
e.g. WebAnno. Web-based tools have the
advantage that annotators do not need to install the
software and annotation tasks may be prepared for
them, e.g. in form of a project with the annotation
categories needed as well as the data which is
supposed to be annotated. In this case, the students
only need to learn how to annotate with the tool.
It makes the task easier, because they do not have
to work with tsv- or xml-files. However, this way
the students will not learn how to deal with such
file formats – one of the learning targets of the
presented LEA exercises. Without this knowledge
students will always depend on the availability of
the data and tools in a web-based application. Yet,
at some point they will need to import or export
data, e.g. for further analysis of the annotated data
using spreadsheet software. Therefore, we also
plan to create exercises where the students will
have to prepare and import data into a web-based
tool like WebAnno themselves.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>The creation of the presented LEA packages were
funded by the “Innovationsfonds fu¨r Studium
und Lehre” (Universita¨t Hamburg, Fakulta¨t fu¨r
Geisteswissenschaften) and assisted by Christiane
Ho¨ltmann and Alexandra Lindt.</p>
      <p>We would like to thank Melanie Andresen,
Yvonne Hettler, Eleonore Schmitt, Renata
Szczepaniak, Annika Vieregge and Heike Zinsmeister for
testing LEA in their courses and all the students
who supplied feedback. We also thank the
anonymous reviewers for their helpful remarks.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Michael</given-names>
            <surname>Beißwenger</surname>
          </string-name>
          and
          <string-name>
            <given-names>Angelika</given-names>
            <surname>Storrer</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Digitale Sprachressourcen in Lehramtsstudienga¨ngen: Kompetenzen - Erfahrungen - Desiderate</article-title>
          .
          <source>Journal for Language Technology and Computational Linguistics (JLCL)</source>
          ,
          <volume>26</volume>
          (
          <issue>1</issue>
          ):
          <fpage>119</fpage>
          -
          <lpage>139</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Cyril</given-names>
            <surname>Belica</surname>
          </string-name>
          , Marc Kupietz, Andreas Witt, and Harald Lu¨ngen.
          <year>2009</year>
          .
          <article-title>The morphosyntactic annotation of DeReKo: Interpretation, opportunities, and pitfalls</article-title>
          . In Marek Konopka, Jacqueline Kubczak, Christian Mair,
          <article-title>Frantisˇek Sˇ ticha</article-title>
          , and Ulrich H. Waßner, editors,
          <source>Grammatik und Korpora</source>
          <year>2009</year>
          .
          <article-title>Dritte Internationale Konferenz</article-title>
          . Mannheim,
          <volume>22</volume>
          .-
          <fpage>24</fpage>
          .9.
          <year>2009</year>
          ,
          <article-title>number 1 in Korpuslinguistik und interdisziplina¨re Perspektiven auf Sprache (CLIP)</article-title>
          . Narr, Tu¨bingen.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Emily M. Bender</surname>
            and
            <given-names>Jeff</given-names>
          </string-name>
          <string-name>
            <surname>Good</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>A grand challenge for linguistics: Scaling up and integrating models. White paper contributed to NSF's SBE 2020 initiative</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Paul</given-names>
            <surname>Boersma</surname>
          </string-name>
          and
          <string-name>
            <given-names>David</given-names>
            <surname>Weenink</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Praat: doing phonetics by computer</article-title>
          . http://www.praat. org/.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Thorsten</given-names>
            <surname>Brants</surname>
          </string-name>
          and
          <string-name>
            <given-names>Wojciech</given-names>
            <surname>Skut</surname>
          </string-name>
          .
          <year>1998</year>
          .
          <article-title>Automation of treebank annotation</article-title>
          .
          <source>In Proceedings of the Joint Conferences on New Methods in Language Processing and Computational Natural Language Learning</source>
          , pages
          <fpage>49</fpage>
          -
          <lpage>57</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Noah</given-names>
            <surname>Bubenhofer</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Korpuslinguistik in der linguistischen Lehre: Erfolge und Misserfolge</article-title>
          .
          <source>Journal for Language Technology and Computational Linguistics (JLCL)</source>
          ,
          <volume>26</volume>
          (
          <issue>1</issue>
          ):
          <fpage>141</fpage>
          -
          <lpage>156</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Aljoscha</given-names>
            <surname>Burchardt</surname>
          </string-name>
          , Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Pado´, and
          <string-name>
            <given-names>Manfred</given-names>
            <surname>Pinkal</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>SALTO - a versatile multi-level annotation tool</article-title>
          .
          <source>In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC2006).</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Albert</given-names>
            <surname>Busch</surname>
          </string-name>
          and
          <string-name>
            <given-names>Oliver</given-names>
            <surname>Stenschke</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Germanistische Linguistik: Eine Einfu¨hrung. Narr, Tu¨bingen, 3rd edition</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>Hamish</given-names>
            <surname>Cunningham</surname>
          </string-name>
          , Diana Maynard, Kalina Bontcheva, Valentin Tablan, Niraj Aswani, Ian Roberts, Genevieve Gorrell, Adam Funk, Angus Roberts, Danica Damljanovic, Thomas Heitz,
          <string-name>
            <given-names>Mark A.</given-names>
            <surname>Greenwood</surname>
          </string-name>
          , Horacio Saggion, Johann Petrak,
          <string-name>
            <given-names>Yaoyong</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and Wim</given-names>
            <surname>Peters</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Text Processing with GATE (Version 6</article-title>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Stephan</given-names>
            <surname>Druskat</surname>
          </string-name>
          , Lennart Bierkandt, Volker Gast, Christoph Rzymski, and
          <string-name>
            <given-names>Florian</given-names>
            <surname>Zipser</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Atomic: An open-source software platform for multi-level corpus annotation</article-title>
          .
          <source>In Proceedings of the 12th Konferenz zur Verarbeitung natu¨rlicher Sprache (KONVENS</source>
          <year>2014</year>
          ), pages
          <fpage>228</fpage>
          -
          <lpage>234</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Christa</given-names>
            <surname>Du</surname>
          </string-name>
          ¨rscheid.
          <year>2012</year>
          .
          <article-title>Syntax: Grundlagen und Theorien</article-title>
          .
          <source>Vandenhoeck &amp; Ruprecht</source>
          , Go¨ttingen et al.,
          <source>6th edition.</source>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Kim</given-names>
            <surname>Gerdes</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Collaborative Dependency Annotation</article-title>
          .
          <source>In Proceedings of the Second International Conference on Dependency Linguistics (DepLing</source>
          <year>2013</year>
          ), pages
          <fpage>88</fpage>
          -
          <lpage>97</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Stefan</given-names>
            <surname>Th. Gries</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>What is corpus linguistics? Language and</article-title>
          Linguistics Compass,
          <volume>3</volume>
          (
          <issue>5</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>17</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Thomas</given-names>
            <surname>Krause</surname>
          </string-name>
          and
          <string-name>
            <given-names>Amir</given-names>
            <surname>Zeldes</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>ANNIS3: A new architecture for generic corpus query and visualization</article-title>
          .
          <source>Literary and Linguistic Computing</source>
          ,
          <volume>31</volume>
          (
          <issue>1</issue>
          ):
          <fpage>118</fpage>
          -
          <lpage>139</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>Angelika</given-names>
            <surname>Linke</surname>
          </string-name>
          , Markus Nussbaumer, and
          <string-name>
            <given-names>Paul R.</given-names>
            <surname>Portmann</surname>
          </string-name>
          .
          <year>2004</year>
          .
          <string-name>
            <given-names>Studienbuch</given-names>
            <surname>Linguistik</surname>
          </string-name>
          .
          <source>Number 121 in Reihe Germanistische Linguistik. Niemeyer, Tu¨bingen, 5th edition.</source>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <article-title>Jo¨rg Meibauer, Ulrike Demske</article-title>
          , Jochen Geilfuß- Wolfgang, Ju¨rgen Pafel, Karl Heinz Ramers, Monika Rothweiler, and
          <string-name>
            <given-names>Markus</given-names>
            <surname>Steinbach</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>Einfu¨hrung in die germanistische Linguistik. Metzler, Stuttgart, 2nd edition</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>Karin</given-names>
            <surname>Pittner</surname>
          </string-name>
          and
          <string-name>
            <given-names>Judith</given-names>
            <surname>Berman</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Deutsche Syntax: Ein Arbeitsbuch. Narr, Tu¨bingen, 6th edition</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>A.</given-names>
            <surname>Schiller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Teufel</surname>
          </string-name>
          , C. Sto¨ckert, and
          <string-name>
            <given-names>C.</given-names>
            <surname>Thielen</surname>
          </string-name>
          .
          <year>1999</year>
          .
          <article-title>Guidelines fu¨r das Tagging deutscher Textcorpora mit STTS</article-title>
          .
          <source>Technical report, Universities of Stuttgart und Tu¨bingen.</source>
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <given-names>Thomas</given-names>
            <surname>Schmidt</surname>
          </string-name>
          and Kai Wo¨rner.
          <year>2009</year>
          .
          <article-title>EXMARaLDA - creating, analysing and sharing spoken language corpora for pragmatic research</article-title>
          . Pragmatics,
          <volume>19</volume>
          (
          <issue>4</issue>
          ):
          <fpage>565</fpage>
          -
          <lpage>582</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <given-names>Heike</given-names>
            <surname>Telljohann</surname>
          </string-name>
          , Erhard W. Hinrichs, Sandra Ku¨bler, Heike Zinsmeister, and
          <string-name>
            <given-names>Kathrin</given-names>
            <surname>Beck</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Stylebook for the Tu¨bingen treebank of written German (Tu¨Ba-D/Z)</article-title>
          .
          <source>Technical report, Seminar fu¨r Sprachwissenschaft</source>
          , University of Tu¨bingen.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <given-names>Seid</given-names>
            <surname>Muhie</surname>
          </string-name>
          <string-name>
            <surname>Yimam</surname>
          </string-name>
          , Silvana Hartmann, Iryna Gurevych, Anette Frank, and
          <string-name>
            <given-names>Chris</given-names>
            <surname>Biemann</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>A web-based tool for the integrated annotation of semantic and syntactic structures</article-title>
          .
          <source>In Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH)</source>
          , pages
          <fpage>76</fpage>
          -
          <lpage>84</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <given-names>Amir</given-names>
            <surname>Zeldes</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>The GUM corpus: creating multilayer resources in the classroom</article-title>
          .
          <source>Language Resources and Evaluation</source>
          ,
          <volume>51</volume>
          (
          <issue>3</issue>
          ):
          <fpage>581</fpage>
          -
          <lpage>612</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <given-names>Heike</given-names>
            <surname>Zinsmeister</surname>
          </string-name>
          , Ulrich Heid, and
          <string-name>
            <given-names>Kathrin</given-names>
            <surname>Beck</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Adapting a part-of-speech tagset to nonstandard text: The case of STTS</article-title>
          .
          <source>In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC14)</source>
          , pages
          <fpage>4097</fpage>
          -
          <lpage>4104</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>