=Paper=
{{Paper
|id=Vol-1704/paper12
|storemode=property
|title=Visual Development & Analysis of Coreference Resolution Systems with CORVIDAE
|pdfUrl=https://ceur-ws.org/Vol-1704/paper12.pdf
|volume=Vol-1704
|authors=Nico Möller,Gunther Heidemann
|dblpUrl=https://dblp.org/rec/conf/semweb/MollerH16
}}
==Visual Development & Analysis of Coreference Resolution Systems with CORVIDAE==
<pdf width="1500px">https://ceur-ws.org/Vol-1704/paper12.pdf</pdf>
<pre>
       Visual Development & Analysis of Coreference
           Resolution Systems with CORVIDAE

                         Nico Möller1 and Gunther Heidemann1

     Institute of Cognitive Science, University of Osnabrück, 49069 Osnabrück, Germany


       Abstract. Communication whether in verbal or written form is part of our daily
       life. Hence, we as humans have developed a set of skills that enable us to fol-
       low a discourse and extract important information from a text quite easily. For
       a machine however, language understanding is a quite challenging problem and
       considered to be AI-complete, i.e. a machine must reach human level intelligence
       in order to solve this task. Recent developments, especially those forming the se-
       mantic web, offer new ways of incorporating world knowledge into natural lan-
       guage processing methods. In this paper, we present our latest advancements on
       CORVIDAE (Coreference Resolution Visual Development & Analysis Environ-
       ment), a tool for NLP developers to analyse and eventually improve coreference
       resolution algorithms specially designed for those that interact with world knowl-
       edge.


1   Introduction
Coreference resolution (CR) is a subtask of information extraction and describes the
task of identifying all mentions in a given document and group those together that
refer to the same entity [20]. CR is one of the core tasks in information extraction,
making it a necessary preprocessing step before other algorithms can be applied. It
has been an active field of research since the 1960s. Whereas research in the early
years of CR was dominated by heuristic approaches built on computational theories
of discourse [5, 6, 27], methods on based machine learning became more and more
popular due to the broader availability and increased processing power of computers
in the 1990s. Most common methods are based on supervised learning, using string
matching, syntactic, grammatical or semantic features on those mentions. Observing
the course of development in this field, a trend becomes visible that starts with local
features [1, 17] and goes on towards more global models [15, 24]. The next logical step
would be to go beyond global features, i.e. incorporating pieces of information that are
not in the document, but can help to solve this task. This includes semantic relatedness
features extracted from knowledge bases like WordNet, Wikipedia or YAGO that already
have proven to be valuable additions [22, 25]. Additionally, there have been approaches
solving subtasks in information extraction like coreference resolution and named entity
linking in a joint fashion rather than separately [14, 9]. An elaborated error analysis of
the state-of-the-art Stanford CoreNLP system has shown that 41.7% of errors can be
attributed to the lack of background knowledge of the system [12]. Another motivation
behind this shift can be found in the increased interest in information extraction and
analysis in the recent years. Besides Big Data, another keyword that kept appearing in


                                             128
Visual Development & Analysis of Coreference Resolution Systems with CORVIDAE

 the recent years is the one of the semantic web [3]. The goal of the semantic web is
 to increase the exchangeability of data as well as its usability. Web documents should
 be tagged with additional information that set a context for this document creating a
 machine-readable knowledge-graph that contains information about persons, organisa-
 tions, places or events mentioned in the text as well as their connections to other entities.
 Without proper background knowledge, it is impossible to integrate extracted informa-
 tion correctly into an existing knowledge base. Taking the outlook on data production1
 into consideration as well as the fact that only 4 out of 175 million active domains [7]
 use the semantic mark-up on their websites2 , it seems a good idea to work on increas-
 ing the quality of coreference resolution systems as these play a crucial role in solving
 problems currently encountered in Big Data analysis and fulfilling the dream of the
 semantic web.


 2    Related Work

 Tools for visualising coreference annotation data can roughly be divided into two groups.
 The first group of tools focuses on the annotation itself with the aim of creating data
 that can be used as training input for NLP algorithms like coreference resolution. Most
 popular along these are MMAX2 [19], PAlinkA [21] and BRAT [28]. However, those are
 mainly text-based with only a very limited capacity of visualising data besides a few vi-
 sual cues like highlighting mention groups or showing links between mentions. Another
 way of visualising coreference data was introduced by the TrEd annotator using trees to
 visualise coreference as well as other tree based annotations [8]. The SUCRE project in
 contrast, utilised self-organising maps to visualise coreference features [4]. Addition-
 ally, human annotators should be provided with suggestions for possible coreferences
 in a semi-supervised fashion to speed up the annotation process.
      Exploring already annotated data can be done with those tools, but due to their
 intended purpose, they lack important features that are needed for error analysis. Crucial
 would be the capability of comparing a data set against a gold standard annotation.
      Tools that focus on the NLP developer, on the other hand, are quite rare. A widely
 used toolkit for error analysis in coreference systems is that of Kummerfeld & Klein [11].
 Their approach utilised transformation operations to automatically categorise errors in
 the output of coreference systems, but also lacks any functionality to visualise their
 results. Kuhn et al.[10] presented the ICARUS Coreference Explorer (ICE). Specially
 designed to provide tools for visualisation, search and error analysis for coreference
 annotations. Besides a tree view similar to TrEd, it utilised the entity grid [2], a tabular
 view of entities in a document to give both a summed up view of mentioned entities
 as well as show changes of entity descriptions throughout the document. ICE however,
 is focused on the links between pairs of links, neglecting global features on groups of
 mentions and features beyond that. Complementing those is the tool from Martschat et
 al. [16], which provides a text-based visualisation similar to BRAT. Although the func-
 tionality to add world knowledge is mentioned, the system is not yet suitable to handle
  1
    According to a recent study conducted by EMC as part of their Digital Universe Series, humanity is currently producing
    about 4.4 Zettabytes of data, which will tenfold by 2020
  2
    http://news.netcraft.com/archives/2015/10/16/october-2015-web-server-survey.html


                                                          129
Visual Development & Analysis of Coreference Resolution Systems with CORVIDAE

 analysis on the output of cross-document coreference resolution or entity linking sys-
 tems. To solve those problems we created CORVIDAE a tool for the visual analysis and
 development of coreference resolution systems that incorporate world knowledge.


 3     CORVIDAE

 CORVIDAE is a web-based application. The backend is written in Scala3 , built upon
 the the Play web application framework4 . HTML5 and JavaScript are the foundations
 for the frontend, which uses the BRAT library 5 as well as the D3.js JavaScript library6
 for interactive visualizations. For more details on the intended workflow with the ap-
 plication and interactions with existing CR systems [23, 13, 14, 9] have a look at our
 intitial presentation of CORVIDAE [18].
      In the following subsections we present a few new and improved visualisation
 modes that focus on different parts of the error analysis. CORVIDAE follows Shnei-
 derman’s mantra[26], to provide the user with an overlook of the systems strengths and
 weaknesses first and details for an in-depth analysis later on. All of these visualisation
 modes are types of circular layouts, a compact drawing style for information visualisa-
 tion that is especially popular in the area of bioinformatics.


 3.1    Radial Sequence Diagrams

 The radial sequence diagram is one of the core elements of CORVIDAE has undergone
 a few changes and additions since the last revision. As already pointed out before this
 visualisation mode is quite versatile and hence can be used in many different ways. It
 marks the entry point for almost any system analysis the NLP developer might want
 to perform, as it can be configured to give a quick overview on the most crucial error
 measures at one glance. Originally used to compare genome sequences, we utilise this
 technique to quickly compare a broad variety of results. We can use it to:

     – visualise and compare different error metrics for one or more documents or system
       configurations,
     – compare annotation results gained by different configurations for a single corefer-
       ence system or results from different systems on a single document,
     – compare different documents to find out if they get linked to the same entities,
     – analyse the propagation of errors in the multi-sieve model level wise.

 Another big advantage of this view mode is that due to its compact design it allows
 not only to compare two results to one another but even multiple ones. This feature is
 especially helpful when we enter the area of cross-document coreference resolution,
 that cannot be handled properly by the solutions presented in section 2. Figures 1 and
 2 show two usage examples for the radial sequence diagram.
  3
    http://www.scala-lang.org/
  4
    https://www.playframework.com/
  5
    http://brat.nlplab.org/embed.html
  6
    https://d3js.org/


                                            130
Visual Development & Analysis of Coreference Resolution Systems with CORVIDAE


 Fig. 1. Error overview for different system Fig. 2. A radial sequence diagram, comparing
 configurations. Outer ring shows the num- the annotation (outer sequence, found men-
 ber of possible errors, clustered by mention tions, color coded according to cluster mem-
 type and weighted by appearance in docu- bership) from two CR system configurations
 ment. The inner rings show the error sum- (inner sequence) against a gold standard an-
 maries for three different configurations. The notation. The inner sequence are color coded
 order of the inner rings is also sortable, if the to show correct (light green), wrongly as-
 user wishes to focus on the worst/best perfor- signed(red), extra (yellow) and missed men-
 mance for a certain error category. It is also tions (orange). In a similar fashion this view
 possible split the inner rings, in case the user can be used to check the results from an entity
 wants to investigate subcategories.               linking module.


 Fig. 3. A radial network diagram, showing
                                                 Fig. 4. A radial directed graph diagram, show-
 links between different mentions within a
                                                 ing entities found within a document. Addi-
 text. Links are color coded according to clus-
                                                 tional links and entities can be provided by a
 ter membership. This view supports high-
                                                 gold standard annotation as well as querying
 lighting, filtering and sorting, to enable a
                                                 a linked knowledge base. Size corresponds to
 more detailed analysis of a certain error type,
                                                 the number of in and outgoing links.
 e.g. by analysing a single coreference chain.


                                                131
Visual Development & Analysis of Coreference Resolution Systems with CORVIDAE

 3.2     Radial Network Diagram

 Figure 3 depicts an example of a radial network diagram, a of visualisation primarily
 used to display coreference chains throughout a document. Shown on the outer rim are
 the found mentions within a document, currently in the order of appearance within the
 document. Entity clusters are depicted by colour coding. Arcs connecting two mentions
 indicate a coreference between them. The mentions can also be sorted and split into
 their corresponding entity clusters for further inspection of the individual mentions.
 As mentioned before the D3.js library allows for interactivity, henceforth the visualisa-
 tions allow for highlighting via hovering or filtering via queries, as well as displaying
 additional information like the linked real world entity when selecting an entity clus-
 ter. Theses functionality is quite essential to simplify the otherwise dense and complex
 visualisation and isolate single coreference chains the NLP developer wants to analyse.
      In order to compare two annotation results or one against a gold standard, a dif-
 ferential view can be computed, highlighting differences in found mentions and links,
 while fading out the rest, which allows for an easy spotting errors.


 3.3     Radial Directed Graph Diagram

 The radial directed graph diagram has been incorporated in two different modes.
     Mention centred: This mode allows for the visualisation of tree based coreference
 annotations similar to the ones found in TrEd or ICE, but instead of a triangular layout
 we are using a radial one, which allows for a much more compact and cleaner repre-
 sentation. Originating from the inner document root, nodes in the tree correspond to
 mentions in the text, whereas links indicate coreference between those. Each branch
 from the root node corresponds to a cluster representing an entity.
     Entity centred: The second mode is concerned with the visualisation of semantic
 background knowledge.
     It can visualise information extracted from the documents itself, but is not solely
 restricted to it. Named entity linking usually uses a knowledge base that serves as an
 anchor. These can be exploited to provide additional context for the NLP developer, as
 well as to evaluate and compare extracted information against the knowledge base. The
 colour of the links indicates that no (yellow/orange), supporting (green) or contradicting
 (red) information has been found in the knowledge base. For example if a was-born-in
 relation between entity a and entity b is mentioned within a sentence and this fact can be
 found in our database the link would be green, if no relation can be found the link would
 be orange 7 and red in case contradicting information has been found. The size of the
 dots corresponds to the number of in and outgoing edges. A simple example showing
 this can be found in figure 4.
     The same technique as mentioned in section 3.2 can also be used on this type of
 visualisation. Mention centred this view allows to compare different sets of annotations
 for one document, whereas the entity centred view can be used to explore results of one
 cross-document coreference resolution system over two documents.
  7
      if the relation comes from the gold annotation it would be yellow instead


                                                 132
Visual Development & Analysis of Coreference Resolution Systems with CORVIDAE

 4      Conclusions and Future Work
 In this paper we our latest updates on CORVIDAE a tool designed for NLP developers
 for the visual error analysis of coreference systems. This tool offers a variety of circular
 visualisations to display coreference annotation data, which will help to analyse and de-
 bug cross-document coreference resolution algorithms. In its current state CORVIDAE
 supports three different circular visualisations, namely:
     – radial sequence diagrams,
     – radial network diagrams,
     – radial directed graph diagrams.
 Each is intended to support the NLP developer in tracking down, isolating and locating
 errors caused by the CR system. All of these visualisations are interactive and highly
 customizable, making it easy for the user to adapt the system to his needs. As a starting
 point for our analysis, we choose the state-of-the-art CoreNLP CR system, but CORVI-
 DAE can easily be extended to support other systems as well. The next steps will be an
 extensive analysis of the joint systems mentioned in 2, to further investigate the inter-
 action between named entity linking and coreference resolution with world knowledge
 and elaborate how this can be exploited to boost the performance in both. More Infor-
 mation on CORVIDAE as well as demos will be made available on a separate project
 website in the near future8 .


 References
     [1] C. Aone and S. W. Bennett. Evaluating automated and manual acquisition of anaphora
         resolution strategies. In Proceedings of ACL 1995, pages 122–129. Association for Com-
         putational Linguistics, 1995.
     [2] R. Barzilay and M. Lapata. Modeling local coherence: An entity-based approach. Compu-
         tational Linguistics, 34(1):1–34, 2008.
     [3] T. Berners-Lee, J. Hendler, and O. Lassila. The semantic web. Scientific American,
         284(5):34–43, 5 2001.
     [4] A. Burkovski, W. Kessler, G. Heidemann, H. Kobdani, and H. Schütze. Self organizing
         maps in nlp: Exploration of coreference feature space. In Proceedings of the 8th Inter-
         national Conference on Advances in Self-organizing Maps, WSOM’11, pages 228–237,
         Berlin, Heidelberg, 2011. Springer-Verlag.
     [5] B. J. Grosz. The representation and use of focus in a system for understanding dialogs. In
         Proceedings of IJCAI 1977 - Volume 1, IJCAI’77, pages 67–76, San Francisco, CA, USA,
         1977. Morgan Kaufmann Publishers Inc.
     [6] B. J. Grosz, A. K. Joshi, and S. Weinstein. Providing a unified account of definite noun
         phrases in discourse. In Proceedings of ACL 1983, pages 44–50. Association for Compu-
         tational Linguistics, 1983.
     [7] R. V. Guha. Light at the end of the tunnel. Talk at the 12th International Semantic Web
         Conference (ISWC), Sydney, 10 2013.
     [8] J. Hajic, B. Vidová-Hladká, and P. Pajas. The prague dependency treebank: Annotation
         structure and support. In Proceedings of the IRCS Workshop on Linguistic Databases,
         pages 105–114, 2001.
  8
      https://ikw.uos.de/%7Ecv/publications/VOILA16


                                                133
Visual Development & Analysis of Coreference Resolution Systems with CORVIDAE

  [9] H. Hajishirzi, L. Zilles, D. S. Weld, and L. S. Zettlemoyer. Joint coreference resolution and
      named-entity linking with multi-pass sieves. In EMNLP, pages 289–299. ACL, 2013.
 [10] J. Kuhn, M. Gärtner, A. Björkelund, G. Thiele, and W. Seeker. Visualization, search, and
      error analysis for coreference annotations. ACL 2014, page 7, 2014.
 [11] J. K. Kummerfeld and D. Klein. Error-driven analysis of challenges in coreference resolu-
      tion. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language
      Processing, pages 265–277, Seattle, Washington, USA, October 2013.
 [12] H. Lee, A. Chang, Y. Peirsman, N. Chambers, M. Surdeanu, and D. Jurafsky. Deterministic
      Coreference Resolution Based on Entity-Centric, Precision-Ranked Rules. Computational
      Linguistics, 39(4):885–916, dec 2013.
 [13] H. Lee, Y. Peirsman, A. Chang, N. Chambers, M. Surdeanu, and D. Jurafsky. Stanford’s
      multi-pass sieve coreference resolution system at the conll-2011 shared task. In Proceed-
      ings of the Fifteenth Conference on Computational Natural Language Learning: Shared
      Task, CONLL Shared Task ’11, pages 28–34, Stroudsburg, PA, USA, 2011. Association
      for Computational Linguistics.
 [14] H. Lee, M. Recasens, A. Chang, M. Surdeanu, and D. Jurafsky. Joint entity and event coref-
      erence resolution across documents. In Proceedings of EMNLP-CoNLL 2012, EMNLP-
      CoNLL ’12, pages 489–500, Stroudsburg, PA, USA, 2012. Association for Computational
      Linguistics.
 [15] X. Luo, A. Ittycheriah, H. Jing, N. Kambhatla, and S. Roukos. A mention-synchronous
      coreference resolution algorithm based on the bell tree. In Proceedings of ACL 2014, page
      135. Association for Computational Linguistics, 2004.
 [16] S. Martschat, T. Göckel, and M. Strube. Analyzing and visualizing coreference resolution
      errors. In Proceedings of the 2015 Conference of the North Americal Chapter of the Asso-
      ciation for Computational Linguistics: Demonstration Session, Denver, Col., 31 May – 5
      June 2015, pages 6–10, 2015.
 [17] J. F. McCarthy and W. G. Lehnert. Using decision trees for coreference resolution. In
      Proceedings of IJCAI 1995, pages 1050–1055, 1995.
 [18] N. Möller and G. Heidemann. Corvidae: Coreference resolution visual development &
      analysis environment. In Joint Proceedings of the Posters and Demos Track of 12th Inter-
      national Conference on Semantic Systems - SEMANTiCS2016, Posters&Demos
      @SEMANTiCS 2016, 2016.
 [19] C. Müller and M. Strube. Multi-level annotation of linguistic data with MMAX2. In
      S. Braun, K. Kohn, and J. Mukherjee, editors, Corpus Technology and Language Pedagogy:
      New Resources, New Tools, New Methods, pages 197–214. Peter Lang, Frankfurt a.M.,
      Germany, 2006.
 [20] V. Ng. Supervised Noun Phrase Coreference Research: The First Fifteen Years. Acl,
      (July):1396–1411, 2010.
 [21] C. Orăsan. PALinkA: a highly customizable tool for discourse annotation. In Proceedings
      of the 4th SIGdial Workshop on Discourse and Dialog, pages 39 – 43, Sapporo, Japan, July,
      5 -6 2003.
 [22] S. P. Ponzetto and M. Strube. Exploiting semantic role labeling, wordnet and wikipedia
      for coreference resolution. In Proceedings of the main conference on Human Language
      Technology Conference of the North American Chapter of the Association of Computational
      Linguistics, pages 192–199. Association for Computational Linguistics, 2006.
 [23] K. Raghunathan, H. Lee, S. Rangarajan, N. Chambers, M. Surdeanu, D. Jurafsky, and
      C. Manning. A multi-pass sieve for coreference resolution. In Proceedings of EMNLP
      2010, EMNLP ’10, pages 492–501, Stroudsburg, PA, USA, 2010. Association for Compu-
      tational Linguistics.


                                               134
Visual Development & Analysis of Coreference Resolution Systems with CORVIDAE

 [24] A. Rahman and V. Ng. Supervised models for coreference resolution. In Proceedings of
      EMNLP 2009: Volume 2-Volume 2, pages 968–977. Association for Computational Lin-
      guistics, 2009.
 [25] A. Rahman and V. Ng. Coreference resolution with world knowledge. In Proceedings of
      ACL 2011: Human Language Technologies - Volume 1, HLT ’11, pages 814–824, Strouds-
      burg, PA, USA, 2011. Association for Computational Linguistics.
 [26] B. Shneiderman. The eyes have it: A task by data type taxonomy for information visual-
      izations. In Visual Languages, 1996. Proceedings., IEEE Symposium on, pages 336–343.
      IEEE, 1996.
 [27] C. L. Sidner. Towards a computational theory of definite anaphora comprehension in en-
      glish discourse. Technical report, Cambridge, MA, USA, 1979.
 [28] P. Stenetorp, S. Pyysalo, G. Topić, T. Ohta, S. Ananiadou, and J. Tsujii. brat: a web-based
      tool for NLP-assisted text annotation. In Proceedings of the Demonstrations Session at
      EACL 2012, Avignon, France, April 2012. Association for Computational Linguistics.


                                              135

</pre>