<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Wiki-Enabled Semantic Data Mining - Task Design, Evaluation and Refinement</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Martin Atzmueller</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Florian Lemmerich</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jochen Reutelshoefer</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Frank Puppe</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Würzburg, Department of Computer Science</institution>
          ,
          <addr-line>Am Hubland, 97074 Würzburg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Complementing semantic data mining systems by wikis and especially semantic wikis yield a flexible knowledge-rich method. This paper describes a system architecture of a collaborative approach for semantic data mining. The goal is to enhance the design, evaluation and refinement of data mining tasks using semantic technology. Collaborative aspects are introduced by utilizing wiki technology. We present the components and describe their interaction and application in detail.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Wikis provide flexible ways for supporting the quick and simple creation, sharing and
management of content. Based upon the established wiki-technology, semantic wikis
(e.g., [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]) enhance this by providing enriched content and features. For example,
flexible inline queries and according results that are generated based on these dynamically
are such prominent features. While the queries and answers (results) can be flexibly
handled by the system, and can usually be formalized as textual content, the wiki
system also provides appropriate means for the persistent storage and management of the
generated content.
      </p>
      <p>Semantic data mining systems enable the inclusion of a large set of background
knowledge, for example, in order to access knowledge services, for selecting the
applied data mining methods, or for postprocessing the obtained data mining results. Thus,
integrating wikis is a convenient option for semantic data mining systems, since the
semantic core components can support the semantic mining features, while the wiki
component provides for a convenient front-end and user-management, enables the persistent
storage of queries and mining results, and supports their extended annotation.</p>
      <p>
        This paper presents a wiki-enabled approach for collaborative semantic data
mining: The semantic data mining system VIKAMINE [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] is combined with the semantic
JSPWiki (http://www.jspwiki.org) extension KnowWE [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. We describe the
interaction and exchange of query and results data, and the integration of semantic
information and knowledge.
      </p>
      <p>The rest of the paper is structured as follows: Section 2 describes the basics of
semantic data mining, provides an overview of the presented approach, and describes
its implementation. Section 3 concludes with a summary and interesting directions for
future work.</p>
    </sec>
    <sec id="sec-2">
      <title>Method</title>
      <p>This section briefly introduces the general semantic data mining approach. After that,
we first give a general overview, before we describe the architecture of the proposed
approach in detail. Finally, we discuss related work.
2.1</p>
      <sec id="sec-2-1">
        <title>Semantic Data Mining</title>
        <p>
          Semantic data mining can be considered as an approach utilizing formal methods and
techniques in order to explicitly integrate data semantics, background knowledge, or
reasoning in the mining process. The knowledge is typically represented in a
knowledge repository, such as an ontology, or a knowledge base. The main aspect of
semantic data mining is the explicit integration of this knowledge into the data mining and
knowledge discovery process, where the algorithms for data pre-processing, mining or
post-processing make use of the formalized knowledge to improve the overall process.
There has been growing interest in this issue, e.g., [
          <xref ref-type="bibr" rid="ref4 ref5 ref6">4–6</xref>
          ], in various domains, especially
in the medical domain [
          <xref ref-type="bibr" rid="ref4 ref7 ref8">4, 7, 8</xref>
          ].
        </p>
        <p>With the advent of the semantic web and standardized knowledge representations of
semantic web techniques, e.g., the web ontology language OWL, utilizing these
knowledge representation formalisms for data mining is a promising direction for task design,
evaluation and refinement, as discussed below. In the following, we outline the different
aspects of semantic data mining, and discuss their implications.</p>
        <p>
          The general data mining process can be structured along the CRISP-DM process
model (http://www.crisp-dm.org ) and consists of the following phases: (a)
Business understanding, i.e., understanding the application domain, (b) Data
Understanding, i.e., considering the (potential) objects of analysis, (c) Data Preparation, e.g.,
preprocessing and schema-matching of the data elements, (d) Modeling, e.g., given by
concrete mining sessions, (e) Evaluation, i.e., assessment of the mined models, (f)
Deployment, i.e., putting the extracted knowledge into action. The semantic data mining
approach integrates ontologies in each of the six steps [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. In the following, we
provide examples for each of the phases structured along the dimensions of task design,
evaluation, and refinement.
        </p>
        <p>
          – Task Design:
• In the Business Understanding phase ontologies help inexperienced users
getting accostumed to the domain, by structuring the relations between the
concepts, and explaining the concepts in terms of their properties.
• In the Data Understanding phase, important data elements (contained in the
ontology) need to be selected. Then, missing attributes, or redundant attributes
can be added or removed from the data set. This can be accomplished by a
datato-ontology mapping step [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] where the data elements are mapped to concepts
of the ontology, e.g., for integrating heterogenous data.
• The Data Preparation phase is strongly connected to the Modeling phase.
Depending on the latter, for example constraints on attributes or values can be
derived. This concerns constraints on the relations between the attributes, as
described in [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], for example, grouping constraints or exclusion constraints for
certain attribute groups that should not be considered. A further possible
inclusion of the ontology is given by a more abstract task composition phase, for
which the modeling phase can be hierarchically decomposed along the
generalization/specialization hierarchies modeled in the ontology. Then, more concise
results can potentially be obtained on lower levels, but for efficiency reasons
higher levels can be considered first and be used for filtering interesting
hypotheses in an earlier stage, cf., [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
– Task Evaluation:
• During the Evaluation phase (of CRISP-DM), the discovered patterns can
be interpreted and explained in a structured way using the concepts and/or
contained patterns. Various post-processing options are available at this point,
cf., [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. Specifically, due to the data-to-ontology mapping, the discovered
patterns can be matched to semantic relations or more complex relations between
these. Additionally, such knowledge provides a potential (explaining) context
for the discovered patterns. Furthermore, prior knowledge can be compared to
the patterns, e.g., for confirming known relations, identifying new knowledge,
and/or detecting exceptions and conflicts with formalized expectations.
Concerning possible explanations, causal relations can often help in this respect,
for validating and confirming discovered patterns, or for their analysis.
• The Deployment phase concerns the integration of the discovered models into
the business setting. It is easy to see, that for distributed processing and storage
(e.g., on the semantic web) a shared ontology is inevitable. This is especially
relevant for deploying results as semantic analytic reports (an extension of
analytic reports [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]), described below. In a late evaluation step, the
models/patterns can be tested during their practical application. In that case, the persistent
sessions stored in the wiki provide direct access in a collaborative manner.
– Task Refinement: The task refinement step is activated after the evaluation step
has been performed. It is accomplished either manually using the wiki system – by
modifying the textual task description, or by applying formalized knowledge with
respect to the applied data mining method. Then, parameters and/or the method
itself can be adapted. Refinement is performed according to the results of the
evaluation phase, so both steps are tightly coupled. Due to the application of the wiki,
different persons can collaborate in separate sessions, such that previous results can
be included in the refinement of other (related) sessions. Furthermore, previous
experiences can be documented using the wiki, for example, explanations/comments
by previous users. Furthermore, special refinement and/or evaluation knowledge
can be formalized for further improving the respective steps.
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>General Overview</title>
        <p>
          As discussed in the last section semantic data mining is concerned with the utilization of
ontological knowledge and semantic annotations to be used throughout the data mining
and knowledge discovery process, similar to ontology-enhanced [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] data mining.
However, further semantic features are enabled by including a semantic core component,
e.g., a RDF-Store: Using that, results can be incrementally formalized and provided to
the store, while subsequent mining and semantic queries can make use of the collected
knowledge. The data mining query, results, and additional knowledge can then be
transparently integrated into a semantic analytic report: The idea of such reports is based on
analytical reports [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] that are simple text documents containing the mining results with
additional text (which is created by humans). In the semantic setting, we can
automatically transform the mining results into a format suitable for the report. Additionally,
the content can be enriched using semantic annotations and links between the reports
(and background information). The wiki also provides for flexible versioning which is
especially useful in a collaborative setting.
        </p>
        <p>
          The sketched scenario is especially suitable for inexperienced users that are mainly
interested in reporting features of a data mining system. Such reports provide high-level
access to pre-specified queries that can be evaluated routinely. However, using the wiki
query mechanism, such queries can also be formalized in an ad-hoc fashion. Further
more detailed reports, analyses and mining sessions can then be implemented using
more advanced data mining tools, e.g., by applying the VIKAMINE [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] system.
        </p>
        <p>
          On the application side, specialized sessions with domain experts, e.g., medical
doctors, and data mining engineers can be easily implemented using the collaborative
tool. In this context, the proposed approach provides, for example, flexible query
formalization, versioning, a history of queries and results, and the potential for knowledge
and experience management since the obtained semantic analytical reports can be
commented on, and can be linked to other (similar) documents. Further sessions can thus
easily build on results of previous sessions, with the same or new participants. For
experience management, the wiki can also be combined with a tagging system, e.g., [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ].
2.3
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Architectural Overview</title>
        <p>
          Due to the limited space, we only provide a brief summary of the architecture of the
proposed approach. A more comprehensive discussion and overview is given in [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
The architecture consists of two core components: The basic wiki system (provided by
JSPWiki (http:/www.jspwiki.org) is extended by the semantic wiki extension
KnowWE [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]. The wiki component provides basic features like editing, versioning,
user management, access management and attachment management. Additionally, it
directly supports the collaborative aspects of the sketched semantic mining approach.
KnowWE itself is designed as a highly extensible minimal core providing basic
semantic wiki features like formalization and reasoning. Therefore, for communication with
the mining component we designed the connector plugin KnowWE-RIP (REST [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]
Interface Plugin) that facilitates the connection to the mining web-service. The
semantic core component for storage and reasoning is given by a combination of the
Sesame (http://www.openrdf.org) framework and OWLIM. Sesame is a
javabased framework with support for storing and analyzing RDF data. OWLIM is a
semantic repository with reasoning capabilities that is packaged as a storage and inference
layer for sesame. As such, KnowWE integrates a semantic component and contains a
connector to the Sesame/OWLIM components for providing the semantic functionality.
        </p>
        <p>
          We utilize the VIKAMINE [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] system (http://www.vikamine.org) for data
mining. VIKAMINE features a web-service that can be queried using XML based on
a specialized query language. The result (i.e., the answer) is also formulated as XML
and can thus transparently be integrated with the wiki.
        </p>
        <p>
          The semantic mining process is initiated by the user, that is, by formulating a query
to the wiki system. Similar to other wiki-systems, the query is provided in the form
of an inline-query (e.g., [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]): The query is directly entered in textual form. Whenever
the wiki page is stored and/or reloaded with a new or modified query the result is
requested. In addition, we provide ’extended’ inline queries, such both the query and the
result (i.e., the ’answer’) can also be shown as required. Technically, the query is first
transformed to an XML-representation (VPDL, the VIKAMINE Pattern Description
Language, and then forwarded to the mining engine that produces an result in
XML/VPDL format. Finally, this result is re-transformed into human-readable textual form
to be displayed by the wiki. However, internally the ’raw’ result can be retained by the
versioning system of the wiki, such that always the latest result is available and can be
cached for efficiency. Therefore, changes, for example, due to an updated dataset, can
be easily extracted. The general architecture is shown in Figure 1. The seamless
integration of the result presentation enables (inexperienced) users to quickly evaluate the
obtained results by themselves and according to the formalized ontological knowledge.
2.4
        </p>
      </sec>
      <sec id="sec-2-4">
        <title>Related Work and Discussion</title>
        <p>
          Using ontologies for
enhancing data mining has
been discussed, e.g., by
Svatek et al. [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] and by
Antunes [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] in the context of
mining association rules.
        </p>
        <p>
          Furthermore, Cespivova et
al. [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] and Kuo et al. [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]
describe applications in the
medical domain. While the
application of ontologies is
also a focus of the
presented approach, the
proposed method aims at a Fig. 1. Semantic Data Mining Architecture
more comprehensive
integration of semantic
information and knowledge. In contrast to the existing approaches, the proposed approach
considers a comprehensive two-way integration of semantic and data mining methods
for semantic data mining, with feedback in both directions. In this way, prior knowledge
can be transparently integrated. Using the wiki-support of the presented approach
collaborative sessions can be implemented. Furthermore, semantic annotations using the
wiki, linking unstructured, semi-structured and structured information is another novel
issue with respect to the presented approach. Semantic analytical reports can include
semantic annotations at the document level, global tagging, and associated query – data
mining results that are stored in the semantic store and thus provide powerful options
for knowledge-rich applications.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Conclusions</title>
      <p>In this paper, we have presented an approach for collaborative semantic data mining.
We discussed the considerations of task design, evaluation and refinement in the context
of semantic data mining. Additionally, we have presented an overview on the approach
and have described its architectural considerations in detail, utilizing the VIKAMINE
system for semantic data mining with a connector to a wiki system.</p>
      <p>
        For future work, we aim to extend the mining approach towards text mining and
information extraction, e.g., [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. This opens up further potential for incremental
knowledge refinement, discovery and semantic annotation.
      </p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgements</title>
      <p>This work has been partially supported by the German Research Council (DFG) under
grant Pu 129/8-2.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Reutelshoefer</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haupt</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lemmerich</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baumeister</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>An Extensible Semantic Wiki Architecture</article-title>
          .
          <source>In: Proc. 4th Workshop on Semantic Wikis - The Semantic Wiki Web</source>
          . (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Krötzsch</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vrandecic</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Völkel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haller</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Studer</surname>
          </string-name>
          , R.:
          <source>Semantic Wikipedia. Web Semantics: Science, Services and Agents on the World Wide Web</source>
          <volume>5</volume>
          (
          <issue>4</issue>
          ) (
          <year>2007</year>
          )
          <fpage>251</fpage>
          -
          <lpage>261</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Atzmueller</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Puppe</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Semi-Automatic Visual Subgroup Mining using VIKAMINE</article-title>
          .
          <source>Journal of Universal Computer Science (JUCS)</source>
          ,
          <source>Special Issue on Visual Data Mining</source>
          <volume>11</volume>
          (
          <issue>11</issue>
          ) (
          <year>2005</year>
          )
          <fpage>1752</fpage>
          -
          <lpage>1765</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Atzmueller</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Seipel</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Using Declarative Specifications of Domain Knowledge for Descriptive Data Mining</article-title>
          .
          <source>In: Proc. 18th International Conference on Applications of Declarative Programming and Knowledge Management</source>
          , Berlin, Springer Verlag (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Svátek</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rauch</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ralbovský</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Ontology-Enhanced Association Mining</article-title>
          .
          <source>In: Semantics, Web and Mining</source>
          . Volume
          <volume>4289</volume>
          of LNCS. (
          <year>2005</year>
          )
          <fpage>163</fpage>
          -
          <lpage>179</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Antunes</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Onto4AR: A Framework for Mining Association Rules</article-title>
          . In: International Workshop on
          <article-title>Constraint-Based Mining and Learning (CMILE</article-title>
          <year>2007</year>
          ), Warsaw, Poland (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Cespivova</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rauch</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Svatek</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kejkula</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Roles of Medical Ontology in Association Mining CRISP-DM Cycle</article-title>
          .
          <source>In: Proc. ECML/PKDD 2004 Workshop on Knowledge Discovery and Ontologies</source>
          , Pisa, Italy (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Kuo</surname>
            ,
            <given-names>Y.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lonie</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sonenberg</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paizis</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Domain Ontology Driven Data Mining: A Medical Case Study</article-title>
          .
          <source>In: DDDM '07: Proceedings of the 2007 international workshop on Domain driven data mining</source>
          , New York, NY, USA, ACM (
          <year>2007</year>
          )
          <fpage>11</fpage>
          -
          <lpage>17</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Atzmueller</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haupt</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Puppe</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Knowta: Wiki-Enabled Social Tagging for Collaborative Knowledge and Experience Management</article-title>
          .
          <source>In: Proc. 2nd International Workshop on Design, Evaluation and Refinement of Intelligent Systems (DERIS)</source>
          .
          <article-title>(</article-title>
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Atzmueller</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lemmerich</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reutelshoefer</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Puppe</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>An Extensible Architecture for Wiki-enabled Semantic Data Mining</article-title>
          .
          <source>In: Technical Report</source>
          . University of Wuerzburg. (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Fielding</surname>
            ,
            <given-names>R.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          , R.N.:
          <article-title>Principled Design of the Modern Web Architecture</article-title>
          .
          <source>ACM Transactions on Internet Technology</source>
          <volume>2</volume>
          (
          <issue>2</issue>
          ) (
          <year>2002</year>
          )
          <fpage>115</fpage>
          -
          <lpage>150</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Atzmueller</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kluegl</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Puppe</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Rule-Based Information Extraction for Structured Data Acquisition using TextMarker</article-title>
          .
          <source>In: Proc. LWA</source>
          <year>2008</year>
          (
          <article-title>Knowledge Discovery and Machine Learning Track</article-title>
          ), University of Wuerzburg (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>