<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic-based Learning for Trend Mining in Text Collections</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Networked Information Systems, Free University Berlin</institution>
          ,
          <addr-line>Konigin-Luise-Str.24-26 , 14195 Berlin</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <kwd-group>
        <kwd>Olga Streibel</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Determination and early detection of emerging trends can be retrieved from
numeric data as well as from texts [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. However, the use of text collections in
the process of trend detection requires new analysis techniques. Although many
interesting approaches have been developed in the eld of Trend Mining on texts,
de ned as Emerging Trend Detection in Text Mining[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], they are still lacking
the integration of expert knowledge in the process of trend recognition. Such
knowledge is crucial for the proper trend detection in various domains, e.g. in
medical diagnosis, opinion mining, nancial markets.
      </p>
      <p>Our research addresses trend detection in text collections and we are
concentrating on the development of a novel, knowledge-based learning approach for
the automatic detection of trends in text streams. At this stage of our work, we
are concerning ourselves with two main research issues: representation of trend
knowledge and knowledge-integrating learning approach for trend detection.</p>
      <p>
        Knowledge about emerging trends is hard to de ne and relies on the
knowledge of given domain including experts' experience in detecting trends for this
domain. Knowledge of given domain can be expressed in language, thus in formal
de nition of domain concepts and relations between these concepts. Semantic
Web o ers ontology as a technology for knowledge formalization. However, the
classic Ontology approach has been created under assumption of hierarchical,
static relations describing knowledge and therefor can be successfully applied in
domains of taxonomic characteristic (i.e. life sciences). Knowledge about trends
is more intuitive, dynamic, context- and time-dependent, subjective. We assume
that formalization of trend knowledge requires for novel lightweight
formalization techniques, like extreme tagging [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]1, that allow for collective capturing
nonhierarchical relations and enhanced semantics. For this reason, we are
concentrating on the utilization of Extreme Tagging Systems approach in order to gather
and formalize knowledge needed for trend detection.
      </p>
      <p>
        Regarding chosen knowledge representation paradigm and di erent learning
approaches from Arti cial Intelligence[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], we are elaborating on the proper de
nition of a knowledge-integrating learning approach for trend detection in texts.
Considering several statistical learning methods, we aim at applying the
appropriate method to the nancial market texts in order to enable trend detection
in this domain.
1 http://www.corporate-semantic-web.de/extreme-tagging.html
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>April</given-names>
            <surname>Kontostathis</surname>
          </string-name>
          , Leon Galitsky, William M. Pottenger, Soma Roy, and
          <string-name>
            <given-names>Daniel J.</given-names>
            <surname>Phelps</surname>
          </string-name>
          .
          <article-title>A Survey of Emerging Trend Detection in Textual Data Mining</article-title>
          .
          <source>SpringerVerlag</source>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Victor</given-names>
            <surname>Lavrenko</surname>
          </string-name>
          , Matt Schmill, Dawn Lawrie, Paul Ogilvie, David Jensen,
          <string-name>
            <given-names>and James</given-names>
            <surname>Allan</surname>
          </string-name>
          .
          <article-title>Mining of concurrent text and time series</article-title>
          .
          <source>In In proceedings of the 6 th ACM SIGKDD Int'l Conference on Knowledge Discovery and Data Mining Workshop on Text Mining</source>
          , pages
          <volume>37</volume>
          {
          <fpage>44</fpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Stuart</given-names>
            <surname>Russel Peter Norviq</surname>
          </string-name>
          .
          <article-title>Arti cial Intelligence: A Modern Approach</article-title>
          . Prentice Hall International,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Vlad</given-names>
            <surname>Tanasescu</surname>
          </string-name>
          and
          <string-name>
            <given-names>Olga</given-names>
            <surname>Streibel</surname>
          </string-name>
          .
          <article-title>Extreme tagging: Emergent semantics through the tagging of tags</article-title>
          .
          <source>In ESOE</source>
          , pages
          <volume>84</volume>
          {
          <fpage>94</fpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>