<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Multiple-Resolution Stream Clustering Using Graph Maintenance</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marwan Hassani</string-name>
          <email>hassani@cs.rwth-aachen.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pascal Spaus</string-name>
          <email>spaus@cs.rwth-aachen.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Thomas Seidl</string-name>
          <email>seidl@cs.rwth-aachen.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Data Management and Data Exploration Group RWTH Aachen University</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Challenges for clustering streaming data are getting continuously more sophisticated. They are driven by stream properties such as the continuous data arrival, the time-critical processing of objects, the evolution of the data streams, the presence of outliers and the varying densities of the data. Due to the continuously evolving nature of the stream, it is crucial that stream clustering algorithms autonomously detect clusters whose number, shapes and densities vary as the stream flows. We present the first hierarchical density-based stream clustering algorithm based on cluster stability, called HASTREAM [2] which is able to meet the above mentioned requirements. We show additionally, that HASTREAM inherited efficiency issues as the main drawback of density-based hierarchical clustering algorithms, as these were not the scope of its contribution. We present then I-HASTREAM [1], a first density-based hierarchical clustering algorithm that has considerably less computational time compared to the first presented algorithm. I-HASTREAM utilizes and introduces techniques from the graph theory domain to devise an incremental update of the underlying model instead of repeatedly performing the expensive calculations of the huge graph. Specifically the Prim's algorithm for constructing the minimal spanning tree is adopted by introducing novel, incremental maintenance of the tree by vertex and edge insertion and deletion. The extensive experimental evaluation study on real world datasets shows that I-HASTREAM is considerably faster than HASTREAM while delivering almost the same clustering quality.</p>
      </abstract>
    </article-meta>
  </front>
  <body />
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Marwan</given-names>
            <surname>Hassani</surname>
          </string-name>
          , Pascal Spaus, Alfredo Cuzzocrea, and Thomas Seidl.
          <article-title>Adaptive stream clustering using incremental graph maintenance</article-title>
          .
          <source>In BigMine 2015 at KDD'15</source>
          , pages
          <fpage>49</fpage>
          -
          <lpage>64</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Marwan</given-names>
            <surname>Hassani</surname>
          </string-name>
          , Pascal Spaus, and Thomas Seidl.
          <article-title>Adaptive multiple-resolution stream clustering</article-title>
          .
          <source>In MLDM '14</source>
          , pages
          <fpage>134</fpage>
          -
          <lpage>148</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <article-title>Copyright c 2015 by the paper's authors. Copying permitted only for private and academic purposes</article-title>
          . In: R.
          <string-name>
            <surname>Bergmann</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Görg</surname>
          </string-name>
          , G. Müller (Eds.):
          <source>Proceedings of the LWA</source>
          <year>2015</year>
          <article-title>Workshops: KDML, FGWM, IR, and FGDB</article-title>
          . Trier, Germany,
          <volume>7</volume>
          .-
          <fpage>9</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <source>October</source>
          <year>2015</year>
          , published at http://ceur-ws.org
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>