<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Ordinal Data Analysis</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Knowledge &amp; Data Engineering Group, Dept. of Electrical Engineering and Computer Science &amp; Research Center for Information System Design (ITeG), University of Kassel</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Orders are ruling our live: social hierarchies, rankings in online shops,
classification systems, waiting queues and many more. However, the majority of
data analysis approaches has been developed for numerical features. There are
– at least – two reasons for this: i) These features exist in many datasets,
because many of the properties can be measured by real numbers, and ii), they
allow to make use of a rich set of mathematical tools, such as computing
differences/distances, means, deviations, weighted averages, etc.</p>
      <p>In 1946, S. S. Stevens aimed at providing a solid mathematical foundation to
the question whether and how it is possible to measure human sensation. To this
end, he introduced four different levels of measurement - nominal, ordinal,
interval, and ratio. This triggered a discussion about the meaning of ‘measurement’
in the various cases and the statistical manipulations that can legitimately be
applied. This discussion has been very productive, even though vicious at times,
and is still going on today.</p>
      <p>In the meanwhile, many analysis methods have been developed for data on
the nominal, interval and ratio levels. However, there exists up to now no
comprehensive theory for analysing ordinal data. There are only few data science/data
analysis/machine learning techniques that are particularly suited for ordinal data
(e. g., decision trees work well for ordinal data). Because of the appeal of analysis
and machine learning techniques for numerical data, many scientists also apply
these techniques to ordinal data. In clustering, for instance, ranks are frequently
treated as being interval-scaled, so that distances one is used to (e. g., Euclidean
distance) can be applied. However, this may lead to significant misinterpretations
of the data.</p>
      <p>The aim of this talk is to inspire the audience to venture into the development
of tools for ordinal data analysis. To this end, we will illustrate various roles of
orders in human live by real-world examples and summarize the main issues of
the discussion about the various levels of measurement before discussing different
types of ordinal data analysis tasks. We conclude the talk by presenting ongoing
work of our group on ordinal data analysis.</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>