<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>An Ontology Design Pattern for Digital Video</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Panagiotis Mitzias</string-name>
          <email>pmitzias@iti.gr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marina Riga</string-name>
          <email>mriga@iti.gr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Simon Waddington</string-name>
          <email>simon.waddington@kcl.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Efstratios Kontopoulos</string-name>
          <email>skontopo@iti.gr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Georgios Meditskos</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pip Laurenson</string-name>
          <email>Pip.Laurenson@tate.org.uk</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ioannis Kompatsiaris</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Information Technologies Institute</institution>
          ,
          <addr-line>CERTH, GR-57001 Thessaloniki</addr-line>
          ,
          <country country="GR">Greece</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>King's College London</institution>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Tate</institution>
          ,
          <addr-line>London</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper we introduce an ODP for representing digital video resources. The aim is to model digital video files, their components and other associated entities, such as codecs and containers. The proposed design pattern facilitates the creation of relevant domain ontologies that will be deployed in the fields of media archiving and digital preservation of videos and video artworks. This ODP has been developed within the PERICLES FP7 project.</p>
      </abstract>
      <kwd-group>
        <kwd>Digital Video</kwd>
        <kwd>Codec</kwd>
        <kwd>Stream</kwd>
        <kwd>ODP</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        This paper presents an Ontology Design Pattern (ODP) for modelling digital video
resources. This work was motivated by the problem of consistent presentation of
digital video files in the context of digital preservation within the PERICLES FP7
project1. Over the past five years, this challenge has emerged as a significant one within
the conservation of video art [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and was taken as a focus within Presto4U2. As a
result of this initial work, Dave Rice was commissioned to produce a technical report
[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and it is this report which underpins the analysis of this challenge presented in this
paper. Although those who are responsible for the conservation of video art have been
particularly concerned with ensuring consistent playback, the problem is pertinent to
any application domain requiring video playback. Presenting digital video
consistently is dependent on the design, coordination and quality of all aspects of both the video
file and the video player [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. In particular, the ongoing development of media players
can impact the capability to view video files as they were originally intended.
      </p>
      <p>
        Playback of compressed video is reliant on a correct interpretation of the
parameters associated to the file, with colour and aspect ratio being two of the most
vulnerable properties. We focus our efforts here on the relationship between the video file
1 www.pericles-project.eu/
2 Presto4U FP7 project (ICT Call 9): www.tate.org.uk/about/projects/presto4u
itself, the codec used to compress the video and the wrapper. By wrapper, we mean a
multimedia container format, which can identify and interleave different data types,
including video and audio streams, subtitles, as well as synchronisation metadata to
enable the streams to be played concurrently. A particular source of conflict is that the
video file and the wrapper can potentially contain values for the same parameter,
which can lead to inconsistency of playback. For example, aspect ratio information
can be carried in both the video file and the wrapper, and is often handled differently
by different players [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>
        Many standards and specifications for video and multimedia containers exist, with
similar definitions of key parameters. When considering the playback of video and
audio using players supporting multiple video formats and multiple versions of those
formats, there is a clear need for a consistent set of definitions of key parameters
across different formats. To the best of our knowledge,
ontologydesignpatterns.org currently features no such ODP for describing video. A literature review
reveals several relevant ontologies and vocabularies that deal with the modelling of
multimedia objects and their processes. The well-established multimedia standard
MPEG-7 [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], as well as several MPEG-7 based ontologies (Hunter [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], Rhizomik [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ],
COMM [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], SWIntO [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], Boemie [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], DS-MIRF [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], M-OWL [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]) can be used for
creating metadata descriptions of multimedia content corresponding to low-level visual
and audio features, or semantic objects (e.g. places, actors, events, objects).
Furthermore, OMR3 is a core vocabulary aimed at bridging the different descriptions of
media resources and at providing an interoperable set of metadata. OMR includes,
among others, technical metadata about media objects; nevertheless the represented
properties do not cover the domain in sufficient detail. Similarly, the audioMD and
videoMD schemas4 define significant technical audio and video metadata, but do not
contain all the partial components which constitute a digital video.
      </p>
      <p>It is evident that the aforementioned ontologies do not focus specifically on the
representation of digital videos but on various multimedia resources as a whole, by
modelling information regarding the creator, the conceptual aspect (idea, content)
behind the digital resource, its legal/intellectual properties, etc. Our proposed ODP
deals with the structural and technical representation of digital videos in detail,
carrying significant information for modelling characteristics and interrelationships
(dependencies) that impact the ability to preserve a digital video over time.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Pattern Description and Formalization</title>
      <p>This section presents the proposed ODP, focusing on the core classes, properties, and
axioms. Fig. 1 features a diagrammatic overview of the pattern, which is available at:
http://ontologydesignpatterns.org/wiki/Submissions:DigitalVideo.
Although the ODP contains 31 classes and 27 object properties, in terms of
expressivity it is deliberately lightweight, containing only subclass and subproperty axioms,</p>
      <sec id="sec-2-1">
        <title>3 www.w3.org/TR/mediaont-10/</title>
        <p>4 www.loc.gov/standards/amdvmd/index.html
property restrictions and class disjointness axioms, in order to be easily applicable to
a wide range of use cases and scenarios.</p>
        <p>Our starting assumption for the design pattern is that the video entity itself
comprises a video stream alongside optional associated audio and subtitle streams. The
pattern also covers the case of having multiple video and audio streams. The
following is a list of the core classes found in the proposed pattern:
DigitalVideo. The DigitalVideo class represents a single digital video file. Such a
file typically consists of one or more streams, which are compressed using codecs and
wrapped into a specific type of container.</p>
        <p>Container. A Container (or wrapper) is typically 1-to-1 associated with the video
file format. It acts as a discrete “black box” that contains the various components of a
video and defines how different elements of data and metadata coexist in the video
file. Sample container formats are AVI, Matroska, MP4, etc.</p>
        <p>Codec. A Codec (coder-decoder) is a computer software capable of encoding or
decoding a digital data stream or signal5. Video codecs convert raw video streams to a
compressed format and vice-versa, while audio codecs process audio streams. Some
well-known codecs are x264, DivX Pro and mp3HD.</p>
        <p>Stream. A (data) stream is a sequence of digitally encoded coherent signals (packets
of data or data packets) used to transmit or receive information6. Class Stream
represents raw, uncompressed content (video, audio or subtitles) prior to being encoded
into a wrapper or after being decoded from a wrapper. A digital video file includes at
least one video stream and may also have any number and any kind of other streams:</p>
        <sec id="sec-2-1-1">
          <title>DigitalVideo  hasVideoStream.VideoStream</title>
          <p>(1)</p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>5 https://en.wikipedia.org/wiki/Codec</title>
        <p>6 https://en.wikipedia.org/wiki/Data_stream</p>
        <sec id="sec-2-2-1">
          <title>DigitalVideo  hasAudioStream.AudioStream</title>
        </sec>
        <sec id="sec-2-2-2">
          <title>DigitalVideo  hasSubtitleStream.SubtitleStream</title>
          <p>(3)</p>
        </sec>
        <sec id="sec-2-2-3">
          <title>Each type of stream (VideoStream, AudioStream and SubtitleStream) is asso</title>
          <p>ciated with disparate types of properties and elements (see Fig. 1), though some of
them apply to both video and audio streams, such as BitRate and SampleRate.</p>
          <p>All in all, the proposed ODP is deliberately generic and can be extended with
appropriate restrictions, depending on specific application requirements. Also, for
reasons of flexibility, no data properties were included for numerical or text attributes.
3</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Conclusions</title>
      <p>This paper presented an ODP for representing digital video resources that can serve as
the building block for domain-specific ontologies. Its main focus is on digital
preservation and has been successfully deployed within the PERICLES FP7 project.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgements</title>
      <p>This work was supported by the European Commission Seventh Framework
Programme under Grant Agreement Number FP7-601138 PERICLES.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Arndt</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          et al. (
          <year>2007</year>
          ).
          <article-title>COMM: designing a well-founded multimedia ontology for the web</article-title>
          .
          <source>LNCS 4825</source>
          , pp.
          <fpage>30</fpage>
          -
          <lpage>43</lpage>
          , Springer Berlin Heidelberg.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Dasiopoulou</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          et al. (
          <year>2007</year>
          ).
          <source>Capturing MPEG-7 Semantics. 2nd Int. Conf. on Metadata and Semantics (MTSR)</source>
          , Corfu, Greece.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Garcia</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Celma</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>Semantic Integration and Retrieval of Multimedia Metadata</article-title>
          .
          <source>Int. Semantic Web Conference (ISWC'05)</source>
          , Galway, Ireland.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Harit</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          et al. (
          <year>2006</year>
          ).
          <article-title>Using Multimedia Ontology for Generating Conceptual Annotations and Hyperlinks in Video Collections</article-title>
          .
          <source>IEEE/WIC/ACM Int. Conf. on Web Intelligence</source>
          , pp.
          <fpage>211</fpage>
          -
          <lpage>217</lpage>
          , IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Hunter</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2001</year>
          ).
          <article-title>Adding Multimedia to the Semantic Web: Building an MPEG-7 Ontology</article-title>
          . SWWS'
          <volume>01</volume>
          , Stanford University, California, USA.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Jarczyk</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Phillips</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>Life after tape: Collecting Digital Video Art</article-title>
          . Electronic Media Group.
          <article-title>42nd American Institute for Conservation Annual Meeting (to appear).</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Oberle</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          et al. (
          <year>2007</year>
          ).
          <article-title>DOLCE ergo SUMO: On Foundational and Domain Models in SWIntO (Smart Web Integrated Ontology)</article-title>
          .
          <source>Journal of Web Semantics</source>
          ,
          <volume>5</volume>
          (
          <issue>3</issue>
          ), pp.
          <fpage>156</fpage>
          -
          <lpage>174</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Polydoros</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          et al. (
          <year>2006</year>
          ).
          <article-title>GraphOnto: OWL-based ontology management and multimedia annotation in the DS-MIRF framework</article-title>
          .
          <source>J. of Digital Inf. Management</source>
          ,
          <volume>4</volume>
          (
          <issue>4</issue>
          ),
          <fpage>214</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Rice</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Sustaining Consistent Video Presentation</article-title>
          . Available online: http://goo.gl/CHDfKB, last accessed: June'15.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Sikora</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2001</year>
          ).
          <article-title>The MPEG-7 visual standard for content description - an overview</article-title>
          .
          <source>IEEE Trans. on Circuits and Systems for Video Technology</source>
          ,
          <volume>11</volume>
          (
          <issue>6</issue>
          ), pp.
          <fpage>696</fpage>
          -
          <lpage>702</lpage>
          , IEEE.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>