<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>April</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Presentations Preserved as Interactive Multi-video Objects</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Caio Cesar Viel</string-name>
          <email>caio viel@dc.ufscar.br</email>
          <email>cesar@dc.ufscar.br</email>
          <email>viel@dc.ufscar.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Erick L. Melo</string-name>
          <email>erick melo@dc.ufscar.br</email>
          <email>melo@dc.ufscar.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maria da Graca C. Pimentel</string-name>
          <email>mgp@icmc.usp.br</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cesar A. C. Teixeira</string-name>
          <email>cesar@dc.ufscar.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Author Keywords</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, Universidade Federal de S~ao Carlos</institution>
          ,
          <addr-line>S~ao Carlos-SP</addr-line>
          ,
          <country country="BR">Brazil</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institute of Mathematics and Computer Science, Universidade de S~ao Paulo</institution>
          ,
          <addr-line>S~ao Carlos-SP</addr-line>
          ,
          <country country="BR">Brazil</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Student-multimedia interaction. Interactive Multimedia., E-learning. Ubiquitous Capture and Access. NCL.</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2013</year>
      </pub-date>
      <volume>8</volume>
      <issue>2013</issue>
      <abstract>
        <p>We rst give an overview of a system which allows capturing a lecture to generate, as a result, a multi-video multimedia learning object composed of synchronized videos, audio, images and context information. We then discuss how a group of students interacted with a learning object captured from a problem solving lecture: a similar approach can be used by instructors to re ect about their performance during their lectures.</p>
      </abstract>
      <kwd-group>
        <kwd>General Terms Documentation</kwd>
        <kwd>Measurement</kwd>
        <kwd>Veri cation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Increasingly universities record lectures and make them
available on the web, exploiting the fact that the
classroom can be viewed as a rich multimedia
environment where audiovisual information is combined
with annotating activities to produce complex multimedia
objects [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Although in some cases the web lecture may
be a single video stream (e.g. Kan Academy and TED
talks), more elaborate viewing alternatives are available
(e.g. opencast [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and openEya [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]).
      </p>
      <p>
        Once captured lectures have been made available, being
able to analyse how the users watch them { and learn
from them { is a challenging task, as illustrated by Brooks
and colleagues [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In such a scenario, extracting
semantics from the captured information is a must [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
We built a system prototype that allows recording a
lecture: audio and video streams from the instructor,
slides, writings on whiteboards, as well as contextual
information { the aim is to automatically generate an
interactive multimedia object [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ]. Given the several
sources of information, students must be given a broad
range of interaction alternatives when reviewing the
lecture: our system generates multi-video objects in a
standard for interactive multimedia, so that students have
several interaction alternatives at the same time that can
use a standard HTML5 browser. The actual student
interactions are also captured so they can be analysed.
Next, we brie y introduce our system, and outline results
from analysing the student interactions with the resulting
interactive multi-video object.
      </p>
    </sec>
    <sec id="sec-2">
      <title>From capture to interactive multi-video</title>
      <p>
        We have instrumented a classroom with cameras
(Figure 1(a)), and built a prototype system whose
modules (Figure 1(b)) capture several information streams
from a lecture and generate an interactive multimedia
object (NCL1), which can then be played-back in a player
which runs on standard HTML5 browsers (Figure 1(c)).
1Nested Context Language - http://ncl.org.br/en
(a)
(b)
(c)
The player (Figure 1(c)) is designed so that the
multi-video object corresponding to the lecture may be
reconstituted and explored in dimensions not achievable in
the classroom. The student may be able, for example, to
obtain multiple synchronized audiovisual content that
includes the slide presentation (1), the whiteboard content
(2), video streams with focus on the slide (3) or the
lecturer's full body (4), or the lecturer's web browsing,
among others. Moreover, the student may choose at any
time what content is more appropriated to be exhibited in
full screen. The student may also be able to perform
semantic browsing using points of interest like slides
transitions and the position of lecturer in the classroom.
Moreover, facilities can be provided for users to annotate
the captured lecture while watching it, as advocated by
the Watch-and-Comment paradigm [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
    </sec>
    <sec id="sec-3">
      <title>One lecture, 12 modules</title>
      <p>Using the capture-tool prototype, one instructor captured
one lecture without students in the classroom: students
had access to the multi-video object to prepare to their
nal exam.</p>
      <p>The lecture was a problem solving session for a Computer
Organization course in which an instructor solved a total
of 15 exercises. These exercises were related to each other
and usually a subsequent exercise used some results from
the previous one. The exercises also become more di cult
as the presentation progressed.</p>
      <p>The lecture was divided into 12 modules, totalling 1 hour
and 18 minutes. Module 1 presented 3 exercises, module
5 contained 2 exercises, and all the other modules
presented one exercise each.</p>
      <p>Eighteen students watched the lecture for at least 4
minutes: the average playback time was 59 minutes, with
standard deviation of 39 min. The average number of
interactions was 118.6, also with a large deviation (99.6).</p>
    </sec>
    <sec id="sec-4">
      <title>Checking out student interactions</title>
      <p>Given that the multimedia object has more than one video
stream and that students can choose which stream they
want as the main stream (presented in the large window
in the player), the information of which stream is the
most selected as the main stream at each moment can be
useful for the instructors to re ect about their
performance during the lecture.</p>
      <p>
        We present next how students interacted with the several
video components that make up the multi-video object of
modules 1 and 4. A detailed discussion of the students
interaction with all modules is available elsewhere [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
Figure 2(a) and Figure 2(b) summarize which streams
were most selected as the main stream, respectively, for
module 1 and module 4. Each line represents how many
times a stream was watched in a speci c moment:
the blue line corresponds to the slides as presented
in the instructors notebook (Figure 1(c-1));
the red line corresponds to the conventional
whiteboard (Figure 1(c-2));
the green line corresponds to the electronic
whiteboard which presented slides which could be
annotated by the instructor (Figure 1(c-3));
and the purple corresponds to the camera giving an
overview of the classroom (Figure 1(c-4)).
      </p>
      <p>As shown in Figure 2(a), students watched more, as the
main stream, the slides and the whiteboard. The three
regions with higher values for the red line correspond to
the moments in which the instructor solved the three
exercises writing on the conventional whiteboard.
Accordingly, the higher values for the blue line correspond
to the slides with the speci cation of the exercises, and
precede properly the higher values of the red line. A
similar behavior is shown in Figure 2(b): the di erence is
that this module discussed a single exercise.</p>
    </sec>
    <sec id="sec-5">
      <title>Final Remarks</title>
      <p>Our plans for future work include capturing more
contextual information during the presentation toward
providing novel navigation facilities, and the development
of visualization tools for the instructors to analyse the
students multi-video object interaction.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgements</title>
      <p>We thank the courses instructor and the students, the
WAVe13 organizers for the opportunity to present our
work, and the workshop participants for their inspiring
presentations.2</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Abowd</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pimentel</surname>
            ,
            <given-names>M. d. G. C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kerimbaev</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ishiguro</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Guzdial</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <article-title>Anchoring discussions in lecture: an approach to collaboratively extending classroom digital media</article-title>
          .
          <source>In Proc. CSCL</source>
          '
          <volume>99</volume>
          (
          <year>1999</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Brooks</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thompson</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Greer</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <article-title>Visualizing lecture capture usage: A learning analytics case study</article-title>
          .
          <source>In Proc. WAVe</source>
          '
          <year>2013</year>
          (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Canessa</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fonda</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tenze</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Zennaro</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <article-title>Apps for synchronized photo-audio recordings to support students</article-title>
          .
          <source>In Proc. WAVe</source>
          '
          <year>2013</year>
          (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Cattelan</surname>
            ,
            <given-names>R. G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teixeira</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goularte</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Pimentel</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. D. G. C.</surname>
          </string-name>
          <article-title>Watch-and-comment as a paradigm toward ubiquitous interactive video editing</article-title>
          .
          <source>ACM TOMCCAP 4</source>
          ,
          <issue>4</issue>
          (Nov.
          <year>2008</year>
          ),
          <volume>28</volume>
          :1{
          <fpage>28</fpage>
          :
          <fpage>24</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5] Hurst, W.,
          <string-name>
            <surname>Maass</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , Muller, R., and
          <string-name>
            <surname>Ottmann</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <article-title>The "authoring on the y" system for automatic presentation recording</article-title>
          .
          <source>In CHI'01 Extended Abstracts</source>
          (
          <year>2001</year>
          ),
          <volume>5</volume>
          {
          <fpage>6</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Pimentel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Abowd</surname>
            ,
            <given-names>G. D.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Ishiguro</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <article-title>Linking by interacting: a paradigm for authoring hypertext</article-title>
          .
          <source>In Proc. HYPERTEXT</source>
          '
          <volume>00</volume>
          (
          <year>2000</year>
          ),
          <volume>39</volume>
          {
          <fpage>48</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Ronchetti</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <article-title>Videolectures ingredients that can make analytics e ective</article-title>
          .
          <source>In Proc. WAVe</source>
          '
          <year>2013</year>
          (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Viel</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Melo</surname>
            , E., da Graca Pimentel,
            <given-names>M.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Teixeira</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <article-title>How are they watching me: learning from student interactions with multimedia objects captured from classroom presentations</article-title>
          .
          <source>In Proc. ICEIS</source>
          '
          <volume>13</volume>
          (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>