=Paper=
{{Paper
|id=Vol-1716/WSICC_2016_paper_5
|storemode=property
|title=Screen Concepts for Multi-Version Hypervideo Authoring
|pdfUrl=https://ceur-ws.org/Vol-1716/WSICC_2016_paper_5.pdf
|volume=Vol-1716
|authors=Britta Meixner,Stefan John,Philipp Defner
|dblpUrl=https://dblp.org/rec/conf/tvx/MeixnerJD16
}}
==Screen Concepts for Multi-Version Hypervideo Authoring==
<pdf width="1500px">https://ceur-ws.org/Vol-1716/WSICC_2016_paper_5.pdf</pdf>
<pre>
   Screen Concepts for Multi-Version Hypervideo Authoring

                                       Britta Meixner1,2 , Stefan John1 , Philipp Defner1
                                     1
                                  University of Passau, Innstrasse 43, 94032 Passau, Germany
                           2 FX Palo Alto Laboratory, 3174 Porter Drive, Palo Alto, CA 94304, USA
                                   2 meixner@fxpal.com, 1 firstname.lastname@uni-passau.de


ABSTRACT                                                                            or a new hypervideo has to be put together in the authoring
The creation of hypervideos usually requires a lot of plan-                         tool from scratch. In case of changes in the hypervideo, both
ning and is time consuming with respect to media content                            versions have to be altered and maintained, which can be
creation. However, when structure and media are put together                        error prone in larger projects. Translating a hypervideo means
to author a hypervideo, it may only require minor changes                           not only translating every text or medium, it also involves
to make the hypervideo available in other languages or for                          the table of contents, the terms for the keyword search, the
another user group (like beginners versus experts). However,                        titles of scenes, alternative texts for media, the captions and
to make the translation of media and all navigation elements                        button labels of control panels, and all other elements that
of a hypervideo efficient and manageable, the authoring tool                        are shown in the player at some point. Providing facilitating
needs a GUI that provides a good overview of elements that                          functions for this in a hypervideo authoring tool can help
can be translated and of missing translations. In this work, we                     keeping maintenance costs low while providing up to date
propose screen concepts that help authors to provide different                      hypervideos.
versions (for example language and/or experience level) of a
                                                                                    The aforementioned issues raise the question how a translation
hypervideo. We analyzed different variants of GUI elements
                                                                                    feature can be integrated into a desktop hypervideo authoring
and evaluated them in a survey. We draw guidelines from the
                                                                                    tool without overcrowding the GUI. However, possibilities
results that can help with the creation of similar systems in the
                                                                                    need to be provided to add contents in another language for
future.
                                                                                    all hypervideo elements. Accordingly, the goal of this work
ACM Classification Keywords
                                                                                    was to understand:
H.5.2. Information Interfaces and Presentation (e.g. HCI):                          • How are the different versions of the hypervideo best indi-
User Interfaces                                                                       cated in the GUI?

Author Keywords                                                                     • What is a suitable GUI approach to enable the translation
Hypervideo; Navigation; Multilingual; Translation; Authoring                          of single elements in a hypervideo?

INTRODUCTION                                                                        • What is a suitable GUI approach to give an overview of all
Hypervideos consist of linked video scenes which are com-                             elements and enable editing in the overview?
bined with other types of media to provide further information                      We conducted a survey with 51 people mainly from Germany
on certain topics. Linking media that way, non-linear struc-                        and Austria presenting them different screen concepts. The
tures are created that allow users to interactively explore facts                   survey used conditional statements to present screen concepts
fitted to one’s personal level of knowledge and learning speed.                     always based on previous decisions to avoid confusion among
This type of video is then presented in a player with additional                    the participants. We found out that users preferred a tidy
presentation areas and navigational elements.                                       user interface that can be controlled by keyboard and mouse,
Hypervideo media content creation is time consuming and                             making it not overcrowded with too many buttons.
requires a lot of planning. Currently, if a new version of an                       This paper is structured as follows: We first give an overview
already finished hypervideo is needed, for example in another                       of related work, then we describe the procedure and partic-
language or for another target group (like beginners versus                         ipants of our study. The evaluation results lead to tentative
experts), a completely new hypervideo has to be created. This                       recommendations for hypervideo authoring tools. The paper
requires either to copy the existing video and replace contents                     ends with a discussion and conclusion section.
in a database or (often XML- or JSON-based) transfer file,
                                                                                    RELATED WORK
                                                                                    Taking a look at hypervideo and multimedia authoring sys-
                                                                                    tems, it can be noted that neither early systems like Hyper-
                                                                                    Hitchcock [16] nor current systems like the SIVA Producer
                                                                                    [11, 9, 10] or Klynt [6] are capable of translating contents
                                                                                    or providing different versions of contents. This is why we
                                                                                    started looking at systems with similar underlying structures
4th International Workshop on Interactive Content Consumption at TVX’16, June 22,
2016, Chicago, IL, USA                                                              of the content, namely Content Management Systems (CMS)
Copyright is held by the author/owner(s).
for web-pages. According to [3], a CMS should “enable and
disable as many languages” as required and provide a fallback
locale. The fallback locale is used if content is not available in
the language selected by the user.
Several CMS offer the option to translate the contents of a
web-page. OctoberCMS [13] provides a default language and
translations. New languages have to be added in the settings
view, where already added languages can also be managed.                      Figure 1. Version representations for combo-boxes.
Text can be translated via a table. The table provides two op-
tions, one shows all contents, the other allows to hide already
translated contents giving a better overview. The Adobe Ex-
perience Manager (Adobe CQ5) [4] lets users translate whole
web-pages. “The website will be displayed side by side in
2 parts with the source (reference page) and the target lan-
guage page in order to compare, check difference, edit and
translate as appropriate”. ICanLocalize [7] is a translation
service for web-pages by professional translators. The tool
that is provided for the translators loads web-pages and shows
a list of sentences which are extracted from the web-page.
The translators can then translate sentence by sentence. The
rubedo CMS [15] requires to first add a list of languages into
which the currently available content is translated. The user se-
lects between two strategies. The “only one” strategy displays                    Figure 2. Version representations for tabs.
only translated elements in the resulting web-page (nothing
is displayed in case no translation is provided). The “with
fallback” strategy displays the fallback language if content is      into an online survey that allowed conditional elements based
not translated. In order to edit or translate contents of a lan-     on previous answers.
guage, it is selected in a combo-box. Then, only the contents        The survey as well as the high fidelity prototypes used a car
in this language are shown, which allows no comparison with          repair scenario. The goal was to make the instructions on
the default language. Images and other media files cannot be         how to repair a car available for beginners and experts in both
translated (or replaced with content in another language), only      German and English. Accordingly, the original version of the
descriptions can be translated. Synapse [8] is a system for the      hypervideo which was created for German novices had to be
management of translations of websites. Translation is done          extended with a version for German experts and translated
sentence by sentence (or parts of sentences). The translation        into English.
interface shows the original text and the translated text alter-
nating for the single parts separated by markups. Another way        First, participants were asked about their demographic data
to deal with the translation issue is proposed by the Open Real      (age, gender, level of education) and technical background
Estate CMS which provides automatic translation of contents          (monitor size, monitor resolution, experience in video editing).
[12]. It first used the Google Translate API [5] and is now
                                                                     The second part of the survey showed three different represen-
using the MyMemory API [18].
                                                                     tations of the versions used in a combo-box (see Figure 1) and
The systems described in this section offer different implemen-      on tabs (see Figure 2). The first representation (a) was purely
tations of the translation functionality in the GUI. Ranging         text based combining language and experience level with a
from translations of whole web-pages to translations of sin-         “_”, the second representation (b) was purely image based
gle text elements, most of the tools do not offer functions to       showing a flag and a symbol, and the third representation (c)
replace media files.                                                 was a combination of text and image, where the experience
                                                                     level was written as text and the language was shown as a
STUDY/METHOD                                                         flag-symbol. In the remainder of the survey, the version repre-
To answer the questions stated in the introduction, we con-          sentation option preferred by the participant in a ranking was
ducted an online survey comparing different interface designs.       utilized in the next screens.
                                                                     The third part of the survey showed different variants of the
Procedure/Data Collection                                            media repository and how the elements should be grouped (see
A brainstorming in a focus group [14] with four participants         Figure 3). Variant (x) grouped all images, videos, text-files,
(N = 4) was used to find different ways of presenting the            audios, and subtitles into separate folders not differentiating
different versions of a hypervideo in the GUI of the SIVA            between the versions. Variant (y) adds a second layer for
Producer. Then, high fidelity paper prototypes were created          grouping the elements. Here the files are first grouped by
and refined step by step that covered all alternative presenta-      medium and then by version. Variant (z) also has a second
tion approaches in a way with consistent representations of          layer for grouping the elements. In this variant, the files are
concepts. The high fidelity prototypes were then integrated          grouped by version first and then by medium. Again, the
           Figure 3. Different layers in media repositories.


                                                                    Figure 5. Variants for the translation of the conditional control scene
                                                                    graph element.


                                                                    the path selectors at a fork in the scene graph (more complex
                                                                    set of text fields, combo- and check-boxes, Figure 5), the an-
                                                                    notation editor for image galleries (text fields and drop areas
                                                                    for images, Figure 61 ), and the editor for the table of contents
                                                                    (tree structure with entries and list of scenes, Figure 71 ). Each
                                                                    GUI element had a variant that showed the current implemen-
                                                                    tation in the SIVA Producer, but extended with a combo-box
                                                                    for version selection (variant (A), see Figure 4 and Figure 5).
                                                                    Variant (B) showed a very similar view compared to the cur-
                                                                    rent implementation but arranged on tabs (one tab for each
                                                                    version) (see Figure 4 and Figure 5). Variant (C) extended the
         Figure 4. Variants for the translation of scene titles.    screen. It showed both the original version of the element and
                                                                    three tabs for the other experience levels and translations (see
versions had to be ranked. Another question in this part of         Figures 4, 5, 6, and 7). The variants for each GUI element
the survey explored various ways to import media into the           were ranked by the participants.
media repository, which also had to be ranked. One way was          The last part of the survey showed different versions of an
first selecting the version and then the media which were then      overview table that allows the author to detect missing transla-
added to the right folders automatically. Another way was to        tions. One version of the table had equally high rows for all
first select the media and then the version. The third option       elements (see Figure 8 (Y)), the other had collapsible rows that
imported the media into a “global” folder from where they had       showed previews and full texts if they were expanded and only
to be added to the according folders in the media repository        basic information (like the beginning of a text or the name of
by drag-and-drop. These solutions also had to be ranked. A          a media file) when they were collapsed (see Figure 8 (Z)). The
text field was offered for suggestions of other solutions.          participants were asked several questions about each version
The fourth part of the survey showed three variants of exem-        as well as to select the preferred variant. Other questions were
plary GUI elements needed for the creation and translation
of the hypervideo itself. The shown GUI elements were part          1 Variants with combo-box and tabs are not shown here to avoid
of the scene editor (simple text field, Figure 4), the editor for   repetitive images and space issues.
Figure 6. Default version and translation tabs for the image gallery view.


                                                                                  Figure 8. Versions for the translation table with all elements.


                                                                             media repository folders also revealed no clear answer with
                                                                             an even distribution of the answers for each option.
                                                                             Taking a look at the variants of the four different GUI elements
Figure 7. Default version and translation tabs for the table of contents.
                                                                             (scene settings, path settings, image gallery editor, and table
                                                                             of contents) it can be noted that the tab-based variant (B) was
                                                                             considered as most appealing (average over all elements: 25
about editing and deleting of elements and contents in the
                                                                             participants) and clearly arranged (average over all elements:
table and the presentation of editing options.
                                                                             23.75 participants). The representation of the different ver-
                                                                             sions was considered most obvious (average over all elements:
Participants                                                                 28 participants) and the distinction of the versions from each
Survey participants were recruited through emails and postings               other was most clearly visible (average over all elements: 26.5
on social networks2 . The participants had a mean age of 23.23               participants) for variant (B). Asking the participants which
years (SD = 3.47). 39 of the participants were male and 11                   variant they would want to use, on average 24.75 chose variant
were female, one person did not want to tell. All participants               (B), 15 chose variant (C), and 11.25 chose variant (A). Variant
had at least some school degree; 6 had finished a training on                (A) with the combo-box seems to be considered better for
the job, 21 finished university. About half of the participants              the simpler GUI elements (scene settings and path settings),
used screen resolutions of 1920x1080 and monitor sizes of                    compared to the more complex ones. However, variant (C)
20 inch and larger. Most of the participants did not have                    achieved the best results regarding the question which variant
a lot of experience with video editing software, only two                    makes the translation of contents the most pleasant (average
categorized themselves as experts for Adobe Premiere [2],                    over all elements: 26 participants).
some had knowledge of Adobe Flash [1], Blender [17], and
Adobe After Effects [2].                                                     The questions regarding the overview table revealed somewhat
                                                                             controversial results. Half of the participants found the GUI
                                                                             of both versions appealing and clearly arranged. They found
EVALUATION
                                                                             that the contents can be recognized well (35 participants) and
The question regarding the representation of a version clearly               it is easy to find out which elements do not have an expla-
showed that a combination of text and symbol is preferred                    nation or translation in a certain variant (25 participants) in
(42 participants) (see Figures 1 and 2 variant (c)), 6 partici-              the variant showing all the available information (i.e. variant
pants chose the symbol and 3 participants chose the plain text               (Y). However, 30 participants preferred the collapsible variant.
variant.                                                                     Regarding the editing of contents in the table, 38 participants
For the question about media grouping in the media repository,               preferred “drag & drop from the media repository or input of
no clear statement regarding a preferred version can be made.                text directly in the cell of the table”, 8 participants wanted
However, it can be stated that a grouping with only one layer is             an editor window that only shows the content of the cell as
not preferred, as variant (x) was selected by only 8 participants.           well as editing tools, and 5 participants wanted to open the
Variant (y) was picked by 20 and variant (z) by 23 participants              standard annotation editor. Regarding the question how to
(see Figure 3). The steps for importing media into the desired               delete contents, 26 participants preferred marking a cell and
                                                                             pressing the delete button, 12 participants wanted a right-click-
2 https://www.facebook.com/ (accessed May 26, 2016)                          menu with a delete option, 9 participants wanted to drag the
element and drop it on a trash-bin symbol, and only 4 partici-      the view of the standard version and the tabs of the alternative
pants preferred an additional delete button in each cell. This      versions, and to variant (A) with the combo-box. No clear
is also supported by the results of another question, where 42      statement can be made regarding the design of the overview
participants wanted a tidy user interface instead of providing      table where participants evaluated the version showing more
additional buttons in each cell for editing contents. Regarding     information at one time better, but would prefer the more
the question how missing versions should be presented, 22 par-      compact version when they were asked directly.
ticipants preferred empty cells and 29 preferred a grayed-out
miniature view of the standard version.                             Future work should include follow-up studies to confirm and
                                                                    refine the tentative guidelines and recommendations formu-
                                                                    lated in this work. Further work in this area should also include
RECOMMENDATIONS
                                                                    hypervideo authoring on tablets and other devices.
The evaluation results of the versioning interfaces for the desk-
top hypervideo authoring tool allow us to draw tentative guide-
                                                                    REFERENCES
lines and recommendations for similar systems where contents
                                                                     1. Adobe Systems Incorporated. 2016a. Adobe Flash Player.
of hypervideo have to be translated or edited for another ver-
                                                                        Website http://www.adobe.com/products/flashplayer.html
sion:
                                                                        (accessed May 26, 2016). (2016).
• If suitable, provide a combination of text and symbol for
                                                                     2. Adobe Systems Incorporated. 2016b. Adobe Premiere
  representing a version in the GUI.
                                                                        Pro CC. Website
• The media repository should provide more layers than just             http://www.adobe.com/products/premiere.html (accessed
  sorting by medium.                                                    May 26, 2016). (2016).
• A view with tabs is preferred to switching between versions        3. Globalization Partners International. 2013a. Selecting the
  with a combo-box.                                                     Right Multilingual CMS to Manage International
                                                                        Websites. Website http://blog.globalizationpartners.
• Showing the default and translated versions next to each              com/cms-international-websites.aspx (accessed May 06,
  other makes the translation task more pleasant.                       2016). (2013).
• In an overview, elements should be editable as easily as           4. Globalization Partners International. 2013b. Website
  possible (direct entry, drag & drop, keyboard shortcuts)              Translation in Adobe Experience Manager - Adobe CQ5.
  and only necessary information should be visible without              Website http://blog.globalizationpartners.com/
  overcrowding the GUI with buttons for editing the contents.           multilanguage-website-adobe-experience-manager.aspx
                                                                        (accessed May 06, 2016). (2013).
• Missing information should be either visualized by empty
  fields or by grayed-out versions of the originals.                 5. Google Inc. 2015. Google Translate API - Dynamically
                                                                        translate between thousands of available language pairs.
DISCUSSION AND CONCLUSIONS                                              Website https://cloud.google.com/translate/ (accessed
In this paper we presented a GUI concepts study for providing           May 06, 2016). (2015).
hypervideo contents in different versions, like for beginners        6. Honkytonk Films. 2015. Klynt. Website,
and experts and/or in different languages. We created high              http://www.klynt.net/ (accessed April 27, 2015). (2015).
fidelity prototypes of GUI concepts and integrated them in
a survey which was answered by 51 participants. However,             7. ICanLocalize. 2016. ICanLocalize Documentation -
most of the participants were less than 30 years old, from Ger-         ICanLocalize documentation center. Website http://docs.
many/Austria, and had very limited to no experience in video            icanlocalize.com/information-for-clients/tutorials/
editing. A more diverse user group with more participants               how-to-translate-cms-websites-with-icanlocalize/
would make the results more reliable. Rough guidelines could            (accessed May 06, 2016). (2016).
nonetheless be drawn from the results where most of the users        8. iData Technologies. 2016. Multilingual Content
agreed. Our prototypes have a somewhat complex scenario                 Management with the Synapse Publisher CMS. Website
showing different experience levels and languages making it             http://www.idatatechnologies.com/en-US/
harder for the participants to understand the scenario. Using a         Multilingual-Web-Content-Management.aspx (accessed
simpler scenario would have made understanding the scenario             May 06, 2016). (2016).
easier, but would also have over-simplified the prototypes.
This may have resulted in results that are not applicable to         9. Britta Meixner, Stefan John, and Christian Handschigl.
more difficult scenarios.                                               2015. SIVA Suite: Framework for Hypervideo Creation,
                                                                        Playback and Management. In Proceedings of the 23rd
A combination of text and symbol are preferred for the de-              ACM International Conference on Multimedia (MM ’15).
scription and differentiation of the different versions. Media          ACM, New York, NY, USA, 713–716. DOI:
organization in the media repository should be enabled on two           http://dx.doi.org/10.1145/2733373.2807413
levels, either medium-version or version-medium; an organi-
zation structure with only one level is not desired. Comparing      10. Britta Meixner, Katarzyna Matusik, Christoph Grill, and
the different GUI variants for editing hypervideo elements,             Harald Kosch. 2012. Towards an easy to use authoring
variant (B) with a list of tabs is preferred to variant (C) with        tool for interactive non-linear video. Multimedia Tools
    and Applications 70, 2 (2012), 1251–1276. DOI:
    http://dx.doi.org/10.1007/s11042-012-1218-6

11. Britta Meixner, Beate Siegel, Günther Hölbling, Franz
    Lehner, and Harald Kosch. 2010. SIVA Suite: Authoring
    System and Player for Interactive Non-linear Videos. In
    Proceedings of the 18th ACM International Conference
    on Multimedia (MM ’10). ACM, New York, NY, USA,
    1563–1566. DOI:
    http://dx.doi.org/10.1145/1873951.1874287

12. MonoRay. 2016. Machine (automated) translation in
    Open Real Estate CMS. Website
    https://open-real-estate.info/en/blog/
    mashinnyj-perevod-v-open-real-estate-cms (accessed
    May 06, 2016). (2016).
13. octobercms.com. 2014. OctoberCMS Plugins - Translate -
    Enables multi-lingual websites and translate website
    contents. Website
    https://octobercms.com/plugin/rainlab-translate
    (accessed May 06, 2016). (2014).
14. D. Remenyi. 2012. Field Methods for Academic Research:
    Interviews, Focus Groups and Questionnaires 3rd
    Edition:. Academic Conferences and Publishing Limited.
15. rubedo. 2013. Multilingualism tutorial : rubedo cms.
    Website http://www.slideshare.net/Rubedo-project/
    multilingualism-tutorial-rubedo-cms (accessed May 06,
    2016). (2013).
16. Frank Shipman, Andreas Girgensohn, and Lynn Wilcox.
    2008. Authoring, Viewing, and Generating Hypervideo:
    An Overview of Hyper-Hitchcock. ACM Trans.
    Multimedia Comput. Commun. Appl. 5, 2, Article 15
    (Nov. 2008), 19 pages. DOI:
    http://dx.doi.org/10.1145/1413862.1413868

17. Stichting Blender Foundation. 2016. Blender. Website
    https://www.blender.org/ (accessed May 26, 2016).
    (2016).
18. Translated.net. 2015. MyMemory: API technical
    specifications. Website
    http://mymemory.translated.net/doc/spec.php (accessed
    May 06, 2016). (2015).

</pre>