=Paper= {{Paper |id=Vol-3300/short_6981 |storemode=property |title=GazeHD: Towards Measuring Effect of Depth of Field Controlled by Eye Tracking in 3D Environments |pdfUrl=https://ceur-ws.org/Vol-3300/short_6981.pdf |volume=Vol-3300 |authors=Marc Anthony Berends,Jordan Aiko Deja,Nuwan T Attygalle,Matjaž Kljun,Klen Čopič Pucihar |dblpUrl=https://dblp.org/rec/conf/hci-si/BerendsDAKP22 }} ==GazeHD: Towards Measuring Effect of Depth of Field Controlled by Eye Tracking in 3D Environments== https://ceur-ws.org/Vol-3300/short_6981.pdf
GazeHD: Towards Measuring Effect of Depth of
Field Controlled by Eye Tracking in 3D
Environments
Marc Anthony Berends1, Jordan Aiko Deja1,2, Nuwan T Attygalle1, Matjaž Kljun1,3 and
Klen Čopič Pucihar1,3
1 University of Primorska, Faculty of Mathematics, Natural Sciences and Information Technologies, Koper, Slovenia
2 De La Salle University Manila, Philippines
3 Faculty of Information Studies, Novo Mesto, Slovenia

Abstract
Depth of Field (DoF) has been used in 3D software to imitate realistic vision and to improve immersion and depth perception on 2D displays. However, traditional methods of introducing DoF use a fixed focus point, usually located in the center of the screen. This may lead to unwanted blur that can affect user immersion and game satisfaction. In this paper, we present GazeHD, a dynamic DoF system that uses eye tracking to actively keep the position of the user's gaze in focus whilst blurring other parts of the screen based on the geometry of the 3D environment. We evaluate dynamic DoF in a user study (𝑛 = 5) comprising a tunnel test and a 3D game demonstration. The results show no evidence that DoF improves depth perception; this was true for both mouse controlled and eye tracking controlled DoF. However, users perceived higher immersion, which also persisted in complex 3D scenes such as high fidelity first person video games.

                                         Keywords
                                         depth of field, eye tracking, tunnel test, unity, 3D game




1. Introduction
In order to imitate realistic vision in 3D games, software can simulate the depth of field
(DoF) effect. It is applied to the scene camera, generating imagery in which objects in the scene
are either blurred or sharp. The amount of blur depends on the properties of the camera, the
focal point, and the 3D geometry of the scene (i.e. the distance between the object and the
camera). This kind of visual distortion is intrinsic to our vision system, so its introduction to 3D
graphics may lead to a higher immersion when experiencing such virtual environments.
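The dependence of blur on camera properties and scene geometry can be sketched with the standard thin-lens circle-of-confusion model (an illustrative approximation with hypothetical parameter values, not code from the paper):

```python
def circle_of_confusion(object_dist, focus_dist, focal_length, aperture):
    """Diameter of the blur circle (same units as the inputs) for an
    object at object_dist when the lens is focused at focus_dist.

    Thin-lens approximation: blur grows with the distance between the
    object and the focal plane, scaled by aperture and focal length."""
    return (aperture * focal_length * abs(object_dist - focus_dist)
            / (object_dist * (focus_dist - focal_length)))

# An object exactly at the focal plane produces no blur ...
assert circle_of_confusion(2.0, 2.0, 0.05, 0.01) == 0.0
# ... while objects in front of or behind the focal plane are blurred.
near = circle_of_confusion(1.0, 2.0, 0.05, 0.01)
far = circle_of_confusion(4.0, 2.0, 0.05, 0.01)
```

A DoF post-processing effect evaluates such a function per pixel, using the depth buffer as `object_dist`, and blurs each pixel proportionally to the resulting diameter.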
   However, the standard implementation of DoF commonly uses a fixed focal point positioned
in the center of the screen. In this way, objects in the center of the screen are always in focus
as the user moves through the 3D environment. The correct focal length is calculated based
on the distance between the observer (i.e. the scene camera) and the scene center point (i.e.
the intersection point between the camera raycast and the surface in front of the camera) [1].
However, if a user wants to look at content that is away from the screen center, such content
may be invisible due to blur. This potentially breaks the immersion of the experience, as the
image does not accommodate for where the user is looking, and thus fails to fully imitate
realistic human vision. Besides breaking the illusion, this may also have a negative effect on
depth perception.

Human-Computer Interaction Slovenia 2022, November 29, 2022, Ljubljana, Slovenia
89181103@student.upr.si (M. A. Berends); jordan.deja@famnit.upr.si (J. A. Deja); nuwan.attygalle@famnit.upr.si
(N. T. Attygalle); matjaz.kljun@upr.si (M. Kljun); klen.copic@famnit.upr.si (K. Čopič Pucihar)
© 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
CEUR Workshop Proceedings (CEUR-WS.org), ISSN 1613-0073
   Several studies have explored depth perception in 3D virtual environments. In the study
by Naceri et al., the authors studied users’ depth perception in 3D virtual environments [2].
They compared two different Virtual Reality (VR) systems: head mounted devices and
immersive wide screen displays. The comparison was done by presenting a virtual environment
containing different objects and asking the participants to compare their depth. The objects
shown were placed at different depths; however, their size was modified according to their
depth position so that they always appeared to be of the same size. This was done to eliminate
the apparent size effects that would otherwise serve as a depth cue. The results showed
significant differences between the two devices and highlighted the distance misestimation
phenomenon for head mounted devices. Other studies explored depth perception and
immersion in the scope of stereoscopic 3D rendering [3] and 3D controlled DoF in stereoscopic
displays [4]. Another study looked at the effect of DoF on immersion in 3D games [5].
   Advances in low cost gaze-tracking technologies, such as the Tobii Eye Tracker 5, make it
possible to track human gaze at an affordable cost in close to real time. This makes it possible
to build a dynamic DoF system in which the focus point moves with the user’s gaze. In a study
conducted by Mauderer et al., the authors explored dynamic DoF and showed that it can lead
to an increase in perceived realism and can contribute to the perception of ordinal depth.
Furthermore, it also improved the estimation of distances between objects, although the authors
found this effect limited in its accuracy [6].
   In this paper we attempt to verify these previous results on dynamic DoF and extend them
by exploring whether such an effect can also be observed in situations where 3D objects are
used and where the user is experiencing complex 3D scenes, such as high fidelity first person
video games. Within this context we want to find out: (1) does eye tracking controlled DoF
improve depth perception accuracy? and (2) is eye tracking controlled DoF preferred, and does it
offer higher immersion, when compared to fixed and no DoF systems? To answer these questions
we designed and ran a user study with 5 participants completing two different tasks: a Tunnel
Test and a 3D game called Spaceship Demo. The method, results and discussion are provided
in the sections hereafter.


2. Method
In this section we explain the method followed in the user study covering apparatus, task
description, study design, study procedure and data collection techniques.
   Apparatus: An application integrated with the Tobii 4C Eye Tracker was developed
using Unity. We used a display with a resolution of 1920 px × 1080 px, a size of 53 cm × 30 cm,
and a refresh rate of 60 Hz. The eye tracker scans and estimates the user’s head position and
gaze at a frequency of 90 Hz. Throughout the experiment the user was sitting at a desk where
a mouse and keyboard were provided for interaction (see Figure 1).
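The display and viewing geometry above determine how many pixels one degree of visual angle covers, which is useful for interpreting eye tracker accuracy. A back-of-the-envelope computation, using the approximate 42 cm viewing distance from Figure 1 (this derived figure is our illustration, not a value reported in the paper):

```python
import math

DISPLAY_WIDTH_PX = 1920
DISPLAY_WIDTH_CM = 53.0
VIEWING_DISTANCE_CM = 42.0  # approximate eye-to-display distance (Figure 1)

# Width in cm subtended by one degree of visual angle at this distance.
cm_per_degree = 2 * VIEWING_DISTANCE_CM * math.tan(math.radians(0.5))

# Pixels per centimeter, then pixels per degree of visual angle.
px_per_cm = DISPLAY_WIDTH_PX / DISPLAY_WIDTH_CM
px_per_degree = px_per_cm * cm_per_degree
```

At this distance one degree spans roughly 27 px, so a typical gaze estimation error of about one degree displaces the focus point by a few dozen pixels.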
   Task and Study Design: We chose a within-subject design with two independent
variables: DoF mode and feedback. We compared three DoF modes: no DoF, mouse controlled
DoF, where the focus point moved with the mouse pointer, and eye tracking controlled DoF,
where the focus point moved with the gaze. With respect to feedback, we compared conditions
with and without feedback. The feedback was shown as a text popup indicating whether the
user correctly completed the task. Feedback was included in the study design in order to explore
the learning effect: we were interested in finding out whether users are capable of improving
their performance when feedback is provided.

Figure 1: Preview of setup: (Left) Tunnel Test and (Right) Spaceship Demo. Participants were seated
at a desk and aligned in a way that their eyes steadily remained at around 42 cm from the display.
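The three DoF modes differ only in where the focus point comes from. A minimal sketch of that dispatch (the function and mode names are hypothetical illustrations, not the paper's actual Unity code):

```python
def focus_point(mode, screen_center, mouse_pos, gaze_pos):
    """Return the 2D screen position whose depth should stay in focus,
    or None when DoF is disabled entirely."""
    if mode == "no_dof":
        return None            # no blur is applied at all
    if mode == "mouse":
        return mouse_pos       # focus follows the mouse pointer
    if mode == "gaze":
        return gaze_pos        # focus follows the eye tracker estimate
    if mode == "fixed":
        return screen_center   # traditional fixed center focus
    raise ValueError(f"unknown DoF mode: {mode}")
```

In the full system, the returned screen point would then be converted into a focal distance by raycasting from the camera through that point and measuring the distance to the first surface hit, as described for the fixed-focus case in the introduction.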
   The dependent variables were score, which indicates how many times the user successfully
completed the task, total duration, which indicates the total amount of time the user spent on
the task, and questionnaire responses.
   We ran two tasks: the Tunnel Test and the Spaceship Demo. The goal of the Tunnel Test
was to measure depth perception when the only depth cue is DoF. A 3D scene was generated
showing two spheres placed at different depths. The size of the spheres was scaled so that they
appeared to be of equal size, forming a symmetric arrangement inside a tunnel (see Figure 1,
left). The user was then asked to indicate which sphere was closer. This method was previously
used by [6]; however, in their experiment they did not use untextured 3D objects (e.g. spheres)
but rather 2D surfaces with relatively complex textures.
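The equal-apparent-size constraint can be implemented by scaling each sphere linearly with its depth, since under perspective projection on-screen size is proportional to radius divided by distance. A sketch of that scaling rule (our illustration, not the paper's actual code):

```python
def scaled_radius(base_radius, depth, reference_depth):
    """Radius a sphere at `depth` must have to appear the same size as a
    sphere of `base_radius` placed at `reference_depth`.

    Keeping radius/depth constant keeps the projected size constant, so
    apparent size no longer reveals which sphere is closer."""
    return base_radius * depth / reference_depth

# A sphere twice as far away must be twice as large to look the same.
assert scaled_radius(1.0, 4.0, 2.0) == 2.0
```

With apparent size neutralized this way, DoF blur remains the only cue available for judging which sphere is closer.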
   The Spaceship Demo was built upon an open source game [7], which we modified to
enable all DoF modes. It is a first person game controlled with a mouse and keyboard. The
players are tasked to navigate through a 3D environment, which helps them progress through a
fixed storyline. The story lasts approximately 5 minutes, during which the player can navigate
and explore the virtual environment freely. In this task we only collected qualitative data. We
composed questionnaires based on methods used in the works of [1, 5, 3].
   Participants and Study Procedure: We recruited 𝑛 = 5 university students as test subjects
via convenience sampling. The study started with a brief explanation of the study goals
and consent form approval. Participants were then seated in front of the computer. We then
conducted the 5 point eye tracking calibration, after which the first test (Tunnel Test) started.
The participants were shown how to interact with the system, after which the data capture
started. In each DoF mode, the order of which was randomized and counterbalanced, the
user repeated the task 20 times. We varied the difficulty of the task, creating 4 levels: the
higher the level, the closer together the two objects are, which in theory makes it more difficult
to figure out which object is closer. After completing the task the user answered a short
questionnaire. Afterwards, the same process was repeated with feedback enabled. This meant
that the users were informed about the correctness of their answer after each task repetition.
   The final test was the Spaceship Demo. The users played the game in each DoF mode, the
order of which was randomized and counterbalanced. After completing the test the users filled
in a questionnaire.
   Data Collection: Throughout the Tunnel Test we collected task time and task completion
score. At the end of each condition we also collected questionnaire responses. We inquired
about the following topics: level of comfort, difficulty of estimating the distance of objects, level
of immersion and difficulty of navigating the scene. We followed the metrics and scales used in
the study of [4].


3. Results




Figure 2: Tunnel Test results: (Top left) mouse controlled and eye tracking controlled DoF scores
without feedback across all 20 trials. The y-axis shows the consecutive number of the test repetition,
indicating the flow of time and highlighting whether any learning happened over time. (Top middle)
mouse controlled and eye tracking controlled DoF scores with feedback. (Top right) average duration
of the task in each condition. (Bottom left) total scores per condition. The black line shows the baseline
performance of random selection. (Bottom right) total score per difficulty level. The black line again
shows the baseline performance of random selection.


   The results of the Tunnel Test show there is no significant learning in any of the conditions
(see Figure 2, top left and top middle graphs). This is true for both no-feedback and feedback
conditions. When observing the results for total duration (see Figure 2, top right) we see that
users performed the task faster in the no DoF condition compared to the mouse and eye tracking
controlled DoF conditions. The results for task performance (see Figure 2, bottom row) show
that none of the modes managed to consistently outperform random selection. Furthermore,
there is no clear distinction in quantitative performance between the three modes we compared.
The qualitative results, collected in the form of questionnaire responses, showed that users
found the most compelling depth in the eye tracking controlled DoF condition; however, the
difference is very small compared to the no DoF condition. Furthermore, the no DoF condition
was chosen as the most popular mode.
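The random-selection baseline corresponds to guessing between the two spheres with probability 0.5 per trial. How far above chance a score of 20 trials must lie to be meaningful can be estimated with a simple binomial tail (an illustrative computation, not an analysis the paper performs):

```python
from math import comb

def p_at_least(k, n=20, p=0.5):
    """Probability of at least k correct answers out of n trials when
    guessing between two spheres at random."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))

# The expected chance-level score is n*p = 10 out of 20; a participant
# would need 15 or more correct before random guessing becomes an
# unlikely explanation (tail probability below 5%).
assert p_at_least(15) < 0.05 < p_at_least(14)
```

This is why scores hovering around the black baseline in Figure 2 cannot be distinguished from guessing.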
   In the Spaceship Demo, the mean rating for navigation of the 3D environment is highest
for mouse controlled DoF, followed closely by the no DoF condition. Eye tracking controlled
DoF has the highest mean ratings in questions 2 and 4, regarding viewing comfort and level
of immersion, respectively. The participants’ rankings of the conditions in the Spaceship Demo
from best to worst show that eye tracking controlled DoF was voted highest.


4. Discussion and Conclusion
In this research we explored the effects of eye tracking controlled DoF in 3D environments,
compared to manually (mouse) controlled DoF. By using eye tracking controlled DoF, we keep
the gaze point in focus, which in turn imitates real life vision. We designed an experiment that
measured both depth perception accuracy and subjective preference for different aspects of 3D
environments. We failed to find evidence that DoF improves depth perception. This was true
for both mouse controlled DoF and eye tracking controlled DoF. However, when considering
users’ preferences, our research shows that DoF can increase immersion. Furthermore, we show
this is also true in complex 3D scenes, such as high fidelity first person video games. However,
it is important to note that this study is limited by its small number of participants, which
prevented us from running statistical tests. These findings are therefore of a preliminary nature
and should be corroborated by extending the user base.


References
[1] S. Hillaire, A. Lécuyer, R. Cozot, G. Casiez, Depth-of-field blur effects for first-person
    navigation in virtual environments, in: Proc. of ACM VRST, 2007, pp. 203–206.
[2] A. Naceri, R. Chellali, F. Dionnet, S. Toma, Depth perception within virtual environments:
    Comparison between two display technologies, International Journal On Advances in
    Intelligent Systems 3 (2010).
[3] I. K. Li, E. M. Peek, B. C. Wünsche, C. Lutteroth, Enhancing 3D applications using stereoscopic
    3D and motion parallax, in: Proc. of the AUIC, 2012, pp. 59–68.
[4] M. Vinnikov, R. S. Allison, Gaze-contingent depth of field in realistic scenes: The user
    experience, in: Proc. of ETRA, 2014, pp. 119–126.
[5] S. Hillaire, A. Lécuyer, R. Cozot, G. Casiez, Using an eye-tracking system to improve camera
    motions and depth-of-field blur effects in virtual environments, in: Proc. of IEEE VR, IEEE,
    2008, pp. 47–50.
[6] M. Mauderer, S. Conte, M. A. Nacenta, D. Vishwanath, Depth perception with gaze-
    contingent depth of field, in: Proc. of ACM CHI, 2014, pp. 217–226.
[7] T. Iche, The Spaceship Demo project using VFX Graph and High-Definition Render Pipeline,
    2022. URL: https://blog.unity.com/technology/now-available-the-spaceship-demo-project-using-vfx-graph-and-high-definition-render.