=Paper=
{{Paper
|id=Vol-2744/paper65
|storemode=property
|title=Psychovisual Perception Scale Based on a Neural Network
|pdfUrl=https://ceur-ws.org/Vol-2744/paper65.pdf
|volume=Vol-2744
|authors=Vladimir Budak,Ekaterina Ilyina
}}
==Psychovisual Perception Scale Based on a Neural Network==
<pdf width="1500px">https://ceur-ws.org/Vol-2744/paper65.pdf</pdf>
<pre>
       Psychovisual Perception Scale Based on a Neural
                          Network

        Vladimir Budak [0000-0003-4750-0160] and Ekaterina Ilyina [0000-0003-0783-0931]

            Moscow Power Engineering Institute (National Research University),
                      Krasnokazarmennaya 14, Moscow, Russia
                  budakvp@gmail.com,kitesika@gmail.com


       Abstract. The purpose of this article is to construct a psychophysical scale of
       visual perception from lighting scene based on a direct propagation neural net-
       work using for assessment of real or synthesized images with spatial brightness
       distribution.
           Visual perception assessments of different scenes were obtained for 10 ob-
       servers at the experimental installation of the Department of lighting engineering
       of the MPEI (NRU). These results were checked and found out agreed with the
       numerical scale of visual perception proposed by Lekish and Holladay. Neural
       network was trained to predict a sensation at the level of 40-70%, depending on
       the scale category. For more careful prediction level in each of 5 categories of
       scale a new experiment should be done with new calibration and with tested in-
       structions and with more observers involved.
           The novelty consists in using a neural network as an expert to assess the de-
       gree of comfort of the lighting scene.

       Keywords: Glare Discomfort, Scale of Visual Perception, Neural Network


1      Introduction

Currently, neural networks (NN) are widely used in image and text recognition tasks,
in medicine as diagnostic systems, in quality control systems and in many other fields
where it is impossible to consider all the conditions that affect the decision. And there-
fore, only the most important ones are considered. In lighting engineering, the problem
of psychophysical assessment of a visual perception of the spatial brightness distribu-
tion into person's field of view can be assigned to this class of tasks. Processing of
visual information can be conditionally represented as: "Stimulus – sensation – percep-
tion - reaction" where the stimulus is the spatial brightness distribution and the reaction
is the sensation of comfort or discomfort caused by stimulus.
    Modern lighting calculation programs allow us to get quite realistic visualizations of
the designed lighting installations. To evaluate the quality of the brightness distribution
_________
Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License
Attribution 4.0 International (CC BY 4.0).
2 V. Budak, E. Ilyina


in terms of visual comfort using real or synthesized images, a psychophysical lighting
comfort scale (PLCS) should be built to rank all these images. PLCS could be used to
set the relationship between human sensation and the numerical values of the light
source’s and background’s brightness on the image. The probabilistic model of transi-
tion from category to category of sensation should be used in PLCS to predict with
some degree of accuracy what reaction can be expected from a lighting scene.
   The purpose of this work is to construct a PLCS using empirical data obtained from
an experiment where observer's reaction the depending on the brightness of a bright
light source observed on a uniform background is evaluated when performing a typical
visual task.
   Lekish and Holladay were the first who used a psychophysical assessment of visual
perception of lighting [1] and tried to find the relationship between the numerical values
of source’s and background’s brightness, and a set of human responses, expressed in
the categories of lighting comfort. They suggested a scale like as is scarcely noticeable,
is most pleasant, is still pleasant, is at limit of pleasant, is very comfortable, is still
comfortable, is less comfortable, is at the boundary between comfort and discomfort, is
perceptibly uncomfortable, is thoroughly uncomfortable, is at boundary between objec-
tionable and intolerable, is irritating and is painful. They had got an empirical formula
for value K as measure of «sensation» that looks like this:

                        K = log(Ls) + 0.25log(Q) – 0.3log(Lad)                         (1)
where Ls –brightness of source, Q – solid angle of light source, Lad – brightness of
adaptation or background for source. Each numeric value of K corresponds to one of
the thirteen categories of the scale, for example, K = 1.9 indicates a feeling on the
boundary between comfort and discomfort (BCD).
   In turn, Hopkinson's research [2] showed that sensations on the scale could not be
clearly defined as values of brightness, they can only be expressed by an interval. So
the transition from scale’s category to another can be set by a probability function. For
example, Hopkinson shows that the function describing BCD has form as a sigmoid.
Thus, judge a sensation can be predicted only with a certain probability.
   The scale of sensations was compared with the values of the discomfort index cal-
culated using an empirical formula for a single source of light using a system of cate-
gorical judgments [3]. Since the sensitivity of the eye changes exponentially as the
brightness level increases, the standard deviation interval also increases for each sub-
sequent interval on the scale. Based on this, the hypothesis of a normal distribution of
ratings on the scale was confirmed, which allows us to find the numerical boundaries
of the intervals of subjective categories in standard deviation units and determine the
numerical boundaries of the intervals for a certain probability value, for example,
p=0.5.
   Construction of a PLCS using the probabilistic model of transition from category to
category can give a more complete description how the light environment (the bright-
ness distribution in space) affects a person. It can become a measure of the typical atti-
tude of a group of observers to lighting conditions.
   Lakish’s and Guth’s experiment (in 1949) investigated the influence of various fac-
tors on the visual perception on BCD for different brightness levels of glare source in
                                 Psychovisual Perception Scale Based on a Neural Network 3


the range from 1000 to 6000 cd/m2, and as a result, these studies contributed to clarify-
ing definition, which formed the basis of modern unfired glare ratings (UGR) [4].
    An experimental at the Department of lighting engineering in MPEI (NRU) was
built to study the discomfort from glare sources with various shapes and brightness in
the observer’s field of view/ It’s similar to the Lakish’s and Guth’s installation, but has
modern light sources and a wider range of brightness from 33 to 100,000 cd/m2. Mod-
ern experiment for BDC shown that the results were agreed with the experiment of
Lekish and Guth installation [5]. Initially, the psychophysical scale proposed by Lekish
and Holladay was used to find BCD. But later a scale was reduced from 13 to 5 cate-
gories: barely noticeable, comfortable, uncomfortable, unpleasant, and painful. From
our point of view these 5 categories are enough for building of new PLCS.
    From the point of view of NN, the task of PLCS’s construction is most like the class
of task «many-to-many», which include image recognition. Such a tasks have several
input parameters, and the result of the NN is the classification of objects into several
categories. When a network divides input vectors into two classes, it is sufficient to
have only one network output that takes the values «1» or «0». But in our case, the scale
of sensations has 5 categories, so the number of outputs should be equal to 5. For ex-
ample if the observer's response corresponds to a category barely noticeable then the
value should be written as "1", the rest as" 0", and this logic of recording should be
applied to each observer's response.
    As a result, the problem is reduced next conditions: there are several photos of the
lighting system with a spatial distribution of brightness at the input, and there are real
estimates of the observers of these scenes corresponding to the five categories of the
scale at the output. During training, the NN will extract certain features from the input
images, collect and classify them in a certain way. After training, the network should
output the probability of an image or group of images falling into each category of the
scale. The maximum probability will determine the category of the comfort scale, that
is, the most likely reaction of typical observers to the distribution of brightness in the
room.


2      Methods

   If we simplify the distribution of brightness on image to the ratio of the brightness
of light source of finite size viewed on the uniform background, then the input vector
into NN have two variables: y1 = LS and y2 = Lad.
   In the future, the length of the vector y can be changed by adding, for example,
correlated color temperature of the light source (y3). Moreover, later y1 and y2 can be
used both from experiment and from an image with a spatial distribution of bright-ness
in a real photograph or modeled in a lighting calculated program. The recognizing and
classification of images tasks are well solved using convolutional NN. In the framework
of this article, we will not consider this issue.
   The input and output data for training of the NN were obtained from an experiment
performed on an experimental installation at the Department of Lighting Engineering
of the MPEI (NRU). The technique consisted of assessing the sensation when a glare
4 V. Budak, E. Ilyina


source appeared in the observer's field of vision while reading the inscription. The back-
ground brightness was uniform. The adaptation brightness was taken equal to the back-
ground brightness. The experiment involved 7 observers with a back-ground brightness
of 57 and 113 cd/m2 in the dark time, and 3 observers with a background brightness of
66, 77, 95 and 119 cd /m2 in the daytime in cloudy weather. Each observer evaluated
the first sensation and registered the value of source brightness. Then the bright-ness of
the source changed (gradually increased or decreased) till the observer registered the
second sensation and these actions were repeated up to fifth sensation. Then the back-
ground brightness was changed to next.The measurement was carried out both on an
ascending and descending scale. The observer was seat on the chair at distance of 1.5
meters from the inscription "Experiment", which he read. The inscription was at the
level of the line of sight. The source with diameter 30 mm was located just above the
line of sight.
    Reading of inscription imitates condition for performing visual work when glare
source can appear in the person’s field of view and cause sensation of discomfort.Since
the scale of sensations refers to the scale of verbal categories, the observer can find it
difficult to spread his feelings in a wide range. That’s why special attention was paid to
the briefing. It should be clear to the observer in terms that are familiar to him. In our
experiment the briefing was as follows: barely noticeable - the source becomes notice-
able during reading the inscription, this at the moment when the observer detect the
source at the first time; “comfortable” - it is pleasant to read the inscription and the light
source does but does not interfere with reading, “uncomfortable” - the sensation of a
“buzzing fly”, the source already attracts attention, but do not , “unpleasant” - distracts
from reading the inscription , "Unbearable" means pain-fully.
    Initially, the experiment was done with LEDs covered white fabric without limiting
the angular size of the source. Later it was fixed. It is very important to limit angular
size of the source during experiment because it can affect on the results.
    The installation was calibrated for each the brightness of source and background
using the Konia Minolta LS-110 brightness meter. A feature of the installation is that
the brightness measurement is carried out indirectly. The observer sets not the source’s
brightness but the voltage on the LEDs (U). The calibration curve LS = f (U) shown
that in the range from 7.2 to 7.8 V, the brightness increases from 20 to 1870 cd/m2 (see
Fig.1).
    Brightness of background is set by the level of general lighting in the room. When it
turned out there is a contribution from general lighting into the brightness of source
surface. Therefore, calibration was performed for each background of brightness level
separately, in order to reduce the error at low brightness levels. This limitation affects
the ability to accurately determine the interval of brightness for sensations: barely no-
ticeable and comfortable.
    The measurement was carried out at one adaptation level, then it was changed. There
was a pause between the measurements so that the observer's eyes could adapt. All
measured and registered data were converted into input and output vectors for transfer-
ring them into nprtool in Matlab. This application helps to solve the problems of clas-
sifi-cation of pattern recognition [6]. After that, a NN with direct distribution was cre-
ated. It contained several hidden layers and an output layer (see Fig. 2). All neurons
                                  Psychovisual Perception Scale Based on a Neural Network 5


may have their own activation function; the number of neurons corresponds to the num-
ber of classes n on the last layer, in our case n = 5.


Fig. 1. Calibration curve for source brightness from the voltage on the LED at Lad = 54 cd/m 2

    The training of NN was performed using the standard trainscg function, which can
train any NN if its weight, clean input signal and transfer functions have derivative
functions. To solve multidimensional logical problems, the Softmax activation function
is applied, which looks like this:

                                               e yi
                                   S ( y )i = n
                                             e j
                                                    y

                                               j =1                                        (2)

The function converts a vector with dimension j into a vector S with the same dimen-
sion. Each coordinate S(y)i of the resulting vector is represented by a real number in
the interval [0,1] and the sum of coordinates S(y)i is equal to 1. The layer index n is
usually omitted, therefore which implies that this is the last layer.


Fig. 2. The scheme of the neural network. (Input - input vector of dimension j = 2, Hidden -
number of hidden layers n = 5, Output - output layer with 5 classes, Output - probabilities for
each class)
6 V. Budak, E. Ilyina


    Our NN has five outputs, then it uses Softmax function to produce a set of probabil-
ities for each sensation caused by the brightness ratio. To assess the reliability, we use
the surface defined as:

                                     p = f(LS,Lad)                                      (3)
where p is the maximum probability of each event on the PLCS. In other words, this
probability surface shows how each point in space with coordinates LS and Lad refers
to a sensation. The higher the probability value, the higher the confidence.To determine
the sensations on the PLCS, we build a sensation surface in the same space, which can
be defined as:

                                    R = f(LS,Lad)                                       (4)
Thus, our model of scale may represent some surface - a field of predictions for a given
initial data.


3      Results

   Average brightness values obtained during the experiment for each category of the
scale were used to calculate values K using formula (1). Calculated values K were
matched with Lekish-Holliday’s scale (see Fig. 3).


       Fig. 3. Value K and the corresponding sensation on the Lekish-Halladay’s scale

It can be noted that the values K for barely noticeable and still nice coincide rather
good. There is a slight discrepancy for the category of BCD. Thoroughly uncomfortable
                                    Psychovisual Perception Scale Based on a Neural Network 7


and painful coincide at Lad = 54 cd/m2 and slightly diverge at Lad = 113 cd/m2. In this
case, thoroughly uncomfortable corresponds to the instruction unpleasant in our exper-
iment. The all values K > 2.7 are considered as painful [1]. Nevertheless, such a graph
does not provide information how the feeling can be translated into the brightness of
source and background and how to determine the width of each interval. To approxi-
mate the sensation according to the normal law using response frequencies for a five-
point scale, it’s needed to find the only numerical value for one fixed brightness of the
source and background. We constructed graphs with the distribution of response fre-
quencies for each category of the scale (see Fig. 4). Several maxima can be observed
within each category. This indicates the lack of statistical data. In barely noticeable
and comfortable the maxima are shifted toward the light source’s brightness 42 and 450
cd/m2 respectively. For uncomfortable there are two maxima 3107 and 8910 cd/m2
instead of one. Unpleasant and unbearably have a more smeared character without a
clear maximum. Presumably it could be due to insufficient data or because it was dif-
ficult to correctly separate the person’s sensation of discomfort and unpleasant. As for
the painful category, everyone has their own threshold and more statistics are needed
here.


Fig. 4. Graphs of the distribution of response frequencies for each category of the scale depending
on the brightness of the light source

   All the results were used to train NN and to construct the PLCS where sensation is
a function of brightness of the source and background. Having built a multilayer artifi-
cial NN it’s important to determine whether it adequately describes the data and the
dependencies between them. As a rule the number of neurons in the hidden layer is
selected experimentally by comparing the accuracy of NN. So, the number of hidden
layers in the NN is a custom parameter. The minimum values of cross entropy (CE) and
8 V. Budak, E. Ilyina


percent of errors (the proportion of incorrectly classified examples) are preferred for
the training, test, and test sets. The smaller the error, the better result[10].
   In our test, we take the number of hidden layers from 2 to 100. A graph of the error
depending on the hidden layers of the NN, has a form like a cardiogram. Anyway it was
determined that the NN with 32 layers has minimum percentage of error, and NN with
11 layers has an error is closest to the minimum.
   Not only the CE parameters and the percent of errors could be used to evaluate the
quality of the NS operation, but also a Confusion Matrix is used. For each class of
observations, the results of assigning observations to a class are given. The matrix al-
lows to see whether the classifier confuses classes. The matrix columns correspond to
the predicted classes, and the rows correspond to the actual classes.
   Fig.5 shows the percentage of error for NN with 32 layers. Classes from 1 to 5 mean
categories on the scale from barely noticeable to unbearable. According to the Confu-
sion Matrix, grades from 1 and 5 have a minimum number of errors 9.8% and 28.6%
respectively. These values correspond to the criterion of prediction accuracy of at least
70%. Grades 2, 3 and 4, have a lower the prediction accuracy. Let's see how these pa-
rameters change for NN with 11 layers (see Fig.5). The percentage of errors in all clas-
ses except the grades 2 and 4 decreases. In this case, accuracy for grade 2 (comfortable)
and grade 4 (unpleasant) is rather low. But for grade 3 (discomfort) it is over 60%
which is quite high. This clearly indicates that the NN with 11 layers works more ade-
quately compared with NN with 11 layers. To improve the accuracy of the prediction,
additional data from the experiment is required for at least the comfortable and unpleas-
ant categories.


                        Fig. 5. All Confusion Matrix for 32 and 11 layers

   The shape of the Receiver Operating Characteristic (ROC-curves) is also an im-
portant indicator of the quality of the neural network. For an ideal classifier, the ROC-
curve graph passes through the upper-left corner, where the percentage of true positive
cases is 100% or 1, and the percentage of false positive examples is zero. Therefore,
                                   Psychovisual Perception Scale Based on a Neural Network 9


the closer the curve is to the upper-left corner, the higher the predictive power of the
model. On the contrary, the smaller the curve bends and the closer it is to the diagonal
line, the less efficient.
   As it can be seen at Fig.6 neural network with 11 layers has the ROC-curves that
pass closer to the upper-left corner, which means that the predictive ability of this model
is higher than for other two. At the same time, it is clearly noticeable that ROC-curves
for model with 32 layers passes below the diagonal. It means that model lie.
   Let's construct surfaces p and R defined by formulas (3) and (4) for LS range from
100 to 100000 cd/m2 and Lad range from 54 to 120 cd/m2 using NN with 38 and 11
hidden layers. From Fig.7 the shapes of both surfaces change depending on the number
of layers. The surface obtained based on NN with 11 layers looks less ragged.


     Fig. 6. ROC-curves depending on the number of hidden layers: 32(left) and 11 (right)


Fig. 7. Probability surfaces for PLCS for cases: NN with 38 layers (left) and NN with 11 layers
                                       (right) respectively

The higher the accuracy of the prediction (or probability), the more NN works like an
"expert" who can predict: what kind of sensation will be caused by the ratio of the
brightness of source and the background in a typical group of observers.
10 V. Budak, E. Ilyina


4      Discussions

   In this article, we reviewed the results of an experiment on MPEI’s installation and
the idea of using neural networks to construct PLCS depending on the ratio of the
brightness of source and the background was tested.
   Despite that response rates (frequencies) curves (Fig. 4) have no maxima for un-
pleasant and unbearable, the probability surface p shows the greatest NN’s accuracy for
these categories. This may be because the observer can easily detect them, that’s why
NN can be training more accurately and provide classification with p>0.7. As for barely
noticeable, the surface p shows the probability p>0.5 (green and yellow zone). Com-
fortable and uncomfortable have the lowest values of p, they contain zones where p<0.5
(blue color). Thus, the resulting model has a prediction accuracy of 40-70%, depending
on the category of scale.
   This work has shown that this idea can be used to build a model for psychological
assessment of the spatial distribution of brightness in a lighting scene from comfort
point of view if provided a enough experimental sample for training. Even though now
PLCS has lower accuracy than required, this work allows us to formulate criteria for
the future model of the scale and requirements for a new experiment. To provide best
predicational ability of NN the brightness of background in new experiment should be
at the range from 50 to 200 cd/m2 with step of 10 cd/m2 and a greater number of ob-
servers should be involved.
   Also, instructions used in this work for calibration of the experimental installation
and for briefing of observers before starting can be used in future.
   This can improve the accuracy of the input data, and therefore improve the work of
the NN as the main "expert" that assess the comfort of lighting scene using synthesized
images.


References
 1. Luckiesh, M., Holladay, L.: Glare and Visibility. Transactions of the IES. 20: 221(1925)
 2. Hopkinson, R.G.: Evaluation of Glare. Illuminating Engineering 52(6), 305–321 (1957)
 3. Kotik, G., Matveev? A., Perejma, V., Tohadze, I.: Kategorijnye ocenki kachestva i ih vzai-
    mosvyaz' na psihofizicheskoj shkale, Svetotekhnika, 3, 3-5 (1975)
 4. Luckiesh, M., Guth, S.: Brightness in the visual field at borderline between comfort and
    discomfort. Illuminating Engineering, 44(11), 650 (1949)
 5. Budak, V., ZHeltov, V., Meshkova, T., Notfullin, R.: Ocenka kachestva osveshcheniya na
    osnove prostranstvenno-uglovogo raspredeleniya yarkosti. Svetotekhnika, 3, 17-22 (2017)
 6. Beale, M., Hagan, M., Demuth, H.: Deep Learning Toolbox Getting Started Guide R2019b,
    MathWorks, 2019, 162

</pre>