=Paper= {{Paper |id=Vol-2485/paper4 |storemode=property |title=Implementation of the Radiosity Algorithm for Large Scale Scenes |pdfUrl=https://ceur-ws.org/Vol-2485/paper4.pdf |volume=Vol-2485 |authors=Alexander Shcherbakov,Vladimir Frolov }} ==Implementation of the Radiosity Algorithm for Large Scale Scenes== https://ceur-ws.org/Vol-2485/paper4.pdf

Implementation of the Radiosity Algorithm for Large Scale Scenes
A.S. Shcherbakov1,2, V.A. Frolov1,3
alex.shcherbakov@graphics.cs.msu.ru|vladimir.frolov@graphics.cs.msu.ru
1Lomonosov Moscow State University, Moscow, Russia;
2Gaijin Entertainment, Moscow, Russia;
3Keldysh Institute of Applied Mathematics, Moscow, Russia

We propose an upgrade for the Radiosity algorithm that allows to efficiently apply radiosity for large scale
scenes. This is achieved by considering only the patches located close to the observer. For each frame we update
local form-factor matrix with a little set of patches, effectively reusing information from the previous frame in this
way. Our method is completely expressed via matrix-vector operations, thus, it’s GPU implementation is natural and
straightforward. We achieve high occupancy for both CPU and GPU versions of the algorithm by thanks to we
use special matrix of several reflections for which update operation effectively combine computations with memory
operations.
Keywords: Global Illumination, Radiosity, Large Scenes, GPU

1. Introduction 2.2 PRT Methods
Precomputed Light Transport (PRT) methods al-low
The methods of global real-time global illumina- to move the most complex calculations from the
tion in general can be divided into two classes: the rendering stage to the precomputation stage.
methods that uses preprocessing/precomputation and
For example, Radiance regression function [17] is
those who does not uses it (i. e. calculate everything in
based on neural networks. It is restricted by static
dynamic). The main advantage of methods that do not
geometry and materials. The second disadvantage of
use pre-processing is the fully dynamic content.
method is that it works only for point light sources.
However, the methods of the first group due to the
Spherical harmonics based methods [8, 20] uses
precomputation and pre-integration usually are sig-
idea of ”relighting”, treating the dynamic light source as
nificantly better in terms of speed/quality ratio.
a linear combination of pre-processed sources.
Our work is devoted to the problem of building These methods suffers from light leaking which was
methods of the ”intermediate class” that combines the noted in [20]. In general these methods are well-suited
advantages of both classes. for outdoor but not indoor scenes.
Despite huge number of papers in real-time global
illumination, radiosity [3, 19] to this day remains one of
2. Related works the best method by means of speed/quality ra-tion.
It is used by Enlighten and Unity for exam-ple [1, 5,
2.1 Dynamic methods 12, 14] and lighting engineering programs [22, 23].
Radiosity, like the previously considered methods,
uses scene discretization (but use patches instead of
LPV [9], VCT [4] and Volumetric Irradiance Map voxels). The pre-calculation is done by calculating the
[20] discretizes the space around the observer and matrix of form-factors. The form-factor is a value in-
model the movement of light along the voxel grid. dicating how much of the lighting is transferred from
Main disadvantage of LPV and VCT is the high com- one patch to another. The calculation of the global
putational cost and high memory requirements. Volu- illumination is reduced to solving the linear equation
metric Irradiance Map [20] is uses for half-static scenes system or several matrix-vector multiplications. By
and the secondary lighting is applied to dynamic ob- expanding the calculations at the preprocessing stage, it
jects, but these objects do not contribute to secondary is possible to reduce the calculation of the global
lighting of static objects. Thus, Volumetric Irradiance illumination method to a single matrix-vector multi-
Map [20] is one of those ”intermediate class” exam- plication [18].
ples. However Radiosity still have O(N 2) algorithmic
complexity where N is the number of scene patches.
Instant Radiosity [11], Reflective Shadow Map [6] This is the main disadvantage of the algorithm, since
and analogues creates secondary light sources, calcu- such computational complexity is unacceptable for
lating the secondary illumination in the same way as large scenes (with an increase in the number of sites
the primary. The most significant disadvantage of N ). Our work is aimed specifically at solving this
RSM-based methods is their low precision. As the problem. The existing approaches of hierarchical ra-
number of secondary sources increases, the accuracy diosity and analogues [7, 13] are light-dependent (due to
increases, but the calculation time also increases sig- they split scene to patches according to the cur-
nificantly [2, 15, 21].

Copyright © 2019 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
rent lighting setup) and does not exploit its the main Therefore, third idea is to combine updating the
advantage — precomputation of form-factors. local matrix with the multiple reflection matrix from
[18]. This will allow organizing a pipeline in which
3. Proposed approach calculations are immediately performed for each form-
For a better explanation of our methods, we will factor obtained from the global matrix with form-
begin with a few simple ideas that will be refined fur- factors from the local matrix. That is, we will store
ther. and update the local matrix of several reflections,
Our first idea is that we can consider only M clos-est rather than the usual local matrix of form-factors. In
patches to the observer (where M << N ). We as-sume this case, the subsequent calculation of the global il-
lighting from other patches equals to some con-stant or lumination is reduced to a single multiplication of the
evaluate it on a rougher approximation (that is, we may matrix by the vector, which, given the small size of the
build cascades of discrete representations of the scene in local matrix, is very cheap.
the same way as the Cascaded LPV [10] does). This idea Finally, the fourth idea is that with a small
leads us to the fact that we can make a submatrix of movement of the observer, it is possible to update the
form-factors, on which we will perform basic local matrix less often than to calculate the lighting on
calculations. We will call it local matrix (Fig. 1, left). it (for example, once in 2-5 frames). Therefore, it
However, the direct implementation of this method makes sense to reduce the cost of the stage of cal-
will not be efficient because the matrix of the form- culating the lighting using a local matrix of several
factor for the entire scene (even if special methods reflections.
are used to effectively store sparse matrices) does not fit
into the computer’s RAM. Therefore, reading even M 2 3.1 Baseline
of random elements (shown in green in Fig. 1, left)
from DRAM or external memory cannot be im- The matrix of form-factors of several reflections
plemented efficiently. [18] is based on the idea of the composition of form-
Therefore, second idea is that it is necessary to factors. We further describe the principle of composi-
reuse information from the previous frames as much as tion on simple examples.
possible, updating only patches that were not loaded on Consider three patches with indices i, j, k. Then Fij
previous frames (Fig. 1, right). This allows us to and Fjk — form-factors of energy transfer from the j-th
reduce the number of form-factors that are loaded into path to the i-th patch and from the k-th patch to the j-
the local matrix each frame to only a few readings (or th patch respectively. Colorsj — color of j patch.
approximately M if the observer moves quickly, Then value
which, nevertheless, is significantly better than M 2).
Fij · Colorsj · Fjk
However, reading even a few elements from the
global matrix at random addresses is still inefficient shows how much of the world will go from the k-th
for modern computing systems. During the update of patch to the i-th patch when the multi-reflection from the
the local matrix of form-factors, computing system (both j-th patch.
CPU and GPU) will be practically idle. At the same After going through all patches as intermediate in
time, after updating, when calculating the radi-ance on the reflection, we get the complete second reflection of
the local matrix of form-factors, they will be loaded and light from the patch number k to the patch number i:
these calculations may become a bottle-neck since you
∑N F · Colors · F ,
need to perform at least 3 multiplica-tions of the local F=ik
2 ij j jk
matrix by a vector (to account for three light bounces). j=0
where N — number of patches in local matrix.
Thus, at the preprocessing stage, it is possible to
calculate reflections of an arbitrary order. The mul-
tiple reflection matrix contains a specified number of
reflections.

3.2 Local matrix of multiple-reflection
In order to extend this approach to a local ma-trix in
the surrounding of an observer, two ways are possible:
local matrix matrix update
1. Calculation of the matrix of several reflections for
Fig. 1. Local matrix is a submatrix of global form- the entire scene and the use of its parts.
factor matrix. It is composed of green-labelled cells 2. ”Online” matrix calculation for the observer’s area
(left). Changing sites in the surroundings of the using the usual matrix of form-factors (Fig. 1,
observer when moving (right). right).
The first option is not applicable for large scenes 3.3 GPU implementation
since it requires to store 3N 2 of real numbers. The To implement the proposed method we create 6
proposed method implements the second option, since the
matrix of form-factors is sparse and can be effi-ciently compute shaders to update the matrix, one to delete the
stored/accessed. patch data and one shader to update the lighting. The
To update the matrix of several reflections, two op- following shaders are used for updating:
erations are required: (1) adding a site to the matrix and 1. Calculating double_ref lection (1) using parallel
(2) removing the site from it. Delete operation is reduction on GPU.
implemented by zeroing the row and column of the 2. Calculates triple reflections gcol, grow (2) by mul-
matrix of form-factors corresponding to the deleted tiplying double_ref lection with form-factors.
patch. 3. Multiplying gcol by a matrix of form-factors to get ′
col (3).
4. Multiplying the form-factor matrix by grow to get g′
row (3).
5. Adding to the matrix of information about reflec-
tions in view of the new patch (5).
6. Adding new site form-factors to the matrix.

Fig. 2. Scheme of updating form-factors for local
matrix of several reflections.

To add a new patch, we must consider several types of
reflections. First, we need to include the usual form-
factors fcolumn and frow (Fig. 2, ”single”). Next,
triple reflections are taken into account, in which light is Suggested approach; (2 ms, 128 MB)
reflected twice from a new patch. To do this, double
reflection is first calculated (Fig. 2, ”double”):
double_ref lection = fcolumn ◦ Colors · frow (1)
By using double reflection double_ref lection
triple reflections can be calculated further (Fig. 2,
”triple”):
gcol = fcolumn · Color · double_ref lection
(2)
grow = frow · Color · double_ref lection
Finally, we add information on form-factors that
takes into account the light reflections between
patches already participating in the matrix (Fig. 2,
Multiple reflection matrix from [18]; (300 ms, 30GB)
”interreflections”):
′
gcol = Flocal · gcol
′ (3)
grow = grow · Flocal
Thus, the form-factors for the new patch are equal
to
′
gcol + gcol + fcolumn ,
′ (4)
grow + grow + frow .
Now, in addition to adding the form-factors of the
new patch, it is required to update the form-factors of
the rest of the matrix. For this we use previously
calculated form-factors for new patch (4): Difference image. In average is is less than 0.05.
′
Flocal ← Flocal + (gcol + gcol + fcolumn )·
(5)
′
·(grow + grow + frow ) Fig. 3. Comparison of Radiosity for the full matrix
Thus, we can add and remove patches from the local and the proposed method. Since the full matrix 63125
matrix. The size of the matrix is limited by a constant × 63125 does not fit in the memory of the GPU, the
specified by the user. time was estimated.
The most computational-cost are steps 3-5. How-ever, [4] Crassin, C., Neyret, F., Sainz, M., Green, S., and
steps 3 and 4 are just matrix-vector multiplica-tions, and Eisemann, E. (2011, September). Interactive indirect
the step 5 implements the addition of two matrices. Both illumination using voxel cone tracing. In Computer
of these these operations are to be easily implemented on Graphics Forum (Vol. 30, No. 7, pp. 1921–1930). Ox-
GPU. ford, UK: Blackwell Publishing.
[5] Cupisz, Kuba, and Kasper Engelstoft, Lighting in
Unity, Game Developers Conference, Mar. 2015.
3.4 Details http://www.gdcvault.com/play/1021765/Advanced-Visual-
Effects-With-DirectX
The matrix of form-factors can be stored as a tex-ture [6] Dachsbacher, C., Stamminger, M. Reflective shadow maps //
in the format RGBA16F. At the same time, the matrix Proceedings of the 2005 symposium on Inter-active 3D
size is limited to 4096 pixels on some GPU due to the graphics and games. — ACM, 2005. — �.
specific hardware limits (mobile GPUs). 203-231.
[7] Durand, F., Drettakis, G., and Puech, C. Fast and
4. Results accurate hierarchical radiosity using global visibility.
ACM Trans. Graph. 18, 2 (April 1999), 128-170.
The proposed method was compared with the com-
DOI=http://dx.doi.org/10.1145/318009.318012
mon Radiosity on local form-factor matrix and signif-
[8] Green, R. Spherical harmonic lighting: The gritty de-
icantly outperforms it (Fig. 4). tails //Archives of the Game Developers Conference.
– 2003. – Vol. 56. – p. 4.
[9] Kaplanyan, A. Light propagation volumes in cryengine 3 //
ACM SIGGRAPH Courses. 2009. Vol. 7. p. 2.
[10] Kaplanyan, A., Dachsbacher, C. 2010. Cascaded light
propagation volumes for real-time indirect illumina-
tion. In Proceedings of the 2010 ACM SIGGRAPH
symposium on Interactive 3D Graphics and Games
(I3D ’10). ACM, New York, NY, USA, 99-107.
[11] Keller A. Instant radiosity. — 1997.
[12] Magnusson, Kenny, Lighting You Up with Battle-
field 3, Game Developers Conference, Mar. 2011.
http://www.frostbite.com/2011/03/lighting-you-up-in-
battlefield-3/
[13] Marries van de Hoef. Real-Time Dynamic Radios-
ity for High Quality Global Illumination. 2013. Master
Fig. 4. Comparison of the proposed method for a local Thesis. ICA-3220516. Utrecht university.
matrix of several reflections with an a common [14] Martin, Sam, and Per Einarsson, A Real-Time Ra-
Radiosity for different local matrix sizes. diosity Architecture for Video Game, SIGGRAPH Ad-
vances in Real-Time Rendering in 3D Graphics and
We test our method on the scene of 63125 patches Games course, July 2010.
[15] Ou, J. and Pellacini, F. 2011. LightSlice: matrix slice
(Fig. 3). We took a Minecraft scene in order to omit
sampling for the many-lights problem. ACM Trans.
scene approximation algorithms in our work. At high
Graph. 30, 6, Article 179 (December 2011), 8 pages.
speed, the proposed method gives an image close to DOI: https://doi.org/10.1145/2070781.2024213
the image obtained by the method of full-matrix Ra- [16] Pranckevicius, Aras, Jens Fursund, and Sam
diosity on the CPU. Martin, Advanced Lighting Techniques in Unity,
Unity DevDay, Game Developers Conference, Mar.
5. Acknowledgments 2014. http://aras-p.info/blog/2014/05/05/shader-
This work was sponsored by RFBR 18-31-20032 compilation-in-unity-4-dot-5/
grant. [17] Ren, P., Wang, J., Gong, M., Lin, S., Tong,
X., and Guo, B. 2013. Global illumination with
6. References radiance regression functions. ACM Trans. Graph. 32,
4, Article 130 (July 2013), 12 pages. DOI: https://
[1] Akenine-Moller, T., Haines, E., Hoffman, N. Real-time doi.org/10.1145/2461912.246200
rendering. Fourth Edition. – AK Peters/CRC Press, [18] Shcherbakov, A., and Frolov, V.Accelerating radios-
2018. page 482. ity on gpus. In WSCG’2017 Full papers proceedings
[2] Budak, V.P., Zheltov, V.S., Kalakutsky, T.K. Local es- (2017), 2701, Computer Science Research Notes 2701
timations of Monte Carlo method with the object spec-tral Pilsen, Czech Republic, pp. 99–105.
representation in the solution of global illumina-tion. // [19] Sillion F. X. et al. Radiosity and global illumination.
Computer Research and Modeling, 2012, vol. 4, no. 1, pp. San Francisco : Morgan Kaufmann, 1994. Vol. 1.
75-84. [20] Yudintsev, A. Scalable Real-Time Global Illumi-
[3] Cohen, M., GreenBurg, D. The Hemi-cube: A Radios-ity nation for Large Scenes. GDC Talk. 2019. URL =
solution for complex it is hard to achieve environ- https://www.gdcvault.com/play/1026469/Scalable-Real-
ments // Proceedings of SIGGRAPH 85 in Computer Time-Global-Illumination
Graphics, 1985. 19. number 3. pp. 31-40.
[21] Walter, B., Fernandez, S., Arbree, A., Bala, K.,
Donikian, M., and Donald P. Greenberg. 2005. Light-
cuts: a scalable approach to illumination. ACM
Trans. Graph. 24, 3 (July 2005), 1098-1107. DOI:
https://doi.org/10.1145/1073204.1073318
[22]DIAL Dialux. 2019. URL =
https://www.dial.de/en/dialux/
[23] Relux Informatik AG Relux. 2019. URL =
https://reluxnet.relux.com/en/