=Paper=
{{Paper
|id=Vol-2299/paper3
|storemode=property
|title=Promenade: A Minimalistic Sculptural Interface for Immersive Soundscape Presentation
|pdfUrl=https://ceur-ws.org/Vol-2299/paper3.pdf
|volume=Vol-2299
|authors=Martin Rumori
|dblpUrl=https://dblp.org/rec/conf/fmt/Rumori18
}}
==Promenade: A Minimalistic Sculptural Interface for Immersive Soundscape Presentation==
Promenade A Minimalistic Sculptural Interface for Immersive Soundscape Presentation Martin Rumori Institute of Electronic Music and Acoustics University of Music and Performing Arts Graz, Austria Email: rumori@iem.at Abstract—Promenade proposes an interface for the semi- personal, interactive presentation of soundscapes that allows for the seamless integration of visual material. It combines the excitation of a flat surface using transducers with transaural cross-talk cancelled stereophonic projection. The unfolding of a wide spatial image within the sweet spot is contrasted by a collapsed, localised impression outside. This inherent interactive behaviour may be further extended by means of a distance sensor. Promenade emerged from an experimental, arts-based research project. It therefore targets contexts that feature custom audio content rather than aiming at general-purpose applications requiring optimal acoustic properties. I. I NTRODUCTION Exhibitions or other presentations of distinct objects often call for an unobtrusive sound projection means that affords the visitors an immersive, mostly individual listening experience. Achieving such an experience requires spatial capabilities of the chosen projection technique. In many such cases, headphones are used. Spatially rich sound material may be projected in particular with binaural signals. A major side effect is the inherent reconfiguration of our multi-modal perception, as the surrounding auditory domain is largely replaced by the projected one. Additionally, headphones may be considered as cumbersome or unhygienic in public exhibitions. Multichannel loudspeaker setups, in con- Figure 1. Promenade sound projection device. trast, do not require physical engagement with the projection medium. However, they do not facilitate individual listening situations and they have to be mounted in an often complex This paper describes the basic idea of Promenade and its constellation around the target listening area. originating context (section II), followed by introducing the Promenade is a minimal auditory display prototype that two main techniques involved, transaural crosstalk-cancelled relates to such issues. It is important to notice that Promenade sound projection (section III) and the principle of flat panel has not been developed to overcome the limitations of other loudspeakers (section IV). Finally, some findings that arose projection means. Instead, it emerged from a series of ex- from practical experience are discussed, along with possible periments in an arts-based research context as part of a sound modifications and extensions (section V). installation with a focus on aesthetic listening experience. Due to its simplicity, its purpose and the idiosyncratic technology II. P ROMENADE it incorporates, Promenade is not a universal, general-purpose The Promenade sound projection device consists of a flat spatial sound projection system but it manifests certain lim- surface of variable size with two electrodynamic exciters itations. However, it might provide an appropriate and eleg- mounted on the back (Figure 1). Audio signals are fed to ant solution in some of the circumstances described above. the exciters such that the surface acts as a loudspeaker mem- Promenade’s qualities with respect to those circumstances brane. The physical properties of the surface material lead are regarded retrospectively here in order to explore further to a certain radiation characteristics. The directional effect is possible fields of application and obvious modifications for further emphasised by crosstalk-cancelled stereophony, which different forms of appearance. creates a large auditory canvas that is much wider than the 25 Promenade: A Minimalistic Sculptural Interface for Immersive Soundscape Presentation projection panel (see section III). The effect occurs at a central listening position, facing the device from a distance of a few ten centimetres up to about one and a half metres. As soon as the sweet area is left, the spatial auditory image collapses and sound seems to be restricted to the panel surface. Although both exciters are mounted to the same surface, their signal differences remain intact to a sufficient extent for providing the effect of crosstalk-cancelled projection. A. Physical construction Promenade has been prototyped with two materials and in two sizes so far. The first series uses finnboard, a lightweight type of cardboard, with the dimensions 70 by 100 centimetres (portrait) and 3 millimetres thickness. The second, more Figure 2. Promenade installation, SMC 2018, Limassol, Cyprus. compact series is made of screenboard, a more dense type of cardboard with white cover layers, 43,5 by 64 centimetres and 2 millimetres thick. Other promising materials that have been structures, processes, and realities around us, with the singular- explored include ply wood of a few millimetres thickness, and ity of particular spots in space that, when located, allow for an KAPA, a compound material combining a polyurethane foam exclusive access to individual layers of this environment. The core with cardboard cover layers. crosstalk-cancelled projection technique restricts these spots The smaller prototype uses Visaton EX-60 R exciters to clear-cut boundaries but on the other hand it potentially (60 millimetres in diameter), while the larger is equipped with ‘elevates’ each layer’s spatiality as a sublime, temporary the more powerful Visaton EX-80 S (square shape, 80 milli- overlay to the global environment. metres edge length). White flat cable serves for feeding the The installation Promenade serves as an experiential starting exciters and for suspending the device. Both the exciters and point for further investigations into the fields of anecdotal the cables are glued to the panels. The projection devices are music, narrative identity construction and strollology. All of fed by small standard amplifier boards installed separately. these areas may be reflected in conjunction with aesthetic In the installation, the projected signals were generated from experience in our everyday life auditory surrounding and its a BeagleBone Black computer board equipped with the Bela technical mediation. Departing from a workshop within the audio cape.1 Sound programming and convolution for crosstalk project Promenade, a publication is currently prepared that cancellation was performed in SuperCollider.2 As cancellation summarises the described approach and the ongoing process. filters, the so-called BACCH filters by Edgar Choueiri [1] were taken from the jconvolver convolution engine bundle by Fons III. T RANSAURAL CROSSTALK - CANCELLED SOUND Adriaensen.3 PROJECTION B. Sound installation and artistic context A. Working principle The sound projection device has been conceived as part Transaural crosstalk-cancelled sound projection, often ab- of a sound installation within the arts-based research project breviated XTC, has been developed for playing back binaural Promenade. Anekdote, Alltag, Abbild (Promenade. Anecdote, signals, which normally require headphones, on a pair of everyday life, image), hence its name.4 loudspeakers (see, e. g., [2, pp. 283–326]). While headphones Seven playback canvases (five in a later version, Figure 2) isolate the sound propagation paths from each of the earphones were distributed in the exhibition space, each suspended from to the corresponding ear, respectively, loudspeaker projection the ceiling. The projection surface was left blank, without any involves transaural crosstalk: the signal radiated from the left further visual additions. Each of the devices played back a ear also reaches the right one and vice versa. However, the soundscape composition based on field recordings. Listeners playback of binaural signals requires the separate presentation were invited to explore and ‘visit’ the different soundscapes of each channel to the corresponding ear. by moving in space, approaching the projection devices one Transaural crosstalk cancellation is based on time-shifted by one and ‘entering’ the recorded space by positioning anti-phase signals emitted from the respective opposite loud- themselves in the sweet area where the crosstalk-cancelled speaker that shall arrive at the near-side ear at the same time projection effect unfolds. as the unwanted crosstalk signal from the far side speaker The installation seeks to combine a global sonic envir- in order to cancel it out. Obviously, the cancellation signal onment that reflects the simultaneity of seemingly unrelated causes a crosstalk at the far side ear, too, which in turn 1 http://bela.io is eliminated by another correction signal emitted from the 2 http://supercollider.github.io opposite speaker. Several iterations of this process lead to the 3 http://kokkinizita.linuxaudio.org/linuxaudio/ design of a recursive filter network that ultimately seeks to 4 https://esc.mur.at/de/projekt/promenade-anekdote-alltag-abbild ensure an isolated projection of the ear signal pair [3]. 26 Promenade: A Minimalistic Sculptural Interface for Immersive Soundscape Presentation B. Binaural vs. stereophonic signals with XTC The experimental artistic context of Promenade did not As mentioned above, transaural crosstalk cancellation has pursue a scientific approach to such a technique for transaural been developed for the projection of binaural signals on loud- sound projection. In multiple iterations, the used combination speakers, that is, ear signals involving interaural time (ITD) of materials, dimensions and mount points of the exciters have and level differences (ILD) as well as elevation-dependent been adjusted such that the desired effect could be achieved spectral cues (see [4]). Nevertheless, XTC filters are also used sufficiently well. A validation by acoustic measurements, for the playback of ordinary loudspeaker stereophonic signals. however, has not yet been carried out. The latter application is known as Ambiophonics in consumer V. D ISCUSSION entertainment contexts [5]. Experimental findings of the Promenade project with re- The presented sound projection device shows an appropri- spect to stereophonic signals are discussed below (Section V). ate performance within its frame of origin, the Promenade arts-based research project. There, it allows for experiencing IV. F LAT PANEL LOUDSPEAKER PROJECTION the often surprising, sometimes uncanny effect of crosstalk- The excitation of flat, solid matter surfaces as an alternative cancelled sound projection and supports the exploration and loudspeaker working principle has been investigated during reflection of aesthetic experience in multi-layered spatial aud- the past decades (see, e.g., [6]). Major advantages are the itory environments. appealing form factor requiring dramatically reduced mounting Beyond the attached context, the principles exploited in depth, and the potential to create diffuse sound fields due to Promenade may be applied to other use cases. In the following, the low directivity even at high frequencies. some findings and possible directions are discussed, such as One important factor for exciting flat panels is the mounting the use of stereophonic signals, the integration with visual location of the exciter. Depending on the application, the presentations, extensions towards interactive applications, and physical properties and the size of the panel, a basic question other form factors, materials, or objects. is whether to exploit or to avoid the direct excitation of pre- A. Crosstalk cancellation of stereophonic signals dominant modes. Often, an out-of-centre location avoiding low integer partition of the surface dimensions is recommended When developing the Promenade sound installation with for a more balanced response. In Promenade, exciters were field recording material and XTC projection, it was found mounted according to the golden section as an experimentally that coincident stereophonic signals involving only level dif- found trade-off of acoustic and visual qualities. ferences but none of phase provide a more consistent and Apart from the construction of high-end loudspeakers, elec- stable spatial image than recordings in spaced omnidirectional trodynamic excitation of manifold objects for sound projec- stereophony that is based on phase differences. There is no tion is frequently applied in custom presentation media and well-researched explanation for this observation. It is likely in sound art (see, e.g., SHHH SHHH SHHH5 by Amanda that both the listening conditions and the projection device are Dunsmore, 2008, or Touched Echo6 by Markus Kison, 2007). not optimal for propagating phase differences to the listener’s As described above, the exploration for Promenade departs ears. In particular the fact that both exciters are working on from the latter. In such realms the presented sound material is the same projection surface may degrade the fidelity of phase usually created or adapted especially for the specific playback reproduction, although a similar observation was made using medium. Consequently, the projection device does not need two separate loudspeakers. to meet the same criteria as general-purpose loudspeakers, Overall, the discovery that intensity stereophony is well especially with respect to linearity and neutrality, because suited to span the extremes of the projection base – from ear such deviations can be corrected in the adaption process. This to ear using headphones, nevertheless crossing the listener’s largely facilitates achieving a beneficial result with relatively head, from speaker to speaker in an ordinary stereophonic low-cost equipment. setup, and along a well externalised, virtual stereo base of A special technical issue of the Promenade device is that about 150 degrees opening angle using XTC projection – only one panel is used for projecting two audio channels at is most valuable for the production of appropriate material. the same time. The crucial question is whether the crosstalk Following from that, field recordings were carried out in cancellation cues are propagated to a sufficient extent for mid/side stereophony, which allows for further benefits with achieving a similar effect as with two small-spaced loudspeak- respect to adaptation and interaction (see subsection V-C). ers or whether the inter-channel crosstalk in the panel degrades B. Visual integration the cues too much. A similar technique has been investigated for the application of Wave Field Synthesis (WFS) using even In the Promenade sound installation, major emphasis was more than two exciters on the same panel, coined Multi- put on listening to mediated auditory environments. Therefore, Actuator Panels [7]. Later, the method has been generalised the metaphor of a canvas was picked up that could have shown for creating multiple sound sources with a single panel [8]. a landscape but instead projected a soundscape while remain- ing visually ‘silent’. However, the surface of the projection 5 http://www.lit.ie/Dunsmore/linz/kunstraumwindows.html medium may be easily used for a seamless integration of visual 6 http://www.markuskison.de/touched_echo.html and auditory material. In the simplest case, the cardboard 27 Promenade: A Minimalistic Sculptural Interface for Immersive Soundscape Presentation material may be printed or laminated. The backside may be Such transitions may be triggered or emphasised by evaluating used as well, restricted by the mounted exciters. the listener’s distance or overall presence via the sensor. Alternatively, the panel may serve as a video projection D. Other form factors, materials, or objects screen, including three-dimensional technology involving po- larised or shutter glasses. The projection angle and distance Theoretically, combining two-channel electrodynamic excit- have to be envisaged such that the spectator is not disturbing ation of solid matter with transaural crosstalk cancellation is the projection but may still position herself appropriately with possible with various objects and materials of any form or respect to crosstalk-cancelled sound projection. extension. For example, architectural prototypes, engineering Using stereophonic signals, auditory events can be evoked models or basic commodity might be enhanced by an integ- for the listener anywhere in the horizontal dimension within rated auditory layer that invites for spatial immersion. the borders of the canvas and beyond. Therefore, horizontal The determining factors remain those that have been con- audio-visual congruence is well achievable, for example, for sulted while conceiving Promenade: annotating visual items in maps or alike. Psychoacoustic • the physical properties of the excited material with re- effects may be exploited for suggesting vertical correspond- spect to sound propagation, ences. Depending on the stiffness of the projection panel, • appropriate mounting locations for the exciters and fitting the location of the exciters on the panel is audible in certain form factors of the object and the exciters, and frequency bands. In this case, a second pair of exciters may • sufficiently isolated radiation of both channels for achiev- serve as a panning means in the vertical dimension, at least to ing the desired effect of crosstalk cancellation. a certain extent. VI. C ONCLUSION Generally, any visual presentation that favours a fixed, frontal and relatively close viewpoint can be expected to work In this paper, the Promenade spatial sound projection device well in combination with a Promenade like sound projection has been presented, along with its originating context, a sound device. installation as part of the arts-based research project of the same name. The device has been presented as combining the two techniques of transaural crosstalk cancellation for inviting C. Interactivity, tracking, sensors a rich spatial auditory experience with the excitation of solid From the beginning, the Promenade device has been con- objects for an unobtrusive appearance, differing from that of ceived with an additional optical distance sensor such that ordinary loudspeakers. The presentation form of a canvas, the playback can react interactively, depending on whether, though not visually occupied in the original installation, invites how close and for how long a listener is present in front for manifold applications that involve audio-visual correspond- of the canvas. For the installation versions involving several ence. Further extensions include the use of distance sensors projection panels, actual use of the sensors has been dismissed for interactive functionality, both with respect to the presented so far. It turned out to be more desirable in the first place to narrative and to the evoked spatial impression. explore the field of sweet spots distributed in space which, ACKNOWLEDGEMENTS when being entered, change the listening experience dramat- ically. This effect may be regarded as an inherent ‘interactive’ The research project Promenade is supported by the Vice behaviour of the projection dispositif, although no technical Rector’s office of research at University of Music and Per- reactivity is involved. forming Arts Graz, the Cultural Office of the City of Graz, Nevertheless, involving the sensor bears several interesting and ESC Media Arts Laboratory, Graz. use cases. If multiple projection panels are present, the sensors R EFERENCES may serve as a simple tracking device in order to activate or [1] E. Y. Choueri, “Optimal crosstalk cancellation for binaural audio with emphasis one or more panels rather than others, both in a two loudspeakers,” 2010. [Online]. Available: https://www.princeton.edu/ decentralised (each panel controls itself without influencing 3D3A/Publications/BACCHPaperV4d.pdf others) or a centralised manner. Furthermore, the spatial ar- [2] B. Xie, Head-Related Transfer Function and Virtual Auditory Display. Plantation: J. Ross, 2013. rangement of sound material could be extended towards an [3] R. Glasgal, “360° localization via 4.x RACE processing,” in 123rd AES enterable, interactive radio play. Different individual listeners Convention. AES, 2007. cannot be distinguished, though, which would require more [4] J. Blauert, Spatial Hearing. Cambridge Mass./London: MIT Press, 1997. [5] R. Glasgal, “Ambiophonics: The synthesis of concert-hall sound fields in sophisticated tracking means. the home,” in 99th AES Convention. AES, 1995. The sensor information may also affect the projection tech- [6] G. Bank and N. Harris, “The distributed mode loudspeaker – theory and nique itself. In the Promenade installation, gradual transitions practice,” in 13th Microphone & Loudspeakers AES Conference. AES, 1998. take place from monaural signals (centered in the projection [7] M. M. Boone, “Multi-actuator panels (MAPs) as loudspeaker arrays for panel) over ordinary stereophony (located roughly in between wave field synthesis,” Journal of the Audio Engineering Society, vol. 52, the exciter positions, thus a partition of the panel) to crosstalk- no. 7/8, pp. 712–723, 2004. [8] M. C. Heilemann, D. Anderson, and M. F. Bocko, “Sound-source loc- cancelled projection. As mid/side-stereophony is used, mod- alization on flat-panel loudspeakers,” Journal of the Audio Engineering ulating the contribution of the side signal allows for further Society, vol. 65, no. 3, pp. 168–177, 2017. controlling the perceived width of the resulting sonic image. 28