Scene-Based Audio Implemented with Higher Order Ambisonics (HOA)

Nils Peters, Deep Sen, Moo-Young Kim, Oliver Wuebbolt, S. Merrill Weiss

Scene-based Audio uses a sound-field technology called “Higher Order Ambisonics” (HOA) to create holistic descriptions of both live-captured and artistically-created sound scenes that are independent of specific loudspeaker layouts. For efficient representation, the audio can be carried as a set of PCM channels that contain predominant sounds and ambience in separate tracks. Standard audio bandwidth-compression techniques then can be applied to the PCM channels. This approach is in contrast to conventional channel-based sound representations, in which one signal is used for each loudspeaker of a target reproduction system, with the implication that upmixing or downmixing is required when loudspeaker configurations other than the intended one are used for actual reproduction. This paper examines how, with Scene-based Audio, there can be satisfactory reproduction of immersive sound at bitrates corresponding to the equivalent of just 6 channels, while an alternative sound-field method that exclusively employs audio objects typically involves much higher bitrates.

Published: 2015-10
Content type: Original Research
Keywords: Scene-based Audio, HOA, Spatial Audio, Next-Generation Audio, MPEG-H, ATSC 3.0
DOI: 10.5594/M001651
ISBN: 978-1-61482-956-0