Dialnorm = -31 + 10 * log10( (1/N) * Σ x_i² ) where ( x_i ) are the dialogue-gated samples.
Abstract: The proliferation of multi-channel audio formats (5.1, 7.1, Atmos) and object-based audio has created a need for efficient, real-time encoding solutions. The Dolby Encoding Engine (DEE) serves as a software-based, enterprise-grade tool for transcoding Pulse Code Modulation (PCM) audio into Dolby Digital (AC-3), Dolby Digital Plus (E-AC-3), and Dolby Atmos metadata. This paper examines the system architecture of DEE, its psychoacoustic modeling core, bit-rate reduction techniques, and its role in live broadcast, over-the-top (OTT) streaming, and post-production workflows. We compare its latency profile against hardware-based encoders and analyze its compliance with the ATSC 3.0 and DVB standards. 1. Introduction As consumer audio moved from stereo to immersive formats, broadcasters and streaming services required a software-defined method to encode multichannel audio without dedicated DSP chips. Dolby Laboratories introduced the Dolby Encoding Engine (DEE) as a cross-platform (Linux, Windows, macOS) solution that integrates into transcoding pipelines (e.g., FFmpeg, AWS Elemental, Harmonic). dolby encoding engine
[5] PEAQ – ITU-R BS.1387-1. (2001). Method for objective measurements of perceived audio quality . Dolby Encoding Engine, AC-3, E-AC-3, Dolby Atmos, audio compression, broadcast, low-latency encoding, dialnorm. Dialnorm = -31 + 10 * log10( (1/N)
[4] ITU-R BS.1770-4. (2015). Algorithms to measure audio programme loudness and true-peak audio level . International Telecommunication Union. This paper examines the system architecture of DEE,