Publications-Detail

Multi Channel Audio Processing: Enhancement, Compression and Evaluation of Quality

Authors:
Schäfer, M.
Editors:
Vary, P.
Ph. D. Dissertation
 
School:
IND, RWTH Aachen
Adress:
Templergraben 55, 52056 Aachen
Series:
Aachener Beiträge zu Digitalen Nachrichtensystemen (ABDN)
Number:
39
Date:
2014
ISBN:
978-3-95886-007-0
Language:
English

Abstract

The demand for high definition audio and video services is rapidly increasing. Two representative examples for this are audio-visual conferencing or video storage and delivery. In this context, efficient techniques are required for enhancement and compression of multi channel audio signals with compatibility to, e.g., mono or stereo systems.
In this thesis, novel signal processing algorithms for both enhancement and compression of multi channel signals are developed and theoretical performance bounds are derived. Additionally, a novel instrumental quality measure for the evaluation of multi channel signal processing algorithms is proposed.
Enhancement schemes for both the recording and the reproduction side are introduced. This includes the optimization of a near field filter-and-sum beamformer to achieve a target directivity characteristic at the recording side. For the reproduction side, an efficient postfilter is presented which increases the speech intelligibility by taking the positive influence of early room reflections into account.
The main part of this thesis covers multi channel predictive compression of audio signals. A predictive multi channel coding system is presented and analyzed. Performance bounds are derived and two methods for an adaptive bit rate distribution between inter channel and intra channel prediction are devised. Novel multi channel noise shaping concepts are introduced. The performance of the compression system is quantified by instrumental measures.
A novel instrumental measure is introduced for the evaluation of multi channel signal enhancement and compression. It combines the proven single channel quality measure PEAQ with a binaural auditory model and a mathematical model of cognitive behavior, providing a reliable evaluation of quality perception and spatial fidelity. The inclusion of spatial information into the instrumental quality measurement leads to a consistently high correlation between the instrumental measure and a listening test.

Download

BibTeX

Copyright © by IKS
schaefer14.pdf
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.