Publications-Detail

HD Telephony by Artificial Bandwidth Extension - Quality, Concepts and Complexity -

Authors:
Schlien, T.
Ph. D. Dissertation
 
School:
IND, RWTH Aachen
Adress:
Templergraben 55, 52056 Aachen
Series:
Aachener Beiträge zu Digitalen Nachrichtensystemen (ABDN)
Number:
47
Date:
Jul. 2019
ISBN:
978-3-95886-350-7
Language:
English

Abstract

The audio bandwidth of digital landline and mobile telephone networks is still mostly restricted to 200 Hz to 3.4 kHz. This is due to compatibility requirements during the transition phase from analogue to digital transmission technology. The resulting characteristic "telephone speech" is widely accepted, but the intelligibility of syllables is only 91%. Meanwhile, improved coding standards for so-called "HD voice" or "Wideband Speech" have been developed which are gradually being introduced into the networks. They support an audio frequency bandwidth of 50 Hz to 7.0 kHz with significantly increased audio quality and speech intelligibility. For a very long time however, new HD-telephones and old narrowband telephones have to co-exist. If an HD-terminal is connected over a narrowband link to an old telephone, the improved coding scheme cannat be used.
In this thesis, signal processing concepts are developed for improving audio quality and intelligibility of narrowband speech by artificial bandwidth extension (ABWE). These algorithms can be applied in the HD terminals or in the network, to transform narrowband speech to HD voice. Based on the source-filter model of speech production and a priori knowledge of the characteristics of speech signals, the missing frequency components between 3.4 kHz and 7 kHz are reconstructed. In comparison to the state-of-the-art ABWE approaches, the main contributions are:
- new concepts of estimating the wideband spectral envelope, e.g., in terms of the model filter by interpolation on the acoustic tube domain
- algorithms for spectral extension of the excitation signal
- new insights concerning the relative importance of the excitation, the temporal envelope and the spectral envelope
- remarkable improvements of the audio quality
- significant low computational complexity
- efficient and effective training and estimation algorithms
The improvements are verified by objective evaluation and subjective listening tests.

Download

BibTeX

Copyright © by IKS
schlien19.pdf
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.