Publications-Detail
HD Telephony by Artificial Bandwidth Extension - Quality, Concepts and Complexity -
- Authors:
- Schlien, T.
- Ph. D. Dissertation
- School:
- IND, RWTH Aachen
- Adress:
- Templergraben 55, 52056 Aachen
- Series:
- Aachener Beiträge zu Digitalen Nachrichtensystemen (ABDN)
- Number:
- 47
- Date:
- Jul. 2019
- ISBN:
- 978-3-95886-350-7
- Language:
- English
Abstract
The audio bandwidth of digital landline and mobile telephone networks is still mostly restricted to 200 Hz to 3.4 kHz. This is due to compatibility requirements during the transition phase from analogue to digital transmission technology. The resulting characteristic "telephone speech" is widely accepted, but the intelligibility of syllables is only 91%. Meanwhile, improved coding standards for so-called "HD voice" or "Wideband Speech" have been developed which are gradually being introduced into the networks. They support an audio frequency bandwidth of 50 Hz to 7.0 kHz with significantly increased audio quality and speech intelligibility. For a very long time however, new HD-telephones and old narrowband telephones have to co-exist. If an HD-terminal is connected over a narrowband link to an old telephone, the improved coding scheme cannat be used.In this thesis, signal processing concepts are developed for improving audio quality and intelligibility of narrowband speech by artificial bandwidth extension (ABWE). These algorithms can be applied in the HD terminals or in the network, to transform narrowband speech to HD voice. Based on the source-filter model of speech production and a priori knowledge of the characteristics of speech signals, the missing frequency components between 3.4 kHz and 7 kHz are reconstructed. In comparison to the state-of-the-art ABWE approaches, the main contributions are:
- new concepts of estimating the wideband spectral envelope, e.g., in terms of the model filter by interpolation on the acoustic tube domain
- algorithms for spectral extension of the excitation signal
- new insights concerning the relative importance of the excitation, the temporal envelope and the spectral envelope
- remarkable improvements of the audio quality
- significant low computational complexity
- efficient and effective training and estimation algorithms
The improvements are verified by objective evaluation and subjective listening tests.
Download
BibTeX- Copyright © by IKS
schlien19.pdf - This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.