A Switched Algorithm for Adaptive Feedback Cancellation Using Pre-Filters in Hearing Aids

Tran, Linh Thi Thuc; Nordholm, Sven Erik

doi:10.3390/audiolres11030037

Open AccessArticle

A Switched Algorithm for Adaptive Feedback Cancellation Using Pre-Filters in Hearing Aids

by

Linh Thi Thuc Tran

^1,*,†

and

Sven Erik Nordholm

^2,†

¹

Department of Electrical and Electronics Engineering, Posts and Telecommunications Institute of Technology, Hanoi 12110, Vietnam

²

Faculty of Science and Engineering, Curtin University, Perth 6102, Australia

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Audiol. Res. 2021, 11(3), 389-409; https://doi.org/10.3390/audiolres11030037

Submission received: 31 March 2021 / Revised: 9 July 2021 / Accepted: 5 August 2021 / Published: 9 August 2021

Download

Browse Figures

Versions Notes

Abstract

:

Acoustic coupling between microphone and loudspeaker is a significant problem in open-fit digital hearing aids. An open-fit compared to a close-fit hearing aid significantly lowers the signal quality and limits the achievable maximum stable gain. Adaptive feedback cancellation (AFC) enables an efficient approach to reduce the impact of acoustic coupling. However, without careful consideration, it can also introduce bias in estimating the feedback path due to the high correlation between the loudspeaker signal and the incoming signal, especially when the incoming signal is spectrally coloured, e.g., speech and music. The prediction error method (PEM) is well known for reducing this bias. The presented study aims to propose a switched PEM with soft-clipping (swPEMSC) that allows for further improvement in convergence/tracking rates, resulting in a better ability to recover from unstable/howling status. This swPEMSC employs a new update rule inspired by a soft-clipping based stability detector (SCSD). It allows to pick up either the PEMSC-NLMS or PEMSC-APA depending on the magnitude of the effective feedback signal; howling corresponds to a large feedback signal. The PEMSC-NLMS with a small step-size ensures a low steady-state error, but slow convergence/tracking rates, while PEMSC-APA with a large step-size allows for fast convergence/tracking rates, but a high steady-state error. By combining those approaches, the proposed approach can take advantage of good characteristics from both. Experimental results using different types of incoming signals and an abrupt change of feedback paths show that the swPEMSC can shorten unstable periods (howling) by improving the convergence and tracking rates while retaining a low steady-state error and good signal quality.

Keywords:

Index Terms—adaptive feedback cancellation; prediction error method; NLMS; APA; soft-clipping based stability detector

1. Introduction

Acoustic feedback occurs due to the acoustic coupling of the loudspeaker signal into the microphone(s). It is a significant problem in public address (PA) systems and open-fit hearing aids (HAs). With the presence of the forward path, the feedback signal is amplified before looping back into the loudspeaker forming a closed-loop system. This feedback signal not only significantly degrades signal quality, but also limits the achievable amplification of those systems. In some particular conditions, it drives the system into an unstable status and howling may occur. The acoustic feedback becomes a more challenging problem for hearing aid applications due to the high demand for small open-fit hearing aids. Many acoustic feedback cancellation methods have been introduced over the last sixty years [1,2,3]. Among them, adaptive feedback cancellation (AFC) has the prominence to reduce the adverse effect of acoustic feedback. This method estimates the acoustic feedback path by using an FIR filter, enabling an estimate of the feedback signal which now can be cancelled from the microphone signal (cf. Figure 1). Due to the closed-loop nature of the HA, the feedback path estimation can produce bias caused by the high correlation between loudspeaker signal and incoming signal [1,2,4].

To reduce this bias, multiple decorrelation methods have been investigated in literature, e.g., delay insertion [1,5], probe noise insertion [6,7,8,9], frequency shifting [10,11], phase modulation [12], and pre-whitening filters [13,14]. Among those methods, prediction error method based adaptive feedback cancellation (PEM-AFC) is well established as it can be effectively applied in both the time domain [14,15,16,17,18,19] and the frequency domain [2,20,21,22,23,24]. In this method, the input signals of an adaptive filter were pre-whitened by using pre-filters, resulting in lower correlation and so bias. Other methods employing sub-band techniques [25,26,27,28], multiple microphones [19,29,30,31,32,33,34], fast-converging adaptive filtering algorithms [14,15,17,32,35,36,37], affine combination of filters [23], and variable step-size (VSS) [11,38,39,40] or combinations of those techniques [31,41,42] for AFC also yielded performance improvement. In [43], an AFC approach based on decomposing a long adaptive filter into a Kronecker Product of two shorter filters has been proposed. Although the above AFC approaches can improve the system performance to a certain degree, the demand for a reliable AFC approach still increases.

In order to further improve the convergence and tracking rates of the previous AFC methods, a hybrid AFC using normalised least mean square algorithm (H-NLMS) has been introduced recently in [44]. The main idea of the H-NLMS was to develop a soft-clipping based stability detector (SCSD) to control the update of the adaptive filter such that the PEMSC-NLMS was chosen during stable periods, otherwise the NLMS algorithm without pre-filters was chosen. This leverages the fact that the NLMS algorithm allows the system to quickly recover from unstable conditions, while the PEMSC-NLMS may provide a lower bias in the estimate of the feedback path.

The purpose of this study is to further improve convergence/tracking rates of the state-of-the-art H-NLMS such that the system can faster recover from unstable/howling status. Inspired by the SCSD, we propose a switched PEM with soft-clipping for AFC in HAs. In contrast to the H-NLMS, first we employ pre-filters to both AFC methods using either the NLMS or the affine projection algorithm (APA) in order to reduce the biased estimate. Then, the SCSD is applied to produce a new update rule for the proposed swPEMSC in such a way that either the PEMSC-NLMS or PEMSC-APA is selected depending on the system converged or unstable, respectively. This is based on an observation that the PEMSC-NLMS with a small step-size provides a low steady-state error, but slow convergence/tracking rates, while the PEMSC-APA with a large step-size obtains much faster convergence/tracking rates, but a high steady-state error. Moreover, PEMSC-APA also faces the trade-off between a low steady-state error and fast convergence/tracking rates when its projection order increases.

Experimental results demonstrate that the proposed method can address those trade-off problems. Specifically, the proposed swPEMSC not only further improves the convergence/tracking rates of the state-of-the-art H-NLMS approach but also achieves a lower steady-state error. It also outperforms other AFC baselines such as PEMSC-NLMS and PEMSC-APA. Furthermore, the speech quality of the proposed swPEMSC is comparable to that of the H-NLMS and much better than the PEMSC-NLMS and PEMSC-APA. Both swPEMSC and H-NLMS obtain very short howling periods occurring at initialisation and at the sudden change of feedback path compared to the PEMSC-NLMS and PEMSC-APA.

Throughout this paper,

E \{.\}

and superscript T denote expectation and transposition operations, respectively. We use lower and upper letters in bold to represent vectors and matrices, respectively.

The paper is organised as follows. Section 2 describes the structure of a hearing aid and its acoustic feedback problem. Section 3 and Section 4 review the standard AFC method and the PEM-AFC, respectively. The proposed AFC method is presented in Section 5, while the computational complexity of all mentioned methods is compared in Section 6. Experimental results are evaluated in Section 7. Section 8 presents a discussion. Finally, Section 9 concludes the paper.

2. Hearing Aids and Acoustic Feedback Problem

A hearing aid is an electronic device that allows for assisting hearing impaired people to restore their hearing capability. A simple HA consists of a loudspeaker (receiver), a microphone, an amplifier, and a battery. The microphone of a HA picks up sound waves from the ambient environment. These picked up sound waves are converted into electrical signals (so-called incoming signals) which are then delayed and amplified through a forward path (

K (q)

) before reaching HA’s loudspeaker. Finally, the loudspeaker converts the received electrical signals back into sound waves which are fed to the human ear canal. All electrical elements of a HA are powered by a small battery. Due to a presence of a coupling signal (called feedback signal) from loudspeaker to microphone HA suffers from acoustic feedback problem.

Figure 2 illustrates the feedback problem in a HA. The microphone picks up not only the incoming signal but also the acoustic feedback signal, i.e.,

m (k) = x (k) + v (k),

(1)

where

m (k),

x (k)

and

v (k) = F (q) u (k) = f^{T} u (k)

denote microphone, incoming and feedback signals, respectively, with

F (q)

the polynomial transfer function in q of the true feedback path.

F (q) = f^{T} q,

where

f = {[f_{0}, f_{1}, \dots, f_{L_{f} - 1}]}^{T}

is a

L_{f}

-dimensional vector denoting impulse response (IR) of the true feedback path and

q = {[\begin{matrix} 1 & q^{- 1} & \dots & q^{- L_{f} + 1} \end{matrix}]}^{T}

with

q^{- 1}

the discrete-time delay operator. The microphone signal then loops back to the loudspeaker after being processed by a forward path (so-called signal processing path) making HA a closed-loop system. The loudspeaker signal can be expressed as

u (k) = K (q) m (k) .

(2)

In this paper, we assume that

K (q) = |K| q^{- d_{k}}

, where

|K|

and

d_{k}

represent gain and delay in the forward path, respectively. This delay must be at least one sample, i.e.,

d_{k} ⩾ 1

. By substituting (1) into (2) we obtain the transfer function of a closed-loop system from the incoming signal to the loudspeaker signal as follows:

S (q) = \frac{K (q)}{1 - K (q) F (q)} .

(3)

With the presence of the forward path, the acoustic feedback signal is amplified and looped back into the loudspeaker again and again, which may render the system into unstable conditions. Acoustic feedback is the main problem in HA since it significantly degrades the achievable maximum amplification and output signal quality. The stability of an LTI closed-loop system is based on the Nyquist stability criterion which is stated that a closed-loop system is unstable if conditions for loop gain and loop phase in (4) are met simultaneously.

\{\begin{matrix} \begin{matrix} |K (e^{j ω}) F (e^{j ω})| \geq 1 \\ ∠ K (e^{j ω}) F (e^{j ω}) = 2 π n, & n \in Z \end{matrix}, \end{matrix}

(4)

where

K (e^{j ω})

and

F (e^{j ω})

are the frequency responses of the forward path and the acoustic feedback path, respectively, and

ω \in [0, 2 π]

is the angular frequency. The Nyquist stability criterion in (4) is essential for acoustic feedback control as acoustic feedback control methods effectively try to avoid either one or both of these conditions to be met [3]. Note that an unstable system will result in an unbounded output but due to a natural limiting circuit in the amplifier and loudspeaker it may result in howling.

3. Standard AFC Approach

For the sake of simplicity, we assume that incoming signals are stationary and that all AFC systems are discrete and linear time-invariant (LTI). Figure 1 depicts a block diagram of a standard adaptive feedback cancellation system for hearing aids using a single-microphone single-loudspeaker (SMSL). The main idea is to adopt an FIR adaptive filter (

\hat{F} (q)

) to estimate the true feedback path (

F (q)

), then the estimated feedback path is utilised to compute the feedback signal estimate,

\hat{v} (k)

. The error signal,

e (k)

, is computed by subtracting

\hat{v} (k)

from the microphone signal,

m (k)

. This error signal goes through the forward path (

K (q)

) where it is delayed and amplified before it feeds into the HA’s loudspeaker. The error signal utilised for an adaptive estimate of the feedback path is computed as follows:

e (k) = m (k) - \hat{v} (k),

(5)

where

\hat{v} (k) = {\hat{f}}^{T} u (k)

is a

L_{\hat{f}}

-dimensional vector, and the vector

\hat{f} = {[{\hat{f}}_{0}, {\hat{f}}_{1}, \dots, {\hat{f}}_{L_{\hat{f}} - 1}]}^{T}

denotes the estimated feedback path of length

L_{\hat{f}}

. The

L_{\hat{f}}

-dimensional vector

u (k)

is defined as

u (k) = {[u (k), u (k - 1), \dots, u (k - L_{\hat{f}} + 1)]}^{T}

. In HAs, the loudspeaker signal

u (k)

is formed by processing the error signal

e (k)

using the forward path

K (q)

, i.e.,

u (k) = K (q) e (k) .

(6)

The transfer function of the closed-loop AFC system is defined as

\bar{S} (q) = \frac{K (q)}{1 - K (q) [F (q) - \hat{F} (q)]} .

(7)

The closed-loop system

\bar{S} (q)

is stable if the condition

|K (e^{j ω}) [F (e^{j ω}) - \hat{F} (e^{j ω})]| < 1

is fulfilled.

\hat{F} (e^{j ω})

is the frequency response of the estimated feedback path.

By minimising the cost function

J (\hat{f}) = E \{e^{2} (k)\}

with respect to

\hat{f}

, we obtain an optimal solution as

{\hat{f}}_{0} = E {\{u (k) u^{T} (k)\}}^{- 1} E \{u (k) m (k)\} .

(8)

Substituting (1) into (8) yields [1]

{\hat{f}}_{0} = f + \underset{b i a s}{\underset{︸}{R_{u}^{- 1} r_{u x}}} .

(9)

It can be observed that there is a bias in the estimate of feedback path in (9) due to the correlation between the loudspeaker and incoming signals. Thus, the incoming signal acts as a disturbance to the feedback canceller [2]. We recursively approximate the vector

{\hat{f}}_{0}

using the NLMS algorithm as follows:

\hat{f} (k) = \hat{f} (k - 1) + \frac{μ}{u^{T} (k) u (k) + δ_{N L M S}} u (k) e (k),

(10)

where

μ

is a fixed step-size and

δ_{N L M S}

is a small positive value added to avoid division by zero.

In the ideal case where the acoustic feedback path is perfectly estimated, i.e.,

\hat{F} (q) = F (q)

, we obtain the loudspeaker signal as a delayed and amplified version of the incoming signal.

4. PEM-AFC

To address the bias in the feedback path estimate, prediction error method based adaptive feedback cancellation (PEM-AFC) has been proposed in [14] and widely used in both time-domain and frequency-domain AFC approaches. Figure 3 depicts the PEM-AFC model for a SMSL hearing aid [2,14,20]. In the PEM-AFC, first pre-filters are employed to pre-whiten the inputs of the adaptive filter, then the adaptive filter coefficients are recursively updated using pre-whitened signals. In this method, the incoming signal is assumed to be modelled by an autoregressive (AR) process, i.e.,

x (k) = G^{- 1} (q) w (k),

(11)

where

w (k)

denotes white Gaussian noise and

G^{- 1} (q)

is a monic and inversely stable all-pole filter. The loudspeaker and microphone signals are pre-whitened by using

\hat{G} (q)

. The

\hat{G} (q)

is the estimated version of

G (q)

.

m_{p} (k) = \hat{G} (q) m (k),

(12)

u_{p} (k) = \hat{G} (q) u (k),

(13)

x_{p} (k) = \hat{G} (q) x (k),

(14)

where

m_{p} (k)

,

u_{p} (k)

, and

x_{p} (k)

represent pre-whitened microphone, pre-whitened loudspeaker, and pre-whitened incoming signals, respectively.

The prediction error signal

e_{p} (k)

is defined as

e_{p} (k) = m_{p} (k) - {\hat{f}}^{T} u_{p} (k),

(15)

where

u_{p} (k) = {[u_{p} (k), u_{p} (k - 1), \dots, u_{p} (k - L_{\hat{f}} + 1)]}^{T}

is a

L_{\hat{f}}

-dimensional vector. The IR of the true feedback path in the PEM-AFC can be estimated by minimising the mean square prediction error,

E \{e_{p}^{2} (k)\}

,

\begin{matrix} min_{\hat{f}} E \{e_{p}^{2} (k)\} & = min_{\hat{f}} E \{{|m_{p} (k) - {\hat{f}}^{T} u_{p} (k)|}^{2}\} . \end{matrix}

(16)

The optimal solution for (16) can be written as follows:

{\hat{f}}_{0} = E {\{u_{p} (k) u_{p}^{T} (k)\}}^{- 1} E \{u_{p} (k) m_{p} (k)\},

(17)

{\hat{f}}_{0} = f + \underset{b i a s}{\underset{︸}{R_{u_{p}}^{- 1} r_{u_{p} x_{p}}}} .

(18)

By substituting (11) and (14) into (18), an unbiased estimate of the feedback path can be obtained if the assumption (11) is satisfied,

\hat{G} (q) = G (q)

, and at least one delay in the forward path is available. Furthermore, both the feedback path

F (q)

and the AR model

\hat{G} (q)

can be identified in closed-loop without adding a probe signal or nonlinearities if the delay in the forward path is not smaller than the length of AR model

\hat{G} (q)

[2,14].

The optimal coefficients

{\hat{f}}_{0}

of the PEM-AFC can be recursively approximated using the NLMS algorithm as

\hat{f} (k) = \hat{f} (k - 1) + \frac{μ}{u_{p}^{T} (k) u_{p} (k) + δ_{N L M S}} u_{p} (k) e_{p} (k) .

(19)

5. Proposed Method

In this subsection, the structure of the switched prediction error method with soft clipping (swPEMSC) is proposed for adaptive feedback cancellation in hearing aids. Figure 4 illustrates the swPEMSC model. This model is similar to the model of the hybrid NLMS adaptive feedback cancellation algorithm (H-NLMS) using a soft-clipping-based stability detector (SCSD) [44]. The main difference is in the way to update the adaptive filter

\hat{F} (q)

. In [44], a SCSD is adopted to control the update of the adaptive filter in such a way that the NLMS algorithm is selected when the system is or close to unstable conditions, and the PEMSC-NLMS algorithm is selected when it has converged. This is based on the idea that the NLMS algorithm provides a quick recovery from howling and the PEMSC-NLMS algorithm enables lower misalignment leading to a closer estimate of the true channel. When the feedback is strong,

\hat{v} (k)

and

v (k)

are not close, and thus the feedback contribution will dominate over the incoming signal, i.e.,

|v (k) - \hat{v} (k)| ≫ |x (k)| .

Inspired by bounded loudspeaker signal

u (k)

, in practice, howling will be produced in this case. By choosing suitable parameters, the H-NLMS can significantly shorten the howling periods and improve the signal quality [44].

To further improve the convergence/tracking rates as well as steady-state error, while retaining good output sound quality, we propose a new rule to update the feedback path estimate. We observe that standard adaptive algorithms such as LMS, NLMS, and APA suffer from a trade-off between fast convergence/tracking rates and low steady-state error. Particularly, such algorithms yield fast convergence/tracking rates, but high steady-state error when their step-size is large, and vice versa. Figure 5 illustrates the trade-off for AFC using PEMSC-NLMS with different step-size values. In addition, the affine projection algorithm (APA) also exposes to this trade-off with respect to (w.r.t) its projection order (P), i.e., the APA provides a fast convergence/tracking, but high steady-state error when P is large and vice versa [45,46]. Figure 6 shows that the PEMSC-APA obtains a higher convergence/tracking rate than the PEMSC-NLMS. Moreover, with a fixed step-size the PEMSC-APA converges faster but yields a higher steady-state error when the projection order increases to a certain level, for example, from

P = 2

to

P = 6

in the experiment. With

P = 8

, there is almost no improvement in the convergence of PEMSC-APA, but the worse steady-state error is obtained compared to the same experiment with

P = 6

. Therefore, we select

P = 6

for PEMSC-APA in Experiments 2 and 3.

The proposed method, swPEMSC, is developed to address the above trade-off problems. It is inspired by the SCSD in [44]. In contrast to the authors of [44], we propose to employ SCSD to control a switch between the prediction error method using soft clipping and NLMS (PEMSC-NLMS) and the PEMSC using APA (PEMSC-APA). Specifically, we utilise pre-filters to pre-whiten the inputs of the adaptive filter

\hat{F} (q)

leading to lower bias in the estimate of filter coefficients. Then, we apply SCSD to design a switch that allows the adaptive filter to pick up the PEMSC-NLMS algorithm with a small step-size when the system has converged and the PEMSC-APA with a large step-size and projection order when the system is or close to unstable status. As a result, the proposed swPEMSC can achieve a low steady-state misalignment during stable periods and fast convergence/tracking rates to quickly recover from howling during unstable periods. It also solves the mentioned compromise of the APA w.r.t. its projection order, i.e., when P increases to a certain level, the swPEMSC achieves faster convergence or tracking rate, while still retaining a low steady-state error.

Simulation results show that the proposed swPEMSC outperforms either PEMSC-NLMS or PEMSC-APA for adaptive feedback cancellation in HAs in terms of normalised misalignment (MIS), added stable gain (ASG) and perceptual evaluation of speech quality (PESQ). It also further improves convergence/tracking ability compared to the H-NLMS [44] with a price of higher computational complexity. Furthermore, the proposed method yields a comparable or better output signal quality (PESQ) compared to all mentioned baselines. In the following, formulations of the swPEMSC will be described in detail. Definitions of the microphone, feedback and error signals are similar to those in (1) and (5), i.e.,

m (k) = x (k) + v (k),

(20)

\begin{matrix} e (k) & = m (k) - \hat{v} (k) \\ = x (k) + [v (k) - \hat{v} (k)] . \end{matrix}

(21)

The pre-whitened microphone, loudspeaker, and error signals are also defined similar to those in (12), (13), and (15), i.e.,

m_{p} (k) = \hat{G} (q) m (k),

(22)

u_{p} (k) = \hat{G} (q) u (k),

(23)

e_{p} (k) = m_{p} (k) - {\hat{f}}^{T} u_{p} (k),

(24)

where

u_{p} (k) = {[u_{p} (k), u_{p} (k - 1), \dots, u_{p} (k - L_{\hat{f}} + 1)]}^{T}

.

In the proposed method, a soft-clipping (SC) is applied to the error signal yielding the soft-clipping error signal,

e_{S C} (k) = α tanh (\frac{e (k)}{α}),

(25)

where

α

is a scaling parameter. We select

α

such that the most likely range of the incoming signal lies in the linear range of the tanh-function, i.e.,

x (k) \approx α tanh (\frac{x (k)}{α})

, thus

e (k) - e_{S C} (k)

may be utilised to detect instability of the AFC. This SC allows for a controlled nonlinearity on the error signal. In this way, the nonlinearity is known and the AFC can be kept linear. As a result, the feedback cancellation performance is improved. We adopt a SCSD to produce a control signal,

λ (k)

, as

λ (k) = Γ \{|e_{S C} (k) - e (k)| < γ\},

(26)

where

γ

is a decision threshold determining the sensitivity of the detector and

Γ

is a binary function returning 1 if the inequality holds and 0 if not. The proposed AFC method integrates a SCSD into the update rule of the adaptive filter

\hat{F} (q)

such that the PEMSC-NLMS is selected when the system is converged, and the PEMSC-APA is selected when the system is unstable or close to unstable. We set a small step-size value for the PEMSC-NLMS, and a large step-size value for PEMSC-APA aiming at taking advantage of a low steady-state error of the PEMSC-NLMS with a small step-size and fast convergence/tracking rates of the PEMSC-APA with a large step-size. The proposed update rule is defined as follows:

\begin{matrix} \hat{f} (k) & = \hat{f} (k - 1) + \frac{μ_{1} λ (k) u_{p} (k) e_{p} (k)}{u_{p}^{T} (k) u_{p} (k) + δ_{N L M S}} \\ + μ_{2} [1 - λ (k)] U_{p} (k) {[U_{p}^{T} (k) U_{p} (k) + δ_{A P A} I_{P}]}^{- 1} e_{p} (k), \end{matrix}

(27)

where

I_{P}

denotes a

P x P

identity matrix,

μ_{1}

and

μ_{2}

are fixed step-sizes (

μ_{2} ≫ μ_{1}

),

δ_{N L M S}

and

δ_{A P A}

denote regularisation parameters of the NLMS and APA, respectively, and

U_{p} (k)

is a

L_{\hat{f}} x P

matrix representing P recent loudspeaker signal vectors of length

L_{\hat{f}}

after being pre-whitened by the pre-filter

\hat{G} (q)

,

U_{p} (k) = [u_{p, 0} (k) u_{p, 1} (k) \dots u_{p, P - 1} (k)] .

The loudspeaker signal can be computed as

u (k) = K (q) e_{S C} (k) .

(28)

Besides, the soft-clipping error signal is used to estimate the pre-filter coefficients via Levinson–Durbin algorithm.

6. Computational Complexity

In this section, we compare the computational complexity of four considered AFC approaches. The computational complexity for estimating the linear predictor coefficients (LPC) using the autocorrelation matrix and the Levinson–Durbin algorithm is

\frac{5 N^{2} + 2 L N + N}{2 L}

multiplications, where N is the AR-model order and L is the frame length. Additionally, each pre-whitened signal is computed using N multiplications and soft-clipping needs 2 multiplications. Thus the PEMSC requires

M = \frac{5 N^{2} + 2 L N + N}{2 L} + 2 N + 2

multiplications per output sample. For estimating the adaptive filter coefficients using NLMS and APA we need

3 L_{\hat{f}} + 2

and

(P^{2} + 2 P) L_{\hat{f}} + P^{3} + P

multiplications, respectively [47], where

L_{\hat{f}}

denotes the adaptive filter order and P is the projection order.

Table 1 summarises the number of real multiplications per output sample [48] for each AFC approach, where we assume that a real multiplication and a real division have equal complexity. It can be seen that the PEMSC-NLMS has the lowest complexity. The computational complexity of the H-NLMS is slightly higher than that of the PEMSC-NLMS. The AFC approaches using the APA like PEMSC-APA and swPEMSC yield higher computational complexity than the approaches using only the NLMS due to a large value of P. However, the proposed swPEMSC achieves significant improvement on convergence/tracking rates as well as steady-state error compared to other mentioned AFC approaches. It also provides much higher perceptual speech quality (PESQ score) than the PEMSC-NLMS and PEM-APA, and a comparable PESQ score compared to the H-NLMS for both feedback paths.

7. Experimental Results

We use measured feedback paths, so-called free-field (

F_{1}

) and telephone-near (

F_{2}

), corresponding to the case without obstacle between loudspeaker and microphone and the case a telephone placed very close to the ear, respectively [49] for Experiments 1–3. Figure 7 depicts the amplitude and phase responses of these measured feedback paths. It can be observed that the

F_{2}

feedback path has a higher amplitude response than the

F_{1}

feedback path due to the effect of the obstacle.

To evaluate the performance of AFC approaches, we use two common metrics: normalised misalignment (MIS) and added stable gain (ASG). The normalised misalignment [20] and added stable gain [20,50] are defined in (29) and (30), respectively.

{MIS}_{i} = 10 log_{10} (\frac{\int_{0}^{π} {|F_{i} (e^{j ω}) - e^{- j ω d_{f b}} {\hat{F}}_{i} (e^{j ω})|}^{2} d ω}{\int_{0}^{π} {|F_{i} (e^{j ω})|}^{2} d ω}),

(29)

\begin{matrix} {ASG}_{i} & = & 10 log_{10} (min_{ω} \frac{1}{{|F_{i} (e^{j ω}) - e^{- j ω d_{f b}} \hat{F_{i}} (e^{j ω})|}^{2}}) \\ - 10 log_{10} (min_{ω} \frac{1}{{|F_{i} (e^{j ω})|}^{2}}), \end{matrix}

(30)

where i is the feedback path index (

i = 1, 2

),

F_{i} (e^{j ω})

and

{\hat{F}}_{i} (e^{j ω})

denote frequency responses of the ith true and the ith estimated feedback paths at the normalised angular frequency

ω

respectively and

d_{f b}

denotes a delay in the feedback canceller’s path. The lower value of MIS and higher value of ASG indicate the better AFC approach.

In addition, perceptual evaluation of speech quality (PESQ) [51] is utilised to evaluate the quality of the speech signal. The PESQ ranges from −0.5 to 4.5, where the value of −0.5 indicates poor speech quality and the value of 4.5 indicates the highest speech quality. For the PESQ measures, the incoming signal

x (k)

and the loudspeaker signal

u (k)

are chosen as the reference and test signals, respectively. We evaluate the convergence/tracking rates of AFC methods based on the necessary time (namely,

τ_{κ_{i}}

) for each method to reach a certain level of misalignment (namely

κ_{i}

) corresponding to the feedback path

F_{i}

.

The following parameters are selected for all simulations: forward path delay

d_{k} = 96

samples, delay of the feedback canceller’s path

d_{f b} = 1

sample, and regularisation parameters

δ_{N L M S} = δ_{A P A} = 10^{- 6}

. Lengths of the true and estimated feedback paths are

L_{f} = 100

and

L_{\hat{f}} = 64

, respectively. For the pre-filter estimate,

\hat{G} (q)

, a 20-order AR model of the incoming signal is computed for every frame of 160 samples by using the Levinson–Durbin algorithm [52]. For experiments 1–3, the forward path gain

|K| = 30

dB and sampling frequency

f_{s} = 16

kHz are chosen.

We compare the proposed AFC approach to state-of-the-art baselines such as PEMSC-NLMS, PEMSC-APA (with different projection order), and H-NLMS [44] using different types of incoming signals, e.g., speech and music signals. Figure 8 shows the recorded speech and music signals used as incoming signals in experiments 1–3. The length of these signals is truncated to 60 s. To evaluate the tracking ability of AFC approaches, a sudden change of the feedback path from (

F_{1}

) to (

F_{2}

) is employed after half of the simulation time. The following step-sizes are selected for AFC approaches to ensure that each AFC approach achieves its best performance.

For PEMSC-NLMS: $μ = 0.001$ for Experiments 1–3 and $μ = 0.002$ for Experiment 4.
For PEMSC-APA: $μ = 0.0008$ .
For H-NLMS and swPEMSC: $μ_{1} = 0.0008$ , $μ_{2} = 0.8$ .

Experiment 1: This experiment aims at finding a suitable value of projection order such that the proposed method achieves a good performance as well as an acceptable complexity. Figure 9 demonstrates the performance of the proposed approach with different projection order

P \in [2, 4, 6, 8]

in term of MIS and ASG. It can be seen that the convergence rate of the swPEMSC is quite similar for

P = 2, 4, 8

. The tracking rate is improved with an increase of P from 2 to 4 or 6. Further increasing P degrades the system performance. For example, the tracking rate with

P = 8

is lowered than that with

P = 4

or 6. In the experiment, we found that

P = 6

is the best choice as it allows the swPEMSC to achieve the highest convergence/tracking rate while maintaining a similar steady-state misalignment. Similar observations are also reported when using music as the incoming signal, cf. Figure 10. Therefore, we select

P = 6

for the proposed swPEMSC.

Experiment 2: This experiment aims at evaluating the performance of the proposed swPEMSC for recorded speech [30] as the incoming signal. The feedback paths depicted in Figure 7 are selected. We suddenly change the feedback path from

F_{1}

to

F_{2}

after 30 s. The performance of the proposed swPEMSC is evaluated in terms of normalised misalignment (MIS), added stable gain (ASG), signal quality (PESQ) and the necessary time (

τ_{κ_{i}}

).

Figure 11 compares the performance of the proposed swPEMSC with state-of-the-art baselines such as PEMSC-NLMS, PEMSC-APA (with

P = 6

) and H-NLMS. It can be seen that the PEMSC-APA converges quicker than the PEMSC-NLMS. It also tracks a sudden change of the feedback path from free-field (

F_{1}

) to telephone-near (

F_{2}

) quicker than the PEMSC-NLMS. However, a higher steady-state error with larger variations can be observed from MIS and ASG of the PEMSC-APA. The H-NLMS yields faster initial convergence and tracking rates than the PEMSC-APA while providing a lower steady-state error compared to both PEMSC-NLMS and PEMSC-APA. The proposed method outperforms all mentioned AFC methods. Particularly, it achieves the fastest convergence/tracking rates and the lowest steady-state error. It also provides the highest ASG, especially during the periods in which the system has converged, for example, the periods between 15 s and 30 s corresponding to the feedback path

F_{1}

and between 45 s and 60 s corresponding to the feedback path

F_{2}

.

Table 2 summarises the comparison of the swPEMSC with baselines in terms of the necessary time in seconds (

τ_{κ_{i}}

) for each AFC approach to reach

κ_{i}

dB level of misalignment, average normalised misalignment in dB, and average added stable gain in dB corresponding to the ith feedback path (i.e.,

τ_{κ_{i}}

,

{\bar{M I S}}_{i}

,

{\bar{A S G}}_{i}

), respectively. For recorded speech incoming signal [30], we choose

κ_{1} = - 15

dB,

κ_{2} = - 16

dB. The best values are indicated in bold. It can be seen that the PEMSC-APA has higher

{\bar{M I S}}_{i}

,

{\bar{A S G}}_{i}

and much smaller

τ_{κ_{i}}

(

i = 1, 2

) than those of the PEMSC-NLMS. The H-NLMS obtains further improvement in

{\bar{M I S}}_{i}

,

{\bar{A S G}}_{i}

, while retaining similar

τ_{κ_{2}}

and a bit higher

τ_{κ_{1}}

compared to the PEMSC-APA. Among all mentioned AFC approaches, the proposed swPEMSC achieves the best values for all metrics. Specifically, the proposed approach yields approximately 4 dB gain on

{\bar{M I S}}_{1}

and

{\bar{A S G}}_{1}

, 2.5 dB gain on

{\bar{M I S}}_{2}

and

{\bar{A S G}}_{2}

compared to the PEMSC-NLMS. It also yields approximately 2 dB gain on

{\bar{M I S}}_{i}

and

{\bar{A S G}}_{i}

for both feedback paths compared to the PEMSC-APA, and 0.7 dB improvement in

{\bar{M I S}}_{i}

and 0.5 dB improvement in

{\bar{A S G}}_{i}

compared to the H-NLMS. Moreover, the proposed approach needs only 0.3 s to reach −15 dB of misalignment (for

F_{1}

) while the PEMSC-NLMS, PEMSC-APA, and H-NLMS need around 8.3 s, 2.5 s, and 1.7 s, respectively. Similar observations are reported for the case using

F_{2}

. Note that a small value of

τ_{κ_{1}}

implies a faster convergence rate and a small value of

τ_{κ_{2}}

means a faster tracking rate.

We evaluate the speech quality of the compared AFC approaches using the PESQ measure (cf. Table 3). This table shows that H-NLMS and swPEMSC outperform PEMSC-NLMS and PEMSC-APA in terms of PESQ. The swPEMSC yields a slightly higher PESQ for the free-field feedback path (

F_{1}

) but gets a small drop in PESQ for the telephone-near feedback path (

F_{2}

) compared to the H-NLMS. Both H-NLMS and swPEMSC obtain high perceptual speech quality with PESQ scores from 3.7 to 4.2. We observe that the misalignment of swPEMSC yields a higher peak (a sign of howling) than that of the H-NLMS after a sudden change of feedback path. However, this high peak lasts only for a very short time before the system quickly returns to a stable state (see Figure 11a). That may be the reason for a small reduction in the

{PESQ}_{2}

of the swPEMSC compared to that of H-NLMS. To verify this observation, we compute

{PESQ}_{2}

with incoming signal from 32 s to 60 s (skipping the first 2 s that may contain howling for the H-NLMS and swPEMSC). In this case, the

{PESQ}_{2}

scores for PEM-NLMS, PEM-APA, H-NLMS, and swPEMSC are 3.571, 4.084, 4.224, and 4.291, respectively. This result means that the three last methods achieve good signal quality (PESQ > 4), but the proposed approach obtains the highest PESQ for the period of the signal without the howling effect. This result matches well with Figure 11a, where the three last methods show quick tracking rates. The

{PESQ}_{2}

of the PEMSC-NLMS is much lower since this method tracks the change of feedback path slower than other methods resulting in a part of howling still available in the

{PESQ}_{2}

computing period (32 s–60 s).

Those results are consistent with an observation of howling periods in Figure 12. We can see that the PEMSC-APA yields shorter howling than the PEMSC-NLMS. The howling periods of the H-NLMS and the proposed swPEMSC are comparable, but they are much shorter than those of the PEMSC-APA. Therefore, the swPEMSC and H-NLMS can recover from howling periods quicker than the PEMSC-APA and PEMSC-NLMS.

Experiment 3: In this experiment, we evaluate the proposed swPEMSC using recorded music [24] as the incoming signal. The recorded music is a segment of the song “Imagine” by John Lennon. We select the feedback paths depicted in Figure 7 and suddenly change the feedback path after half of the simulation time. We set

κ_{1} = - 11

dB and

κ_{2} = - 14.5

dB.

Figure 13 demonstrates the performance of the proposed approach in comparison with other baselines. The proposed swPEMSC outperforms all considered AFC approaches. In particular, it achieves faster convergence/tracking rates and lower steady-state error than other baselines. Those observations are consistent with the results shown in Table 2 for the music incoming signal. It is shown that the proposed approach obtains the best values for most of metrics such as

{\bar{M I S}}_{1}

,

{\bar{M I S}}_{2}

,

{\bar{A S G}}_{2}

,

τ_{κ_{1}}

, and

τ_{κ_{2}}

. The H-NLMS obtains the best value for

{\bar{A S G}}_{1}

, while the

{\bar{A S G}}_{1}

of swPEMSC is comparable with that of the H-NLMS.

Figure 14 compares howling periods of the proposed approach with those of baselines. It can be seen that the swPEMSC can recover from howling quicker than the PEMSC-NLMS and PEMSC-APA. Its howling length is comparable with that of the H-NLMS.

Experiment 4: This experiment is conducted to verify the robustness of the proposed approach against different input signals and feedback paths. In particular, we evaluate the proposed swPEMSC using five segments of concatenated speech as the incoming signals and a new feedback path.

Note that the incoming signals in this experiment are not recorded. Each speech segment is generated by randomly selecting and concatenating speech utterances extracted from NOIZEUS database [51]. The length of each segment is 40 s. The measured feedback path [20] is selected for this experiment. This measured feedback path (

F_{1}

) and the five segments of speech incoming signals are presented in Figure 15 and Figure 16, respectively. The sampling frequency for this experiment is 8 kHz. To evaluate the tracking ability of AFC methods we produce the second feedback path (

F_{2}

) by right shifting

F_{1}

by 12 samples. In this experiment, we set

|K| = 12

dB for the forward path gain,

κ_{1} = κ_{2} = - 13.5

dB, and step-size

μ = 0.002

for the PEMSC-NLMS. We also select

P = 2

for the PEMSC-APA

P = 6

for the swPEMSC as they allow the best performance for the corresponding approach. Other parameters are set the same as those in Experiment 2.

Figure 17 shows the average MIS and ASG computed over 5 segments of speech input. The feedback path abruptly changes from

F_{1}

to

F_{2}

after 20 s. It can be seen that the PEMSC-APA and PEMSC-NLMS obtain similar performance, while the H-NLMS achieves quicker convergence/tracking rates and also lower steady-state error. Higher average ASGs for both feedback paths are also observed for the H-NLMS compared to those of PEMSC-NLMS and PEMSC-APA. As expected the swPEMSC outperform all baselines. Those observations match well with the results in Table 2, where the proposed swPEMSC achieves the highest

{\bar{M I S}}_{i}

and

{\bar{A S G}}_{i}

. It also obtains comparable

τ_{κ_{i}}

compared to the H-NLMS, but those values much lower compared to the PEMSC-APA and PEMSC-NLMS.

Table 3 shows the average PESQ scores over five segments of speech input, where

{PESQ}_{1}

and

{PESQ}_{2}

are measured over the last 18 s of speech segments corresponding to the feedback path

F_{1}

and

F_{2}

, respectively. It is observed that the

{PESQ}_{1}

scores of all mentioned AFC methods are very high (approximately 4.1), which reflect the fact that after the first 2 s all AFC methods have converged. These results are consistent with the measures of

τ_{κ_{1}}

in Table 2 which shows that all AFC methods during the period of the feedback path

F_{1}

need around 2 s or less to reach −13.5 dB of misalignment. When the feedback path suddenly changes from

F_{1}

to

F_{2}

, the PEMSC-NLMS and PEMSC-APA need approximately 3 s and 2.8 s to reach that level of misalignment, respectively. It may be the reason for a reduction in

{PESQ}_{2}

of those methods since the system may still partly unstable during the period

{PESQ}_{2}

computed. In contrast, the H-NLMS and swPEMSC require a very short time to reach that level of misalignment for both feedback paths, resulting in very high

{PESQ}_{2}

scores (approximately 4.1).

Although the proposed swPEMSC has comparable signal quality compared to the H-NLMS, it achieves the lowest average MIS, the highest ASG, and faster convergence/tracking rates for most scenarios.

Note that in experiments 2 and 3 the average misalignment (

{\bar{M I S}}_{i}

) and average added stable gain (

{\bar{A S G}}_{i}

) corresponding to the ith feedback path (

i = 1, 2

) are computed over 30 s (i.e., 480,000 samples) of each realisation, while in experiment 4 those values are computed over 20 s (i.e., 160,000 samples).

8. Discussion

In this study, the MIS, ASG, and PESQ measures are adopted to evaluate the performance of different AFC approaches, namely, the swPEMSC, H-NLMS, PEMSC-APA, and PEMSC-NLMS for different types of incoming signals and an abrupt change of the feedback path. Moreover, this study evaluates the convergence/tracking rates based on the needed time for each considered approach reaching a certain level of normalised misalignment. Experimental results indicate that the proposed swPEMSC achieves a further improvement in the initial convergence and re-convergence than the state-of-the-art H-NLMS for most scenarios. The reason for faster re-convergence is that the swPEMSC uses an APA adaptive filter update with a high step-size and an optimised project order when unstable/howling status is detected, whereas the H-NLMS uses a standard NLMS. Furthermore, the swPEMSC and H-NLMS outperform the PEMSC-APA and PEMSC-NLMS in terms of convergence/tracking rates as well as average MIS and average ASG.

The results also show that there is a dependency of the projection order (for AFC approaches using the APA algorithm) on the performance. This is expected because the projection order of an APA algorithm depends on the signal characteristics. It is observed that the proposed approach employing a fixed pair of step-sizes

μ_{1} = 0.0008

,

μ_{2} = 0.8

, and a projection order of 6 (

P = 6

) provides better performance than itself with

P = 2, 4, 8

for both speech and music input signals as well as for a sudden change of the feedback path.

Generally, the proposed swPEMSC achieves much better speech quality than PEMSC-APA and PEMSC-NLMS in most mentioned scenarios due to its fast convergence and tracking abilities. It also yields similar speech quality compared to the H-NLMS.

9. Conclusions

In this paper, we propose a new and practical way to improve the AFC performance in open-fit hearing aids. The proposed swPEMSC is developed based on a new update rule for estimating adaptive filter coefficients. This update rule allows a switch from the PEMSC-NLMS to the PEMSC-APA when the system goes from a converged status to an unstable/howling status, and vice versa. This switch is controlled by a control signal produced by using the SCSD. Experimental results show that the proposed swPEMSC outperforms other state-of-the-art AFC approaches such as the PEMSC-NLMS, PEMSC-APA, and H-NLMS for different types of incoming signals (e.g., speech and music) and an abrupt change of feedback paths. In particular, the proposed approach achieves a significant performance improvement in terms of convergence/tracking rates, average MIS and average ASG compared to baselines in most scenarios. It also obtains high perceptual speech quality. The PESQ score as well as the ability to recover from the instability/howling of the proposed approach are comparable to those of the H-NLMS but much better than those of the PEMSC-NLMS and PEMSC-APA. However, the improvements from using swPEMSC come at an increased cost in computational complexity.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/audiolres11030037/s1.

Author Contributions

L.T.T.T.’s contributions include conceptualisation, methodology, data curation, formal analysis, investigation, software and writing—original draft preparation. S.E.N.’s contributions include methodology, writing—review and editing. Both authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available in the Supplementary Materials here.

Conflicts of Interest

The authors declare no conflict of interest.

References

Siqueira, M.G.; Alwan, A. Steady-state analysis of continuous adaptation in acoustic feedback reduction systems for hearing-aids. IEEE Trans. Speech Audio Process. 2000, 8, 443–453. [Google Scholar] [CrossRef]
Spriet, A.; Doclo, S.; Moonen, M.; Wouters, J. Feedback control in hearing aids. In Springer Handbook of Speech Processing; Springer: Berlin/Heidelberg, Germany, 2008; pp. 979–1000. [Google Scholar]
Van Waterschoot, T.; Moonen, M. Fifty Years of Acoustic Feedback Control: State of the Art and Future Challenges. Proc. IEEE 2011, 99, 288–327. [Google Scholar] [CrossRef]
Hellgren, J.; Forssell, U. Bias of feedback cancellation algorithms in hearing aids based on direct closed loop identification. IEEE Trans. Speech Audio Process. 2001, 9, 906–913. [Google Scholar] [CrossRef]
Laugesen, S.; Hansen, K.V.; Hellgren, J. Acceptable delays in hearing aids and implications for feedback cancellation. J. Acoust. Soc. Am. 1999, 105, 1211–1212. [Google Scholar] [CrossRef]
Kates, J. Feedback cancellation in hearing aids. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Albuquerque, NM, USA, 3–6 April 1990; pp. 1125–1128. [Google Scholar]
Guo, M.; Jensen, S.H.; Jensen, J. Novel acoustic feedback cancellation approaches in hearing aid applications using probe noise and probe noise enhancement. IEEE Trans. Audio Speech Lang. Process. 2012, 20, 2549–2563. [Google Scholar] [CrossRef]
Guo, M.; Elmedyb, T.B.; Jensen, S.H.; Jensen, J. On acoustic feedback cancellation using probe noise in multiple-microphone and single-loudspeaker systems. IEEE Signal Process. Lett. 2012, 19, 283–286. [Google Scholar] [CrossRef]
Nakagawa, C.R.C.; Nordholm, S.; Yan, W.Y. Feedback cancellation with probe shaping compensation. IEEE Signal Process. Lett. 2014, 21, 365–369. [Google Scholar] [CrossRef]
Schroeder, M.R. Improvement of Acoustic-Feedback Stability by Frequency Shifting. J. Acoust. Soc. Am. 1964, 36, 1718–1724. [Google Scholar] [CrossRef]
Strasser, F.; Puder, H. Adaptive feedback cancellation for realistic hearing aid applications. IEEE/ACM Trans. Audio Speech Lang. Process. 2015, 23, 2322–2333. [Google Scholar] [CrossRef]
Guo, M.; Jensen, S.H.; Jensen, J.; Grant, S.L. On the use of a phase modulation method for decorrelation in acoustic feedback cancellation. In Proceedings of the European Signal Processing Conference (EUSIPCO), Bucharest, Romania, 27–31 August 2012; pp. 2000–2004. [Google Scholar]
Hellgren, J. Analysis of feedback cancellation in hearing aids with filtered-X LMS and the direct method of closed loop identification. IEEE Trans. Speech Audio Process. 2002, 10, 119–131. [Google Scholar] [CrossRef]
Spriet, A.; Proudler, I.; Moonen, M.; Wouters, J. Adaptive feedback cancellation in hearing aids with linear prediction of the desired signal. IEEE Trans. Signal Process. 2005, 53, 3749–3763. [Google Scholar] [CrossRef]
Tran, L.T.T.; Dam, H.H.; Nordholm, S. Affine Projection Algorithm For Acoustic Feedback Cancellation Using Prediction Error Method In Hearing Aids. In Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), Xi’an, China, 13–16 September 2016. [Google Scholar]
Rombouts, G.; Van Waterschoot, T.; Moonen, M. Robust and Efficient Implementation of the PEM-AFROW Algorithm for Acousic Feedback Cancellation. J. Audio Eng. Soc. 2007, 55, 955–966. [Google Scholar]
Tran, L.T.T.; Schepker, H.; Doclo, S.; Dam, H.H.; Nordholm, S. Proportionate NLMS for adaptive feedback control in hearing aids. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, New Orleans, LA, USA, 5–9 March 2017. [Google Scholar]
Gil-Cacho, J.M.; van Waterschoot, T.; Moonen, M.; Jensen, S.H. Transform domain prediction error method for improved acoustic echo and feedback cancellation. In Proceedings of the European Signal Processing Conference (EUSIPCO), Bucharest, Romania, 27–31 August 2012; pp. 2422–2426. [Google Scholar]
Tran, L.T.T.; Nordholm, S.E.; Schepker, H.; Dam, H.H.; Doclo, S. Two-microphone hearing aids using prediction error method for adaptive feedback control. IEEE/ACM Trans. Audio Speech Lang. Process. 2018, 26, 909–923. [Google Scholar] [CrossRef]
Spriet, A.; Rombouts, G.; Moonen, M.; Wouters, J. Adaptive feedback cancellation in hearing aids. Elsevier J. Frankl. Inst. 2006, 343, 545–573. [Google Scholar] [CrossRef]
Bernardi, G.; van Waterschoot, T.; Wouters, J.; Moonen, M. An all-frequency-domain adaptive filter with PEM-based decorrelation for acoustic feedback control. In Proceedings of the Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA, 18–21 October 2015; pp. 1–5. [Google Scholar]
Bernardi, G.; van Waterschoot, T.; Wouters, J.; Hillbratt, M.; Moonen, M. A PEM-based frequency-domain Kalman filter for adaptive feedback cancellation. In Proceedings of the 23rd European Signal Processing Conference (EUSIPCO), Nice, France, 31 August–4 September 2015; pp. 270–274. [Google Scholar]
Schepker, H.; Tran, L.T.T.; Nordholm, S.; Doclo, S. Improving Adaptive Feedback Cancellation in Hearing Aids Using an Affine Combination of Filters. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Shanghai, China, 20–25 March 2016. [Google Scholar]
Tran, L.T.T.; Schepker, H.; Doclo, S.; Dam, H.H.; Nordholm, S. Frequency Domain Improved Practical Variable Step-Size for Adaptive Feedback Cancellation Using Pre-Filters. In Proceedings of the 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC), Tokyo, Japan, 17–20 September 2018; pp. 171–175. [Google Scholar]
Yang, F.; Wu, M.; Ji, P.; Yang, J. An improved multiband-structured subband adaptive filter algorithm. IEEE Signal Process. Lett. 2012, 19, 647–650. [Google Scholar] [CrossRef]
Strasser, F.; Puder, H. Sub-band feedback cancellation with variable step sizes for music signals in hearing aids. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, Italy, 4–9 May 2014; pp. 8207–8211. [Google Scholar]
Khoubrouy, S.A.; Panahi, I.M.S. An Efficient Delayless Sub-band Filtering for Adaptive Feedback Compensation in Hearing Aid. J. Signal Process. Syst. 2016, 83, 401–409. [Google Scholar] [CrossRef]
Pradhan, S.; Patel, V.; Somani, D.; George, N.V. An improved proportionate delayless multiband-structured subband adaptive feedback canceller for digital hearing aids. IEEE/ACM Trans. Audio Speech Lang. Process. 2017, 25, 1633–1643. [Google Scholar] [CrossRef]
Nakagawa, C.R.C.; Nordholm, S.; Yan, W.Y. Dual microphone solution for acoustic feedback cancellation for assistive listening. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 25–30 March 2012; pp. 149–152. [Google Scholar]
Nakagawa, C.R.C.; Nordholm, S.; Yan, W.Y. Analysis of Two Microphone Method for Feedback Cancellation. IEEE Signal Process. Lett. 2015, 22, 35–39. [Google Scholar] [CrossRef]
Tran, L.T.T.; Nordholm, S.; Dam, H.H.; Yan, W.Y.; Nakagawa, C.R. Acoustic Feedback Cancellation in hearing aids using two microphones employing variable step size affine projection algorithms. In Proceedings of the IEEE International Conference on Digital Signal Processing (DSP), Singapore, 21–24 July 2015; pp. 1191–1195. [Google Scholar]
Albu, F.; Nakagawa, R.; Nordholm, S. Proportionate algorithms for two-microphone active feedback cancellation. In Proceedings of the 23rd European Signal Processing Conference (EUSIPCO), Nice, France, 31 August–4 September 2015; pp. 290–294. [Google Scholar]
Schepker, H.; Nordholm, S.E.; Tran, L.T.T.; Doclo, S. Null-steering beamformer-based feedback cancellation for multi-microphone hearing aids with incoming signal preservation. IEEE/ACM Trans. Audio Speech Lang. Process. 2019, 27, 679–691. [Google Scholar] [CrossRef]
Schepker, H.; Nordholm, S.; Doclo, S. Acoustic Feedback Suppression for Multi-Microphone Hearing Devices Using a Soft-Constrained Null-Steering Beamformer. IEEE/ACM Trans. Audio Speech Lang. Process. 2020, 28, 929–940. [Google Scholar] [CrossRef]
Lee, S.; Kim, I.Y.; Park, Y.C. Approximated affine projection algorithm for feedback cancellation in hearing aids. Comp. Methods Programs Biomed. 2007, 87, 254–261. [Google Scholar] [CrossRef] [PubMed]
Lee, K.; Baik, Y.h.; Park, Y.; Kim, D.; Sohn, J. Robust adaptive feedback canceller based on modified pseudo affine projection algorithm. In Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Boston, MA, USA, 30 August–3 September 2011; pp. 3760–3763. [Google Scholar]
Pradhan, S.; Patel, V.; Patel, K.; Maheshwari, J.; George, N.V. Acoustic feedback cancellation in digital hearing aids: A sparse adaptive filtering approach. Appl. Acoust. 2017, 122, 138–145. [Google Scholar] [CrossRef]
Thipphayathetthana, S.; Chinrungrueng, C. Variable step-size of the least-mean-square algorithm for reducing acoustic feedback in hearing aids. In Proceedings of the IEEE Asia-Pacific Conference on Circuits and Systems, Tianjin, China, 4–6 December 2000; pp. 407–410. [Google Scholar]
Rotaru, M.; Albu, F.; Coanda, H. A variable step size modified decorrelated NLMS algorithm for adaptive feedback cancellation in hearing aids. In Proceedings of the International Symposium on Electronics and Telecommunications, Timisoara, Romania, 15–16 November 2012; pp. 263–266. [Google Scholar]
Tran, L.T.T.; Schepker, H.; Doclo, S.; Dam, H.H.; Nordholm, S. Improved Practical Variable Step-Size Algorithm For Adaptive Feedback Control in Hearing Aids. In Proceedings of the IEEE International Conference on Signal Processing and Communication Systems, Surfers Paradise, QLD, Australia, 19–21 December 2016. [Google Scholar]
Albu, F.; Tran, L.T.T.; Nordholm, S. A combined variable step size strategy for two microphones acoustic feedback cancellation using proportionate algorithms. In Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Kuala Lumpur, Malaysia, 12–15 December 2017; pp. 1373–1377. [Google Scholar]
Tran, L.T.T.; Schepker, H.; Doclo, S.; Dam, H.H.; Nordholm, S.E. Adaptive feedback control using improved variable step-size affine projection algorithm for hearing aids. In Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Kuala Lumpur, Malaysia, 12–15 December 2017; pp. 1633–1640. [Google Scholar]
Bhattacharjee, S.S.; George, N.V. Fast and Efficient Acoustic Feedback Cancellation based on Low Rank Approximation. Signal Process. 2021, 182, 107984. [Google Scholar] [CrossRef]
Nordholm, S.; Schepker, H.; Tran, L.T.T.; Doclo, S. Stability-controlled hybrid adaptive feedback cancellation scheme for hearing aids. J. Acoust. Soc. Am. 2018, 143, 150–166. [Google Scholar] [CrossRef]
Ozeki, K.; Umeda, T. An adaptive filtering algorithm using an orthogonal projection to an affine subspace and its properties. Electron. Commun. Jpn. 1984, 67, 19–27. [Google Scholar] [CrossRef]
Paleologu, C.; Benesty, J.; Ciochina, S. A variable step-size affine projection algorithm designed for acoustic echo cancellation. IEEE Trans. Audio Speech Lang. Process. 2008, 16, 1466–1478. [Google Scholar] [CrossRef]
Sayed, A.H. Fundamental Adaptive Filtering; John Wiley & Sons: Hoboken, NJ, USA, 2003. [Google Scholar]
Shynk, J.J. Frequency-domain and multirate adaptive filtering. IEEE Signal Process. Mag. 1992, 9, 14–37. [Google Scholar] [CrossRef]
Sankowsky-Rothe, T.; Blau, M.; Schepker, H.; Doclo, S. Reciprocal measurement of acoustic feedback paths in hearing aids. J. Acoust. Soc. Am. 2015, 138, EL399–EL404. [Google Scholar] [CrossRef]
Kates, J.M. Room reverberation effects in hearing aid feedback cancellation. J. Acoust. Soc. Am. 2001, 109, 367–378. [Google Scholar] [CrossRef]
Loizou, P.C. Speech Enhancement: Theory and Practice; CRC Press: Boca Raton, FL, USA, 2013. [Google Scholar]
Deller, J.R.; Proakis, J.G.; Hansen, J.H. Discrete Time Processing of Speech Signals; Prentice Hall PTR: Upper Saddle River, NJ, USA, 1993. [Google Scholar]

Figure 1. Typical structure of a hearing aid with AFC.

Figure 2. Typical structure of a hearing aid.

Figure 3. PEM-AFC model.

Figure 4. The model of the proposed method.

Figure 5. Normalised misalignment of the PEMSC-NLMS with

μ \in [0.1, 0.005, 0.001]

, speech input, feedback path changes from free-field (

F_{1}

) to telephone-near (

F_{2}

).

Figure 5. Normalised misalignment of the PEMSC-NLMS with

μ \in [0.1, 0.005, 0.001]

, speech input, feedback path changes from free-field (

F_{1}

) to telephone-near (

F_{2}

).

Figure 6. Normalised misalignment of the PEMSC-NLMS, PEMSC-APA with

P \in [2, 4, 6, 8]

, speech input, feedback path changes from

F_{1}

to

F_{2}

.

Figure 6. Normalised misalignment of the PEMSC-NLMS, PEMSC-APA with

P \in [2, 4, 6, 8]

, speech input, feedback path changes from

F_{1}

to

F_{2}

.

Figure 7. Measured feedback paths: (a) Amplitude responses and (b) phase.

Figure 8. Incoming signals: (a) Concatenated speech and (b) music.

Figure 9. Performance of the proposed method with

P \in [2, 4, 6, 8]

, speech input, feedback path changes from free-field (

F_{1}

) to telephone-near (

F_{2}

): (a) MIS; (b) ASG.

Figure 9. Performance of the proposed method with

P \in [2, 4, 6, 8]

, speech input, feedback path changes from free-field (

F_{1}

) to telephone-near (

F_{2}

): (a) MIS; (b) ASG.

Figure 10. Performance of the proposed method with

P \in [2, 4, 6, 8]

, music input, feedback path changes from free-field (

F_{1}

) to telephone-near (

F_{2}

): (a) MIS; (b) ASG.

Figure 10. Performance of the proposed method with

P \in [2, 4, 6, 8]

, music input, feedback path changes from free-field (

F_{1}

) to telephone-near (

F_{2}

): (a) MIS; (b) ASG.

Figure 11. Compare performance of the proposed method with state-of-the art AFC methods using speech input, a sudden change of the feedback path from free-field (

F_{1}

) to telephone-near (

F_{2}

) after 30 s: (a) MIS; (b) ASG.

Figure 11. Compare performance of the proposed method with state-of-the art AFC methods using speech input, a sudden change of the feedback path from free-field (

F_{1}

) to telephone-near (

F_{2}

) after 30 s: (a) MIS; (b) ASG.

Figure 12. Compare howling periods in output signal of the proposed approach with baselines, speech incoming signal, a sudden change of the feedback path from

F_{1}

to

F_{2}

after 30 s.

Figure 12. Compare howling periods in output signal of the proposed approach with baselines, speech incoming signal, a sudden change of the feedback path from

F_{1}

to

F_{2}

after 30 s.

Figure 13. Performance of the proposed method state-of-the art AFC methods using music input, a sudden change of the feedback path from free-field (

F_{1}

) to telephone-near (

F_{2}

) after 30 s: (a) MIS; (b) ASG.

Figure 13. Performance of the proposed method state-of-the art AFC methods using music input, a sudden change of the feedback path from free-field (

F_{1}

) to telephone-near (

F_{2}

) after 30 s: (a) MIS; (b) ASG.

Figure 14. Compare howling periods in output signal of the proposed approach with baselines, music incoming signal, a sudden change of the feedback path from

F_{1}

to

F_{2}

after 30 s.

Figure 14. Compare howling periods in output signal of the proposed approach with baselines, music incoming signal, a sudden change of the feedback path from

F_{1}

to

F_{2}

after 30 s.

Figure 15. The measured feedback path for Experiment 4: (a) Amplitude response and (b) phase.

Figure 16. Speech segments mentioned in Experiment 4.

Figure 17. Compare performance of the proposed method with state-of-the art AFC methods using 5 segments of speech input, a sudden change of the feedback path from

F_{1}

to

F_{2}

after 20 s, MIS (a) and ASG (b) were average values computed over 5 segments of speech input.

Figure 17. Compare performance of the proposed method with state-of-the art AFC methods using 5 segments of speech input, a sudden change of the feedback path from

F_{1}

to

F_{2}

after 20 s, MIS (a) and ASG (b) were average values computed over 5 segments of speech input.

Table 1. Computational complexity per output sample.

AFC Methods	Computational Complexity	#
PEMSC-NLMS	$M + 3 L_{\hat{f}} + 2$	263
PEMSC-APA	$M + (P^{2} + 2 P) L_{\hat{f}} + P^{3} + P$	3363
H-NLMS	$M + 2 (3 L_{\hat{f}} + 2)$	457
swPEMSC	$M + 3 L_{\hat{f}} + 2 + (P^{2} + 2 P) L_{\hat{f}} + P^{3} + P$	3557

A numerical value is given for

N = 20

,

L = 160

,

L_{\hat{f}} = 64

, and

P = 6

.

Table 2. Evaluate performance of PEMSC-NLMS, PEMSC-APA, H-NLMS, and swPEMSC for different types of the incoming signals, feedback path changes from

F_{1}

to

F_{2}

after half of the simulation time,

κ_{1} = - 15

dB,

κ_{2} = - 16

dB (for recorded speech input);

κ_{1} = - 11

dB,

κ_{2} = - 14.5

dB (for recorded music input); and

κ_{1} = κ_{2} = - 13.5

dB (for 5 segments of speech input).

Table 2. Evaluate performance of PEMSC-NLMS, PEMSC-APA, H-NLMS, and swPEMSC for different types of the incoming signals, feedback path changes from

F_{1}

to

F_{2}

after half of the simulation time,

κ_{1} = - 15

dB,

κ_{2} = - 16

dB (for recorded speech input);

κ_{1} = - 11

dB,

κ_{2} = - 14.5

dB (for recorded music input); and

κ_{1} = κ_{2} = - 13.5

dB (for 5 segments of speech input).

AFC Methods	Incoming Signals	${\bar{MIS}}_{1}$ [dB]	${\bar{ASG}}_{1}$ [dB]	$τ_{κ_{1}}$ [s]	${\bar{MIS}}_{2}$ [dB]	${\bar{ASG}}_{2}$ [dB]	$τ_{κ_{2}}$ [s]
PEMSC-NLMS		−16.023	17.656	8.312	−20.115	21.194	4.780
PEMSC-APA	recorded speech	−18.176	19.586	1.545	−20.562	21.385	1.180
H-NLMS [44]		−19.278	21.260	2.715	−21.750	23.175	1.186
swPEMSC		−20.016	21.786	0.304	−22.487	23.638	0.367
PEMSC-NLMS		−11.493	14.066	6.540	−16.568	17.354	6.149
PEMSC-APA	recorded music	−13.341	14.970	1.205	−17.073	18.406	0.779
H-NLMS [44]		−13.358	15.860	0.193	−17.946	19.278	0.441
swPEMSC		−13.847	15.804	0.144	−19.465	19.837	0.105
PEMSC-NLMS		−14.655	15.034	2.103	−14.339	14.424	3.088
PEMSC-APA	5 speech segments	−14.768	14.906	1.775	−14.417	14.175	2.795
H-NLMS [44]		−15.254	15.897	0.203	−15.446	15.662	0.169
swPEMSC		−16.810	17.936	0.251	−16.043	17.227	0.131

Table 3. PESQ measures of the PEMSC-NLMS, PEMSC-APA, H-NLMS, and swPEMSC with a sudden change of feedback paths from

F_{1}

to

F_{2}

after half of the simulation time, recorded speech and 5 segments of speech input as incoming signals.

Table 3. PESQ measures of the PEMSC-NLMS, PEMSC-APA, H-NLMS, and swPEMSC with a sudden change of feedback paths from

F_{1}

to

F_{2}

after half of the simulation time, recorded speech and 5 segments of speech input as incoming signals.

AFC Methods	Incoming Signals	${PESQ}_{1}$	${PESQ}_{2}$
PEM-NLMS		1.652	3.013
PEM-APA	recorded speech	2.067	3.448
H-NLMS [44]		4.167	4.047
swPEMSC		4.218	3.729
PEM-NLMS		4.132	1.646
PEM-APA	5 speech segments	4.124	1.890
H-NLMS [44]		4.077	4.129
swPEMSC		4.134	4.075

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Tran, L.T.T.; Nordholm, S.E. A Switched Algorithm for Adaptive Feedback Cancellation Using Pre-Filters in Hearing Aids. Audiol. Res. 2021, 11, 389-409. https://doi.org/10.3390/audiolres11030037

AMA Style

Tran LTT, Nordholm SE. A Switched Algorithm for Adaptive Feedback Cancellation Using Pre-Filters in Hearing Aids. Audiology Research. 2021; 11(3):389-409. https://doi.org/10.3390/audiolres11030037

Chicago/Turabian Style

Tran, Linh Thi Thuc, and Sven Erik Nordholm. 2021. "A Switched Algorithm for Adaptive Feedback Cancellation Using Pre-Filters in Hearing Aids" Audiology Research 11, no. 3: 389-409. https://doi.org/10.3390/audiolres11030037

Article Menu

A Switched Algorithm for Adaptive Feedback Cancellation Using Pre-Filters in Hearing Aids

Abstract

1. Introduction

2. Hearing Aids and Acoustic Feedback Problem

3. Standard AFC Approach

4. PEM-AFC

5. Proposed Method

6. Computational Complexity

7. Experimental Results

8. Discussion

9. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI