A FPGA Implementation of Prediction Error Method for Adaptive Feedback Cancellation using Xilinx System Generator™

Marius Rotaru, Cristian Stanciu, Silviu Ciochină
Dept. of Telecommunications
University Politehnica of Bucharest
Bucharest, Romania
marius.rotaru@gmail.com, {cristian, silviu}@comm.pub.ro

Felix Albu, Henri Coandă
Electrical Engineering Department
Valahia University of Targoviste
Targoviste, Romania
{felix.albu, coanda}@valahia.ro

Abstract—This paper describes a real-time, field programmable gate array (FPGA) implementation of Feedback Cancellation (FC) system to improve the intra-cabin communication among the driver and passengers, which is typically degraded by the noisy environment and by the distance in between them. The feedback canceller, used to reduce the acoustic coupling the loudspeaker and the microphone, is based on the continuously adaptive filtering technique, implementing the prediction error method (PEM) for closed loop system identification. The adaptive algorithm implements the modified least mean square algorithm (MLMS) while for the linear prediction a fix-order linear predictor has been selected. The implementation was done using Xilinx System Generator™ (XSG)

Keywords-Adaptive Algorithm; Adaptive Feedback Cancellation; Prediction Error; MLMS; FPGA; Xilinx System Generator

I. INTRODUCTION

The acoustic feedback is a major problem of audio processing field, occurring whenever the sound is captured and reproduced in the same environment. The communication in vans and limousines between the passengers in the front and the rear is degraded due to the presence of the noise as well as the long distance between them [1]. This can be improved by using a speech reinforcement system. The simplest, one channel speech reinforcement system, picks-up the speech using a microphone, amplifies it and then plays it back to a loudspeaker. Due to the electro-acoustic coupling between loudspeakers and microphone, a closed-loop system is created. To avoid the instability (howling) of the system, a feedback canceller has to be used.

Different methods attempting to minimize the effect of acoustic feedback have been proposed in literature. They are broadly classified as feedforward suppression and feedback cancelation techniques. For the case of feedforward suppression technique, the use of notch-filter based howling suppression (NHS) represents a traditional and robust solution [1-3]. The main disadvantage of this method is that it is reactive: in order to identify and eliminate the oscillation frequencies the howling must firstly occur. A more promising solution is to use the adaptive feedback cancellation (AFC) method, which is based on the estimation

of acoustic feedback path, belongings to the class of room modeling methods. As illustrated in the Fig. 1, the feedback canceller \( F(q) \) produces an estimate \( \hat{y}(n) \) of the feedback signal \( f(n) \), obtained by filtering the loudspeaker signal \( x(n) \) with \( F(q) \), and subtracts it from the microphone signal \( d(n) \) so that ideally the clean speech signal \( s(n) \) amplified by a factor \( K \) and played back to the loudspeaker. Depending on the quality of the feedback path estimation, the feedback is almost eliminated. The most robust method to eliminate it is based on the closed loop identification theory [4] - [5]. The direct method for closed loop identification [5], named prediction-error-method (PEM) is a promising proactive solution for AFC. Prediction error for AFC (PEMAFC) algorithms has been deeply analyzed in context hearing aids applications [6-10]. Also, recently the PEMAFC method has been tested in the car scenario case [11-12]. In this paper we propose a FPGA implementation of the PEMAFC algorithm in Xilinx System Generator™, based on the configuration from [8-9] using a fix model of input signal.

The paper is organized as follows: Section II is dedicated to the description of the PEMAFC algorithm implemented on FPGA. The Section III covers the FPGA implementation aspects. In Section IV the experiments as well as the results are reported. Section V presents the conclusions of the work.

II. PREDICTION ERROR METHOD FOR AFC SYSTEM

The notation from [8] has been adopted: \( q^{-1} \) denotes the unitary delay, so that \( q^{-1} u(n) = u(n-1) \). A discrete-time filter with filter length L is represented as a polynomial \( F(q) \) in \( q \), i.e.,

![Figure 1. Adaptive Feedback cancellation structure](image-url)
\begin{equation}
F(q) = f_0 + f_1 q^{-1} + \cdots + f_{L-1} q^{-L+1},
\end{equation}

or by its vector \( \mathbf{f} = [f_0, f_1, \ldots, f_{L-1}]^T \), so that the filtering operation consists of applying the polynomial to the input sequence:

\begin{equation}
F(q)x(n) = \mathbf{f}^T \mathbf{x}(n),
\end{equation}

with \( \mathbf{x}(n) = [x(n), x(n-1), \ldots, x(n-L+1)]^T \).

As proposed in the [6], [7] and reiterated in [8], the direct method of the closed-loop identification of feedback path \( F(q) \) and the desired signal model \( H(q) \) is presented in the Fig. 2. The main assumption of this method starts from the fact that the desired signal \( d(n) \) can be modeled as a \( H(q)w(n) \), where \( w(n) \) is the white noise signal and the \( H(q) \) is the desired signal model, which is inversely stable \( (A(q) = H^{-1}(q)) \). In such case the bias in the feedback path estimation can be eliminated by decorrelating the signal of the adaptive algorithm by passing them through \( A(q) \). In case of PEM, the feedback path \( F(q) \) and the desired signal model \( H(q) \) are estimated by minimizing the energy of so calling prediction error \( e_p(n) \):

\begin{equation}
e_p(n) = \hat{H}^{-1}(q)(d(n) - \hat{F}(q)x(n)),
\end{equation}

i.e.,

\begin{equation}
J(\hat{f}(n)) = E\{ |\hat{H}^{-1}(q)(d(n) - \hat{F}(q)x(n))|^2 \}
= E\{ |d_p(n) - \hat{f}^T(n)x_p(n)|^2 \}.
\end{equation}

Minimizing (4) generates:

\begin{equation}
\hat{f}(n) = E\{x_p(n)x_p^T(n)\}^{-1}E\{x_p(n)d_p(n)\}.
\end{equation}

Writing \( d_p(n) \) as:

\begin{equation}
d_p(n) = \hat{H}^{-1}(q)s(n) + F(q)x(n).
\end{equation}

When \( H(q) = \hat{H}(q) \),

\begin{equation}
d_p(n) = w(n) + F(q)x(n),
\end{equation}

and the speech signal \( s(n) \) is converted to a white noise signal \( w(n) \), resulting an unbiased feedback path estimate.

From (4) it can be observed that minimizing \( J(\hat{f}(n)) \) is equivalent with performing an adaptive filtering on the decorrelated (pre-whitened) signals \( \{d_p(n), x_p(n)\} \) or equivalently on \( \{e_p(n), \bar{x}_p(n)\} \).

Both fixed and adaptive estimates of the desired signal model \( H^{-1}(q) \) have been considered in literature [9]. Since our attention is focused on the FPGA implementation, in this paper we choose the fixed model, representing the averaging of speech spectrum, defined by:

\begin{equation}
H(q) = \frac{1}{1 - \alpha q^{-1}}, \quad |\alpha| < 1
\end{equation}

The adaptive filtering algorithm selected to perform these minimization is the Modified LMS (MLMS) [13], which is particularly suited for adaptive systems whose performance suffers from the presence of strong target signals (such as speech) that exhibit large fluctuations in short-time power levels. The sum version of this algorithm has been selected, with the following formula for updating the filter coefficients:

\begin{equation}
w(n+1) = w(n) + e(n)x(n)f[x(n), e(n)],
\end{equation}

\begin{equation}
f[x(n), e(n)] = \mu(n) = \frac{\mu}{L(\hat{\sigma}_e^2(n) + \hat{\sigma}_s^2(n)) + \epsilon},
\end{equation}

where \( L \) is the length of adaptive filter, \( \mu \) is an adaptation constant and \( \hat{\sigma}_e^2 \) and \( \hat{\sigma}_s^2 \) are the power estimates of the error respectively of the reference signal. They can be obtained as follow:

\begin{equation}
\hat{\sigma}^2_{e,s}(n) = \lambda \hat{\sigma}^2_{e,s}(n-1) + (1-\lambda)(e^2(n), x^2(n)),
\end{equation}

where \( \lambda \) is a weighting factor chosen as \( \lambda = 1/(KL) \), with \( K > 1 \). A Summary of PEMAFC algorithm is described in the Table 1.

<table>
<thead>
<tr>
<th>TABLE 1.</th>
<th>TIME-DOMAIN PEMAFC ALGORITHM FOR THE FIXED ORDER LP</th>
</tr>
</thead>
<tbody>
<tr>
<td>Initialization:</td>
<td>( L, \epsilon, \mu, \lambda, \hat{f}(0) = 0, e(0) = 0 ),</td>
</tr>
<tr>
<td>Computation:</td>
<td>for each input sample ( x(n), n = 1, 2, \ldots )</td>
</tr>
<tr>
<td>( e(n) = d(n) - \hat{f}^T(n)x(n); )</td>
<td></td>
</tr>
<tr>
<td>( x(n) = Ke(e(n)D) )</td>
<td></td>
</tr>
<tr>
<td>Pre-filter ( e(n) ) and ( x(n) ) with ( A(q) )</td>
<td></td>
</tr>
<tr>
<td>( e_p(n) = e(n) - \alpha e(n-1), \quad x_p(n) = x(n) - \alpha x(n-1) ),</td>
<td></td>
</tr>
<tr>
<td>Estimate the power of pre-filtered signals</td>
<td></td>
</tr>
<tr>
<td>( \hat{\sigma}^2(n) = \lambda \hat{\sigma}^2(n-1) + (1-\lambda)(e^2_p(n) + x^2_p(n)) ),</td>
<td></td>
</tr>
<tr>
<td>Update the coefficients</td>
<td></td>
</tr>
<tr>
<td>( \hat{f}(n+1) = \hat{f}(n) + \frac{\mu}{L \hat{\sigma}^2_e + \epsilon} e_p(n)x_p(n) ).</td>
<td></td>
</tr>
<tr>
<td>End</td>
<td></td>
</tr>
</tbody>
</table>
III. FPGA IMPLEMENTATION

This section describes the adaptation of the PEMAFC algorithm on a realizable hardware platform suitable for automotive environment. Considering the PEMAFC based Feedback Cancellers a subcomponent of a speech reinforcement system including a noise cancellation component as well as a voice activity detector, the target platform must be cost effective with a relatively high performance DSP. The Xilinx Automotive (XA) Spartan®-6 families of FPGA was used [14].

A. Design Process

The PEMAFC algorithm was originally developed as MATLAB scripts using high precision floating-point arithmetic. The SoNoScout NVH Binaural recording and analysis system 653A and Sound Level Meter 2250L were also used. A one-to-one conversion to an equivalent FPGA implementation cannot be directly or easily implemented with reasonable resource utilization. Also, the precision of data in the FPGA implementation is limited to a fixed number of bits (fixed-point representation) which results in the addition of quantization errors to the system.

The first step of the implementation consists of identifying the parts of the algorithm, which became the main blocks of the hardware. A fixed point version of these blocks has been implemented using Fixed-Point Toolbox™ in Matlab® [15] and then compared to the floating point version.

Following the Matlab fixed point validation, a Xilinx System Generator™ (XSG) [16] model was developed. Each block has been individually validated by passing the data to/from Matlab workspace as well as using the “scope” block in the model by connecting the signal of interest to it. Once the XSG model has been created it has been validated against Matlab implementation. Finally, the design has been synthesized using the Xilinx ISE 13.4 design suite and run on the FPGA target in the “hardware-in-the-loop co-simulation”.

B. The Hardware implementation

The implementation of the PEMAFC algorithm is done on a Spartan®6 FPGA, i.e., the XC6SLX45[15]. The system clock frequency is approximately 100 MHz and the sampling frequency is 16 kHz; consequently, there are approximately 6250 clock periods available between two successive samples.

The flow of the algorithm is described in Fig.3. In the first phase, the output of the adaptive MAC filter y(n) is generated and the error signal e(n) is computed. In the second phase a fixed order pre-whitening of error signal e(n) and its delayed and amplified version x(n) is realized based on (8). A power estimate (10) of both decorrelated error and reference signals is generated in the third step. In the fourth step the algorithm’s step size is computed based on the previously power estimated values (9). The last phase is dedicated updating the adaptive filter’s coefficients based on their past values, decorrelated reference and error signals x_p(n) respectively e_p(n) and the previously computed step size. The FPGA implementation of the PEMAFC algorithm requires a few RAM memory blocks, as follows.

IV. RESOURCE USAGE AND SIMULATION RESULTS

The functional simulations have been done using FPGA target in the hardware-in-the-loop co-simulation mode as in the Fig.4. The real signal was read from Matlab workspace and it was either a voice signal with an additive white Gaussian noise (SNR=30dB) or an auto-regressive noise generated by passing a white Gaussian noise through a 10 order AR system.

The sampling frequency was 16kHz and the adaptive algorithm parameters used are: \( L = 256 \) (the same length as feedback path); fix step size \( \mu = 2^{-5} \); regularization factor...
The gain and the delay of the forward path are $K=10$ dB and $D=60$ samples (3.75ms) respectively. The feedback path was simulated using Simulink FIR Filter block.

The performance measure was the normalized misalignment (in dB), defined as $20 \log_{10} \| \hat{f} - f \|_2 / \| f \|_2$, where $f$ is the true impulse response of the feedback path and $\| \cdot \|_2$ denotes the $l_2$ norm. The effect of the quantization error on the AFC performance is shown in Fig. 5.

Table II shows the resource requirement of the FPGA implementation as reported by the Xilinx ISE Foundation.

<table>
<thead>
<tr>
<th>Available Resources (total)</th>
<th>Used Resources</th>
</tr>
</thead>
<tbody>
<tr>
<td>Slices (6822)</td>
<td>896 (15%)</td>
</tr>
<tr>
<td>FFs (54,576)</td>
<td>3100 (5%)</td>
</tr>
<tr>
<td>4-LUTs (27288)</td>
<td>2895 (10%)</td>
</tr>
<tr>
<td>RAMBSB (323)</td>
<td>3 (1%)</td>
</tr>
<tr>
<td>DSP48A1x (58)</td>
<td>4 (6%)</td>
</tr>
</tbody>
</table>

**ACKNOWLEDGMENT**

This work was supported under the Grant POSDRU/107/1.5/S/76813 and Grant CNCS-UEFISCDI PN-II-ID-PCE-2011-3-0097.

**REFERENCES**