17th LSI Design Contests・in Okinawa  Design Specification - 1

1. Noise Cancelling Algorithm

To remove noise, there are some methods are suggested such as Microphone Array Method which is used several microphones and method using adaptive filtering to presume unknown pathway. In this simulation, the Noise Cancelling System by using Specter Subtraction Method (SS Method) which removes the noise in frequency domain is targeted.

The difference from last year is that we introduce other efficient techniques described in Sec. 1-2.

1-1. Specter Subtraction Method (SS Method)

In this section, the Noise Cancelling Algorithm is explained. The algorithm removes the noise in frequency domain. A diagram of the Noise Cancelling Algorithm is shown in Figure 1.

Figure 1e

Figure 1

First, the observed signal is assumed to be given by the sum of speech and noise. The observed signal, the speech and the noise at time n are defined as x(n), s(n) and d(n) respectively. Then, the observed signal x(n) is expressed by

Equation 1

when the speech and the noise are uncorrelated.

Next, to remove noise in frequency domain, the Discrete Fourier Transform (DFT) applies the observed signal x(n). After that, the observed signal specter X(l,k) from top to k in l-frame is expressed by

Equation 2

when S(l,k) and D(l,k) denote the speech specter and the noise specter respectively. The estimate value of speech is expressed by the observed signal specter X(l,k) as.

Equation 3

Here, G(l,k) is a specter gain, and the estimate value specter is result of the observed signal specter X(l,k) multiplied by suitable specter gain G(l,k). The ideal estimate value is expressed by

Equation 4

By substituting eq.(4) into eq.(3), the ideal specter gain G_opt (l,k) given as.

Equation 5

Then, the speech specter X(l,k) is extracted perfectly. However, the noise specter D(l,k) can’t be obtained by only the speech specter X(l,k)’s information. Therefore, for SS Method, the noise specter estimate value can be obtained by using the ‘L’ number of frames in non-speech interval. is defined as

Equation 6

The simulation of the SS method was performed by Scilab. The result is shown in Figure 2. We can download the program (here).

Figure 2a

(a)Input Signal

Figure 2b

(a)Output Signal

Figure 2

1-2. MAP method using Variable Speech Distribution

This is a new algorithm that we introduce this year. The file related to the algorithm was put in the DL_file_ver3 at here.

1-2-1. MAP Estimate Method

Object

Equation 7
Equation 8
Figure 1

Figure 3

・PDF to S and D

Equation 9
Equation 10

Suppose X=const.+D and no relation between D and S, then

Equation 11

Equation 12

Define ε in order to maximize S as

Equation 13

S can be rewritten as

Equation 14

Now we want to get following

Equation 15

Equation 16
Equation 17

From (16) and (17), we get

Equation 18

Therefore we get the spectrum gain

Equation 19

When we difine

Equation 20

・Decision-Directed Mehod

Equation 21

We often choose β=0.98 and γ(l) - 1 should be positive so that

Equation 22

・MAP Method

Most of speech signal follows Rayleigh distribution rather than Gaussian one

Equation 23
Figure 2

Figure 4

Suppose real and imaginary part of noise are uncorrelated each other with half of variance,

Equation 24

As same manner from (13) to (19), we get the spectrum gain for MAP estimate.

Equation 25

1-2-2. Speech spectrum distribution proposed by T.Lotter and P.Vary

PDF (Probability Density Function) of the speech spectrum that has been proposed by T.Lotter and P.Vary is one of the useful. According to them, the phase spectrum can be expressed approximately by an uniform distribution, the amplitude spectrum is also expressed approximately by the equation (7).

Equation 26

Γ(・) is the gamma function. ν and μ, is a parameter that determines the shape of the distribution. The figure 3 shows the PDF given by equation (26).

Figure 3

Figure 5

Figure 4

Figure 6

We get the spectrum gain for Lotter/Vary method

Equation 27

where

Equation 28

MAP estimated value of the phase spectrum and the phase spectrum of the observed signal.

1-2-3. Variable Speech Distribution

We show how to change voice PDF in Figure 5.

Figure 5

Figure 7

It is called Variable Speech PDF method that we adaptively change the shape of the speech spectrum distribution in the non-speech section and the speech section.

Equation (7) can be approximated to a Rayleigh distribution from exponential distribution by the parameter ν. We show the distribution curve which equation (7) gives on the case of each ν in Figure 6, and we fixed μ to 3.2.

Figure 6

Figure 8

When ν=0.0 Equation (7) match an exponential distribution, and when ν=2.0 (7) is approximated to a Rayleigh distribution. It means that we can approximate to the change of actual speechPDF by changing the value of ν.

The spectral gain based on the variable speech distribution is obtained by allowing variation in the spectral gain parameter of Lotter/Vary's.

Equation 29

The algorithm for determining ν(l,k) is given by

Equation 30
Equation 31

N is the number of FFT spectrum, and α is a parameter for adjusting the size of ν~(l,k).


The simulation of the MAP method was performed by Scilab. The result is shown in Figure 7. You can download the program here.

Figure 7

Figure 9

References

[1] T. Lotter, P. Vary ; Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model, EURASIP Journal on Applied Signal Processing, 2005.

[2] A. Kawamura, M. Kurosaki ; 大容量化するマルチメディア・データを転送・保存・活用するために ディジタル音声&画像の圧縮/伸張/加工技術, in H. Ochi, CQ Publishing Co., 2013.