Inlämning av Examensarbete / Submission of Thesis

Hemanth Yerramsetty , pp. 64. ING/School of Engineering, 2012.

The work

Författare / Author: Hemanth Yerramsetty
y.hemanth@hotmail.com
Titel / Title: Microphone Array Wiener Beamformer and Speaker Localization With emphasis on WOLA Filter Bank
Översatt titel / Translated title: Microphone Array Wiener Beamformer and Speaker Localization With emphasis on WOLA Filter Bank
Abstrakt Abstract:

This thesis describes the design and implementation of a speech enhancement system that uses 4-channel microphone array beam forming and speech enhancement algorithms applied to a speech signal in a multiple source environment. To locate the accurate Direction of Arrival (DOA) from the source, it is necessary to design a suitable microphone array system with more efficient localization algorithm. The goal of the system is to improve the quality of the primary speech signal.
A filter bank is a signal processing tool that can facilitate manipulation of signals in the frequency domain. The WOLA (Weighted Overlap and Add) filter is an efficient method used to implement a uniformly distributed multi-channel filter bank. The WOLA is generally used in applications that demand high quality filters in term of stop band rejection and filter shape.
Beamformers work by means of steering an array of microphones towards a desired look direction through utilizing signal information rather than physically moving the array. In this research, Wiener beam former is examined the input signals are first split into frequency bands so that Wiener beam forming techniques can be used.
There are many algorithms developed for estimating the number of sources and locating the DOA, such as Bayesian algorithm, kalman filtering, Generalized Cross Correlation (GCC) and Steered Response Power (SRP) algorithm. But SRP algorithm with its steered beam forming technique for speaker localization is more robust using microphone array. The Phase Alignment Transform (PHAT) has gained a lot of attention in the recent research for its quite robust response in low noise, but reverberant environment. So combining SRP-PHAT will become the robust localizer in reverberant environment.
Experiments were done on recorded data of human talkers. The algorithm gives accurate DOA from the dominant speaker. In addition to these, listener opinion testing is performed.

Ämnesord / Subject: Signalbehandling - Signal Processing

Nyckelord / Keywords: RIR, Bemaforming, filterbank, srp-phat

Publication info

Dokument id / Document id: houn-8uqna7
Program:/ Programme Magisterprogram i Elektroteknik / Master of Science in Electrical Engineering
Registreringsdatum / Date of registration: 05/28/2012
Uppsatstyp / Type of thesis: Masterarbete/Master's Thesis (120 credits)

Context

Handledare / Supervisor: Dr. Nedelko Grbic
ngr@bth.se
Examinator / Examiner: Dr. Benny Sallberg
Organisation / Organisation: Blekinge Institute of Technology
Institution / School: ING/School of Engineering

+46 455 38 50 00

Files & Access

Bifogad uppsats fil(er) / Files attached: bth2012yerramsetty.pdf (799 kB, öppnas i nytt fönster)