Inlämning av Examensarbete / Submission of Thesis

Mahboob ur Rahman MEE10:18, pp. 63. ING/School of Engineering, 2010.

The work

Författare / Author: Mahboob ur Rahman
mehboob17@gmail.com
Titel / Title: SPEECH RECOGNITION FOR WEB BASED TELEPHONY
Abstrakt Abstract:

Web based telephony purges the need of explicit downloading and installing a VoIP client software. Calls in web based telephony can be made directly from the browser. The combination of web technologies and traditional telephony makes it possible to introduce new exciting services. One such new service is introduced as a result of this thesis work. The voicemails received are automatically transcribed and converted into text; the text is then saved to an inbox. The performance of the introduced service is good and gives a better recognition rate in the current configuration. The speech recognition covers a continuous speech of English and a maximum vocabulary of 64 thousand words. Adobe Flash 10 has a proprietary protocol for the streaming of audio over internet. Red5 server is an open source server that has support for RTMP plug in. Red5Phone is an open source SIP phone containing a flash based client. The new service introduced is added to the existing Red5Phone solution. Speech recognition for web based telephony was investigated, developed, implemented, and tested.
Sphinx-4 is an open source state-of-the art ASR system. It is capable of keeping up with the requirement of large vocabulary transcription. Sphinx-4 was configured and integrated with the developed service for the transcription of voicemails. The performance of Sphinx-4 was rigorously evaluated before its configuration.

Ämnesord / Subject: Signalbehandling - Signal Processing
Telekommunikation - Telecommunications
Nyckelord / Keywords: Speech Recognition, VoIP, Web Telephony, RTMP, SIP, Red5Phone, Sphinx-4, Voicemail

Publication info

Dokument id / Document id:
Program:/ Programme Magisterprogram i Elektroteknik / Master of Science in Electrical Engineering
Registreringsdatum / Date of registration: 04/20/2010
Uppsatstyp / Type of thesis: Masterarbete/Master's Thesis (120 credits)

Context

Handledare / Supervisor: Jörgen Nordberg
jorgen.nordberg@bth.se
Examinator / Examiner: Jörgen Nordberg
Organisation / Organisation: Blekinge Institute of Technology
Institution / School: ING/School of Engineering

+46 455 38 50 00
I samarbete med / In co-operation with: Audio Processing and Media Transport, Multimedia Technologies, Ericsson Research Luleå, Ericsson AB

Files & Access

Bifogad uppsats fil(er) / Files attached: mahboob_final_thesis_updated-1.pdf (935 kB, öppnas i nytt fönster)