Design and Implementation of a SoPC System for Speech Recognition

This paper presents the design of a System on Programmable Chip (SoPC) based on Field Programmable Gate Array (FPGA) for speech recognition in which Mel-Frequency Cepstral Coefficients (MFCC) for speech feature extraction and Vector Quantization (VQ) for

PDF / 1,746,766 Bytes
7 Pages / 439.37 x 666.142 pts Page_size
2 Downloads / 257 Views

DOWNLOAD

REPORT

Abstract This paper presents the design of a System on Programmable Chip (SoPC) based on Field Programmable Gate Array (FPGA) for speech recognition in which Mel-Frequency Cepstral Coefficients (MFCC) for speech feature extraction and Vector Quantization (VQ) for recognition are used. The execution speed of the blocks in the speech recognition system is surveyed by calculating the number of clock cycles while executing each block. Keywords Speech recognition

MFCC VQ SoPC FPGA Nios

1 Introduction Speech recognition system is applied in many application fields such as health care, military, human computer interaction, avionics technicians… [1], especially, the applications which support disabled people to communicate with the world in a better way. For that reason, there are many studies on software/hardware implementation of speech recognition systems for many years. However, because of a large number of accents spoken around the world, there are still many challenges that need further research and development, for example, Vietnamese speech recognition. T. Van Hoang N. L. T. Truong H. Trang (&) University of Technology, Vietnam National University, HoChiMinh City, Vietnam e-mail: [email protected] T. Van Hoang e-mail: [email protected] N. L. T. Truong e-mail: [email protected] X.-T. Tran VNU University of Engineering and Technology, 144 Xuan Thuy, Hanoi, Vietnam e-mail: [email protected]

J. J. (Jong Hyuk) Park et al. (eds.), Multimedia and Ubiquitous Engineering, Lecture Notes in Electrical Engineering 240, DOI: 10.1007/978-94-007-6738-6_147, Springer Science+Business Media Dordrecht(Outside the USA) 2013

1197

1198

T. Van Hoang et al.

The research on speech recognition going mainly in two directions, namely: the software runs on Personal Computers (PCs) and embedded systems. For the first direction, many studies and software tools have been developed successfully. In particular, the Hidden Markov Model Toolkit (HTK) is a toolkit for building Hidden Markov Models (HMMs) used in speech recognition successfully [2]. There are also many tools running on the PC or smart phone aimed at the control device via speech. For the second direction, embedded systems have many advantages as high performance, convenience, low cost, and great development potential. However, speech recognition research based on embedded systems is more difficult. This paper will present the implementation of a speech recognition system as an embedded system using FPGA technology. In fact, the implementation of speech recognition systems has been done using FPGA technology in recent years. In paper [3], speech recognition systems are implemented as hardware/software co-design systems using Hidden Markov Model (HMM). This project use Linear Predictive Coding (LPC) method in feature extraction block. So, the recognition accuracy is not high compared with the MFCC method. In paper [4], the MFCC method is applied, but the optimization was not taken into account yet to increase performance. Another work, presented in [5

Data Loading...

Design and Implementation of a SoPC System for Speech Recognition

Recommend Documents

Isolated Word Automatic Speech Recognition System

Automatic speech recognition: a survey

Design and Implementation of Autonomous Navigation System for Mobile Harbors

Sparse Representations for Speech Recognition

Pattern Recognition for Speech Detection

Speech Recognition

Deep Learning for NLP and Speech Recognition

A Novel Idea for Designing a Speech Recognition System Using Computer Vision Object Detection Techniques

Adversarial Examples Attack and Countermeasure for Speech Recognition System: A Survey

Noise effect on Amazigh digits in speech recognition system

Design and implementation of a measurement system for automatically measurement of electrical parameters of thermoelectr

Automatic Speech Recognition of Galo