Recent Consultancy Work ofMan-Wai Mak1. Speech Recognition Software Modules for Multimedia Language Learning Tools This project develops a speech recognition engine that is designed to be integrated into speech recognition and language learning systems. The engine consists of two classes: CWaveAudio and CSphRec. The first class is for collecting audio data from audio devices and the second class is for recognizing the audio data. The engine is based on the continuous density hidden Markov models, which is currently the most effective models for speech recognition. The consultancy service also provides examples of how to use the engine for isolated-word speech recognition. A demonstration
of speech recognition using the software modules
2. Speech Codec for Multimedia Applications This project is to assist a company to develop a CELP-based speech decoding chip. The flow of the algorithms, its bit allocation patterns and software implementation are broken down into manageable pieces. Software-based encoders/decoders are also provided to enable the company to evaluate and compare CELP codecs and LPC-10 codecs.
3. CELP-Based Encoders for Consumer Electronics Products This project is to assist a company to develop a CELP-based speech encoding chip. The flow of the algorithms, its bit allocation patterns and software implementation are broken down into manageable pieces. Software-based encoders/decoders are also provided.
4. Measurement of Xylophones’ Frequency Spectra
Last update: 7 April 2003M.W. Mak's Homepagehttp://www.eie.polyu.edu.hk/~mwmak/mypage.htm |