Hi,
I purchased the LD3320 MP3 player and Chinese speech recognizer module made by icroute (www.icroute.com/) from eBay a few months ago. The recognizer is initialized with the Pinyin transliterations of the Chinese phrases to be recognized; the module then listens to the audio on its input channel and tries to match what it hears against that programmed list. MP3 playback is supported as well, with headphone/line-out (stereo) and speaker (mono) outputs.
Although the PCB that I purchased seems to have some serious issues (wrong pinout labeling and an ECM microphone connected in reverse polarity), I finally got it to work. I decided to make a demo video showing the module working with a PIC microcontroller and an ST7735 128x160 16-bit color LCD that displays the speech recognition results. It shows the module trying to recognize proper names in Chinese (bei jing [北京], shang hai [上海], hong kong [香港], chong qing [重庆], tian an men [天安门]) and other words such as "a li ba ba". A single beep means that the speech was recognized, while a double beep indicates unrecognized speech. Although the recognition quality depends heavily on the input audio, volume level and other environmental conditions, the overall detection sensitivity and selectivity seem satisfactory, as can be seen in the video.
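For anyone who wants to reproduce the display and beep behaviour, here is a rough sketch of the logic in Python. It is not the PIC firmware from the video and it does not talk to the LD3320 at all. The names `KEYWORDS` and `handle_result` are made up for illustration, and it assumes the recognizer reports either the position of the matched phrase in the programmed list or nothing at all.

```python
# Hypothetical model of the demo's keyword list and beep feedback.
# Nothing here communicates with the LD3320; all names are illustrative.

# (pinyin, Chinese text) pairs taken from the post.
KEYWORDS = [
    ("bei jing", "北京"),
    ("shang hai", "上海"),
    ("hong kong", "香港"),
    ("chong qing", "重庆"),
    ("tian an men", "天安门"),
    ("a li ba ba", None),  # no Chinese text is given in the post
]


def handle_result(index):
    """Return (text_for_lcd, beep_count) for one recognition attempt.

    index is assumed to be the position of the matched phrase in
    KEYWORDS, or None when nothing matched.
    """
    if index is None or not 0 <= index < len(KEYWORDS):
        return ("not recognized", 2)  # double beep: unrecognized speech
    pinyin, hanzi = KEYWORDS[index]
    label = pinyin if hanzi is None else f"{pinyin} [{hanzi}]"
    return (label, 1)  # single beep: recognized
```

Calling `handle_result(0)` gives the text for the first phrase plus a beep count of 1, while `handle_result(None)` gives the fallback text and a beep count of 2. Out-of-range indices are treated as unrecognized so that a bad result can never index past the end of the list.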
The end of the video shows the stereo playback of an MP3 song stored on the SD card - using a PAM8403 amplifier whose output is fed into two 8-ohm loudspeakers. The playback quality seems good and comparable to the VS1053 module.
Let me know your thoughts. I have yet to see a similar IC for English speech recognition.
Attached to this thread is the board (front and back).