Noise-Robust Speech Recognition Technology

Motorola Clamor™ software provides developers with a highly accurate, noise-robust, small vocabulary speech recognizer for use in a variety of electronic and computing applications. The Clamor software-based recognizer was designed to be specifically insensitive to ambient noise and to work with most languages. This makes it suitable for embedded command-and-control applications in phones, PDAs, stereos, set-top boxes, games, automotive equipment, and countless other products.

Speech recognition is coming of age in today's high-tech world. Mobile professionals need access to their pertinent data at all times, often in situations where having one's hands free is important. Imagine a situation in which you could navigate your voicemail system, moderate your car radio volume, or dial your cellular phone using only your voice. Clamor software enables these and many other scenarios in which mobile professionals will be able to access information on the fly.

Features:

Noise Robust - highly accurate even in noisy environments.
Works well in cars with windows open, airport lobbies or with loud music playing.
Dependable - Press-to-talk helps guarantee reliable recognition
Flexible Vocabulary - Up to 40 words or short phrases (up to two seconds in duration) active at one time.
Unlimited number of switchable dictionaries allows for expanded vocabularies.
Accommodates Multiple Users - More than one user can train the system for accurate recognition of his/her voice.
Only two repetitions needed to train each word or phrase.
Works Across Languages - Language independent system will work with virtually any language.
American and British English, French, German, Farsi, Japanese, Cantonese and Mandarin have been tested.
Embedded Platforms:

Motorola's M*CORE®, 56166 and 56800 DSP series, ARM 610/710, StrongARM™, Intel™ X86, OS-9, Windows™ 98, NT and CE. Written in ANSI C, the Clamor software system is portable to most selected platforms.

Requirements:

Microprocessor capable of 10 MIPS or more is desirable
8kHz or 16kHz sampling rate
Nonvolatile memory is required to store user training data
ARM implementation requires 32K Flash for voice templates and 20K RAM for code and buffering
Response Time- One-second average response time with 13MHz ARM 610 implementation.
0.15 second average response time with Pentium™ 133 MHz Windows 95/98 implementation.
Recognition Engine Size- Hand assembled DSP implementation code size for the recognition engine is 1KB.
Speech templates are approximately 16KB for 10 words or short phrases (or .5KB for an average two second voice template)


Windows® evaluation SDK available.

_________________________
Mark I + Mark IIa | Jeep 97 TJ
my current blog