The player already has a mic input for just such a purpose, there's just no software to use it.

Recognizing even just a short list of commands like that is relatively complicated, especially in a non-ideal environment... accents, noisy road, music playing, maybe you drive a topless car, knowing when you're talking to the Empeg and not to your wife about pausing for a moment to think about your daughter's play next week, etc.