ICSI Speech FAQ:
2.2 What are the basic approaches to speech recognition?

Answer by: dpwe - 2000-07-22


Essentially all speech recognition systems use the same basic three-stage architecture:

These stages are illustrated in the following overview block diagram:

Of course, particular systems may blur the line between these stages, for instance by involving the subword likelihood estimation as a part of the search for well-matched word sequences.

Systems can vary at any of these stages.

At ICSI we have historically used neural nets as our acoustic models - the so-called 'hybrid connectionist' approach pioneered by Morgan and Bourlard - rather than the more common Gaussian mixture models. For a discussion of why, see the next FAQ answer.


Previous: 2.1 What is speech recognition? - Next: 2.3 Why do we use connectionist rather than GMM?
Back to ICSI Speech FAQ index

Generated by build-faq-index on Tue Mar 24 16:18:13 PDT 2009