Under Construction
Meeting Recorder: Portable Speech Recognition
Meeting Recorder Notes
- Vector Processing
- A vector processor (such as IRAM) allows certain types of regular
computations (e.g., multimedia processing) to be
performed very efficiently both in terms of execution time and
energy consumption. Many components of speech recognition are highly
regular, and may be efficiently executed on a vector architecture.
- Spontaneous Speech
- Normal speech between people, as opposed to read speech, prepared
speech, or speech to a computer. Spontaneous speech contains many
disfluencies, including false starts, restarts, repeated words, filled
pauses ("uh", "um"), etc. Recognizing spontaneous speech is
much harder than recognizing dictated speech.
- Function Words
- The words in an utterance that do not carry semantic
meaning. Informally, the 200 or so most common English words,
including "a", "the", "an", "of", etc. Function words tend to be
monosyllabic and unstressed in English. Speech recognition systems
typically perform poorly on function words. Information retrieval
systems usually remove all function words, both from the corpus and the
query.
- Content Words
- The words in an utterance that carry the semantic content or
meaning. Informally, all the non-function words. Content words tend to
be longer than function words, and also tend to be stressed in
English. As a result, speech recognition systems usually perform
better on content words than on function words.
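The function word / content word distinction above can be illustrated with a small filter of the kind an information retrieval system might apply to both corpus and query. This is only a minimal sketch: the `FUNCTION_WORDS` set is a hypothetical, heavily shortened stand-in for the 200 or so words mentioned above, and the names `split_words` and `remove_function_words` are made up for this example.

```python
# Hypothetical, heavily shortened function-word list (an assumption for
# illustration; a real list would hold roughly 200 common English words).
FUNCTION_WORDS = {"a", "an", "the", "of", "to", "in", "on", "and", "or", "is", "for"}


def split_words(utterance):
    """Split an utterance into (content_words, function_words).

    Tokenization is naive: lowercase, split on whitespace, and strip
    surrounding punctuation from each token.
    """
    content, function = [], []
    for token in utterance.lower().split():
        word = token.strip(".,;:!?\"'")
        if not word:
            continue  # token was punctuation only
        if word in FUNCTION_WORDS:
            function.append(word)
        else:
            content.append(word)
    return content, function


def remove_function_words(text):
    """Keep only the content words, as an IR system might before indexing."""
    return split_words(text)[0]
```

For example, `remove_function_words("The agenda of the meeting is on the board.")` keeps only `agenda`, `meeting`, and `board`. Applying the same filter to both the corpus and the query keeps the two consistent.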