
Tell your friends about this item:
Speech Recognition by Man and Machine: Influence of Speaking Rate, Style, and Effort on the Recognition Performance of Human Listeners and Automatic Classifiers
Bernd T. Meyer
Speech Recognition by Man and Machine: Influence of Speaking Rate, Style, and Effort on the Recognition Performance of Human Listeners and Automatic Classifiers
Bernd T. Meyer
While human listeners have little problems in dealing with the strong variation in spoken language, the same cannot be said about automatic speech recognition (ASR). This work compares recognition performance of man and machine with the aim of learning from the distinct errors between these two. Based on the differences, the signal processing mechanisms are analyzed that are suitable to increase the robustness of ASR. The comparison focuses on the influence of intrinsic variation of speech, i.e., changes in speaking rate, effort and style, as well as dialect and accent. The outcome of the experiments suggests that the processing of temporal cues in ASR bears room for improvement. Therefore, spectro-temporal features are employed as input to ASR systems, which results in an increase of recognition performance for varying speaking effort and speaking style compared to standard features. This documents the usefulness of spectro-temporal and temporal information for automatic recognizers.
Media | Books Paperback Book (Book with soft cover and glued back) |
Released | November 3, 2010 |
ISBN13 | 9783838121550 |
Publishers | Suedwestdeutscher Verlag fuer Hochschuls |
Pages | 140 |
Dimensions | 226 × 8 × 150 mm · 213 g |
Language | English |
See all of Bernd T. Meyer ( e.g. Paperback Book )