Abstract:
This paper describes three prototypes of computer-based clinical record-keeping tools
that use a combination of window-based graphics and continuous speech in their user
interfaces. Although many of today’s commercial speech-recognition products achieve
high rates of accuracy for large grammars (vocabularies of words or collections of sentences and phrases), they can only “listen
for” (and therefore recognize) a limited number of words or phrases at a time. When
a speech application requires a grammar whose size exceeds a speech-recognition product’s
limits, the application designer must partition the large grammar into several smaller
ones and develop control mechanisms that permit users to select the grammar that contains the words or phrases they wish
to utter. Furthermore, the user interfaces they design must provide feedback mechanisms that show users the scope of the selected grammars. The three prototypes described
were designed to explore the use of window-based graphics as control and feedback
mechanisms for continuous-speech recognition in medical applications. Our experiments
indicate that window-based graphics can be effectively used to provide control and
feedback for certain classes of speech applications, but they suggest that the techniques
we describe will not suffice for applications whose grammars are very complex.
Key-Words
Continuous-Speech Recognition - Multimodal Human-Computer Interfaces - Sublanguages
- Oncology