Voice as Sound: Using Non-verbal
Voice Input for Interactive Control

with John F. Hughes

Demonstration Movie [AVI 41MB] [Quicktime 8MB]
Demonstration software [VoiceGames] (Japanese only)

We describe the use of non-verbal features of voice for direct control of interactive applications. Traditional speech recognition interfaces are based on an indirect, conversational model: the user first gives a command, and the system then performs the corresponding operation. Our goal is to achieve more direct, immediate interaction, similar to using a button or joystick, by using lower-level features of voice such as pitch and volume. We are developing several prototype interaction techniques based on this idea, such as "control by continuous voice," "rate-based parameter control by pitch," and "discrete parameter control by tonguing." We have implemented several prototype systems, and they suggest that voice-as-sound techniques can enhance traditional voice recognition approaches.
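As a rough illustration of the three techniques named above, the following Python sketch maps a stream of per-frame voice features to control signals. It is not the prototype implementation: the Frame type, the volume threshold, the gain, the choice of the pitch at voice onset as the reference, and all function names are assumptions made for this example, and pitch and volume are assumed to have been extracted from the audio beforehand.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Frame:
    """One analysis frame of voice input (hypothetical representation)."""
    volume: float              # loudness in arbitrary units, 0.0 = silence
    pitch_hz: Optional[float]  # estimated pitch, None when no pitch is found


VOLUME_THRESHOLD = 0.1  # assumed cutoff between silence and voice


def is_voiced(frame: Frame, threshold: float = VOLUME_THRESHOLD) -> bool:
    """A frame counts as voice when its volume reaches the threshold."""
    return frame.volume >= threshold


def continuous_control(frames: List[Frame]) -> List[bool]:
    """Control by continuous voice: the control is on while the sound lasts."""
    return [is_voiced(f) for f in frames]


def rate_control(frames: List[Frame], start: float = 0.0,
                 gain: float = 0.01) -> List[float]:
    """Rate-based control by pitch (assumed mapping).

    The pitch at the onset of each utterance is taken as a reference.
    While the voice continues, the parameter changes at a rate
    proportional to how far the current pitch is from that reference.
    """
    value = start
    reference: Optional[float] = None
    history = []
    for f in frames:
        if is_voiced(f) and f.pitch_hz is not None:
            if reference is None:
                reference = f.pitch_hz  # first voiced frame sets the reference
            value += gain * (f.pitch_hz - reference)
        else:
            reference = None  # silence ends the utterance
        history.append(value)
    return history


def count_tongued_sounds(frames: List[Frame]) -> int:
    """Discrete control by tonguing: count silence-to-voice onsets."""
    count = 0
    previous = False
    for f in frames:
        current = is_voiced(f)
        if current and not previous:
            count += 1
        previous = current
    return count


if __name__ == "__main__":
    demo = [Frame(0.5, 200.0), Frame(0.5, 220.0),
            Frame(0.0, None), Frame(0.5, 200.0)]
    print(continuous_control(demo))
    print(rate_control(demo))
    print(count_tongued_sounds(demo))
```

In this sketch, raising the pitch during an utterance increases the parameter and lowering it decreases the parameter, while short separated sounds such as "ta ta ta" are simply counted. A real system would also need to smooth the pitch estimate and ignore very short noise bursts.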



Copyright (c) 2001 Takeo Igarashi