Disclosed herein is a system for improving audio quality in video conferencing or teleconferencing. The system involves a small digital figure or a robot, associated with a microphone, that is remote-controlled by participants. The robot automatically detects poor audio quality that interferes with intelligibility. Using automatic speech recognition, the robot processes and interprets incoming speech at the microphone to determine whether pre-determined acceptable levels of accuracy in audibility are met. The system also enables participants to signal the robot when they have difficulty understanding the discussion during the meeting. The robot then indicates the level of difficulty through gestures, which may be entertaining or direct. The system thus makes the speaker aware of his or her clarity level without disturbing the flow of the presentation, and thereby increases the effectiveness of video conferencing.
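The decision logic described above can be illustrated with a minimal sketch. It is not the disclosed implementation. It assumes that the speech recognizer reports a per-word confidence between 0 and 1, that participant signals are counted per time window, and that the robot has four gestures. The threshold value, the gesture names, and the identifiers `AudioWindow`, `difficulty_level`, and `choose_gesture` are all hypothetical.

```python
from dataclasses import dataclass
from statistics import mean
from typing import List

# Illustrative assumptions only: neither the threshold nor the gesture
# names come from the disclosure.
ACCEPTABLE_CONFIDENCE = 0.80
GESTURES = ["idle", "tilt_head", "cup_ear", "wave_arms"]


@dataclass
class AudioWindow:
    """One observation window at the microphone (hypothetical structure)."""
    word_confidences: List[float]  # per-word ASR confidence, each in [0, 1]
    signals: int                   # participants who signalled difficulty
    participants: int              # participants in the meeting


def difficulty_level(window: AudioWindow,
                     acceptable: float = ACCEPTABLE_CONFIDENCE) -> int:
    """Return a difficulty level from 0 (clear) to 3 (severe)."""
    # Automatic check: how far the mean ASR confidence falls below the
    # acceptable level, as a fraction of that level. A window with no
    # recognized words is treated as giving no evidence of a problem.
    if window.word_confidences:
        score = mean(window.word_confidences)
        asr_shortfall = max(0.0, acceptable - score) / acceptable
    else:
        asr_shortfall = 0.0

    # Participant check: fraction of participants who signalled difficulty.
    if window.participants > 0:
        signal_fraction = window.signals / window.participants
    else:
        signal_fraction = 0.0

    # Either source alone is enough to raise the level.
    severity = max(asr_shortfall, signal_fraction)
    if severity <= 0.0:
        return 0
    if severity < 0.25:
        return 1
    if severity < 0.5:
        return 2
    return 3


def choose_gesture(window: AudioWindow) -> str:
    """Map the difficulty level to one of the assumed robot gestures."""
    return GESTURES[difficulty_level(window)]
```

For example, `choose_gesture(AudioWindow([0.9], signals=3, participants=5))` selects the strongest gesture because more than half of the participants signalled difficulty, even though the recognizer's confidence is acceptable. Taking the maximum of the two measures is a design choice of this sketch; a weighted combination would serve equally well.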

