A solution is provided that correctly positions a sound source for a headset user in a video call. The orientation of the user's head is first determined by emitting short ultrasonic bursts and measuring the difference in time of flight between two microphones, one at each ear. This head orientation, combined with the on-screen location of the participants, is then used in the binaural processing to render each sound source at its correct position.
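
The geometry behind the two steps above can be illustrated with a minimal Python sketch. It is not taken from the described solution and rests on several assumptions: the ultrasonic emitter sits at the screen and is far enough away to be treated as a far-field source, the ear microphones are 0.18 m apart, the speed of sound is 343 m/s, and only horizontal rotation (yaw) is considered. All function names and constants are hypothetical.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # assumption: air at roughly 20 degrees C
EAR_SPACING_M = 0.18        # assumption: distance between the two ear microphones


def head_yaw_from_tdoa(t_left_s, t_right_s,
                       ear_spacing_m=EAR_SPACING_M,
                       c=SPEED_OF_SOUND_M_S):
    """Estimate head yaw in radians from burst arrival times at each ear.

    Positive yaw means the head is turned to the right, which brings the
    left ear closer to the (assumed) emitter at the screen, so the burst
    reaches the left microphone first.
    """
    path_difference_m = c * (t_right_s - t_left_s)
    # Far-field model: path difference = ear spacing * sin(yaw).
    # Clamp so measurement noise cannot push the ratio outside asin's domain.
    ratio = max(-1.0, min(1.0, path_difference_m / ear_spacing_m))
    return math.asin(ratio)


def participant_azimuth(x_offset_m, viewing_distance_m):
    """Angle of an on-screen participant as seen from the user's position.

    x_offset_m is the horizontal offset from the screen centre (positive
    to the right); the result is in radians, positive to the right.
    """
    return math.atan2(x_offset_m, viewing_distance_m)


def render_azimuth(yaw_rad, screen_azimuth_rad):
    """Azimuth to hand to the binaural renderer, relative to the head."""
    relative = screen_azimuth_rad - yaw_rad
    # Wrap into [-pi, pi) so the renderer always gets a canonical angle.
    return (relative + math.pi) % (2.0 * math.pi) - math.pi
```

Under these conventions, a participant shown 10 degrees to the right of the screen centre, heard by a user whose head is turned 30 degrees to the right, would be rendered 20 degrees to the left of the head's forward direction. A real system would also need burst detection, synchronisation between emitter and microphones, and a binaural (HRTF-based) renderer, none of which are sketched here.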

