User gaze locations are tracked during an artificial reality experience. Audio content that is spatialized to a target location is presented concurrently with display of a virtual object at the target location. The audio content is adjusted such that it is temporarily spatialized to a test location for a period of time and then reverts to being spatialized to the target location. A level of realism of the artificial reality experience is evaluated using the gaze locations tracked during the period of time.
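
The flow described above can be illustrated with a short sketch. The abstract does not state how the tracked gaze locations are converted into a level of realism, so the scoring rule here is purely an assumption: the score is the fraction of gaze samples, taken during the period of time, that lie at least as close to the target location as to the test location. All names (`GazeSample`, `audio_location`, `realism_score`) are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from math import dist
from typing import List, Optional, Tuple

Point = Tuple[float, float, float]


@dataclass(frozen=True)
class GazeSample:
    t: float         # timestamp in seconds
    location: Point  # tracked gaze location


def audio_location(t: float, target: Point, test: Point,
                   start: float, duration: float) -> Point:
    """Return the location the audio is spatialized to at time t.

    The audio sits at the test location for `duration` seconds
    beginning at `start`, and at the target location otherwise.
    """
    if start <= t < start + duration:
        return test
    return target


def realism_score(samples: List[GazeSample], target: Point, test: Point,
                  start: float, duration: float) -> Optional[float]:
    """Assumed metric, not taken from the source: the fraction of gaze
    samples inside the test period that are at least as close to the
    target location as to the test location."""
    window = [s for s in samples if start <= s.t < start + duration]
    if not window:
        return None  # no gaze data was tracked during the test period
    on_target = sum(
        1 for s in window
        if dist(s.location, target) <= dist(s.location, test)
    )
    return on_target / len(window)


target = (0.0, 0.0, 1.0)  # where the virtual object is displayed
test = (1.0, 0.0, 1.0)    # temporary audio location
samples = [
    GazeSample(0.0, target),
    GazeSample(1.0, target),
    GazeSample(1.5, test),
    GazeSample(2.0, target),
    GazeSample(2.5, target),
    GazeSample(3.5, test),
]
score = realism_score(samples, target, test, start=1.0, duration=2.0)
```

Under this assumed rule, a score near 1 means the gaze stayed on the virtual object while the audio was displaced, and a lower score means the gaze followed the audio to the test location. Which of these outcomes indicates greater realism is not specified in the abstract and is left open here.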

