A system is disclosed that monitors a person's hands as the person moves around their surroundings and protects the hands from contamination or harm. The person is immersed in a mixed reality (MR) environment and wears a head-mounted display (HMD) device. The HMD device includes an imaging device and a scene categorization unit. Video captured by the imaging device is fed continuously to the scene categorization unit, which classifies the scene of the surroundings into a scene category. The scene category can be a sensitive location (for example, a restroom, a medical room, or an industrial scene) or a safe location. If the scene is classified as a sensitive location, an object recognition algorithm is used to identify objects and surfaces in each frame of the video. A contamination detection unit then performs template matching of each surface and object identified in each frame against a first library of images and a second library of images. The first library contains images of sample objects and sample surfaces that are prone to contamination by germs. The second library contains images of sample objects, such as tools, materials, and equipment, that are harmful or dangerous to the person. Based on the results of the template matching, a contamination level or a danger level, determined from an analysis of the surfaces and the objects, is displayed as a first virtual overlay on the objects and surfaces identified in each frame of the video. Thereafter, if the person moves closer to the objects or surfaces while unprotected, a second virtual overlay is displayed that warns the person against approaching them. The system carries out the entire monitoring process described above continuously.
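
The per-frame flow described above can be illustrated with a minimal Python sketch. It is not the disclosed implementation. Every name in it (`process_frame`, `Detection`, `Overlay`, the callable parameters, and the `warn_distance_m` threshold) is hypothetical and introduced only for illustration. The scene classifier, object recognizer, and the two template matchers are passed in as plain callables, and the sketch assumes that a hand-to-object distance estimate and a "hands protected" flag are available from elsewhere in the system.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Optional

SENSITIVE = "sensitive"  # hypothetical label for a sensitive location
SAFE = "safe"            # hypothetical label for a safe location


@dataclass
class Detection:
    """An object or surface found in a frame (hypothetical structure)."""
    label: str
    distance_m: float  # assumed: estimated hand-to-object distance in meters


@dataclass
class Overlay:
    """What the HMD would draw for one detection (hypothetical structure)."""
    label: str
    kind: str     # "contamination" or "danger"
    level: float  # level reported by the matcher
    warn: bool    # True when the second (warning) overlay should be shown


def process_frame(
    frame: Any,
    classify_scene: Callable[[Any], str],
    detect: Callable[[Any], List[Detection]],
    match_contamination: Callable[[Detection], Optional[float]],
    match_danger: Callable[[Detection], Optional[float]],
    hands_protected: bool,
    warn_distance_m: float = 0.5,  # assumed threshold, not from the source
) -> List[Overlay]:
    """Return the overlays for a single video frame."""
    # Safe locations need no further analysis.
    if classify_scene(frame) != SENSITIVE:
        return []

    overlays: List[Overlay] = []
    matchers = (
        ("contamination", match_contamination),  # first library of images
        ("danger", match_danger),                # second library of images
    )
    for det in detect(frame):
        for kind, matcher in matchers:
            level = matcher(det)
            if level is None:
                continue  # no match in this library
            # The warning overlay applies only to unprotected hands that
            # have come within the assumed distance threshold.
            warn = (not hands_protected) and det.distance_m <= warn_distance_m
            overlays.append(Overlay(det.label, kind, level, warn))
    return overlays
```

In a continuous monitoring loop, a function like this would be called once per captured frame, with each returned `Overlay` rendered on the corresponding object or surface. Keeping the classifier, recognizer, and matchers as injected callables is a design choice of the sketch only; it lets each stage be replaced or stubbed independently.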

