Browser applications let users search within a webpage. Such searches are limited to the textual content of the page and do not take into account content rendered as images within the webpage. This disclosure describes techniques that enable search over the entire webpage, including image content. With user permission, images on a webpage are analyzed to determine whether they contain text. If text is detected, optical character recognition (OCR) techniques are applied to recognize the characters. When the user enters a query to search within the page, both the text content of the webpage and the text obtained via OCR from its images are compared with the query. Matching terms are highlighted for the user; a match inside an image is highlighted using the bounding box of the matching term.
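The matching step described above can be illustrated with a minimal Python sketch. It is not the disclosure's implementation. It assumes OCR has already been run (with user permission) and has produced, for each image, a list of recognized words with pixel bounding boxes. The names `OcrWord`, `ImageMatch`, `find_in_text`, `find_in_images`, and `union_box` are hypothetical and chosen only for illustration, as are the case-insensitive, whole-word matching rules applied to image text.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom) in image pixels


@dataclass(frozen=True)
class OcrWord:
    """Hypothetical OCR output: one recognized word and its bounding box."""
    text: str
    box: Box


@dataclass(frozen=True)
class ImageMatch:
    """A query match inside an image, to be highlighted via its box."""
    image_id: str
    box: Box


def find_in_text(page_text: str, query: str) -> List[Tuple[int, int]]:
    """Return (start, end) offsets of case-insensitive matches in page text.

    Assumes lowercasing does not change string length (true for ASCII).
    """
    matches: List[Tuple[int, int]] = []
    if not query:
        return matches
    haystack, needle = page_text.lower(), query.lower()
    start = haystack.find(needle)
    while start != -1:
        matches.append((start, start + len(needle)))
        start = haystack.find(needle, start + 1)
    return matches


def union_box(boxes: List[Box]) -> Box:
    """Smallest box that encloses every box in the list."""
    return (
        min(b[0] for b in boxes),
        min(b[1] for b in boxes),
        max(b[2] for b in boxes),
        max(b[3] for b in boxes),
    )


def find_in_images(ocr_results: Dict[str, List[OcrWord]], query: str) -> List[ImageMatch]:
    """Match the query against consecutive OCR words in each image.

    Assumption: a match requires whole-word, case-insensitive equality;
    a multi-word match is highlighted with the union of its word boxes.
    """
    tokens = query.lower().split()
    matches: List[ImageMatch] = []
    if not tokens:
        return matches
    n = len(tokens)
    for image_id, words in ocr_results.items():
        for i in range(len(words) - n + 1):
            window = words[i:i + n]
            if [w.text.lower() for w in window] == tokens:
                matches.append(ImageMatch(image_id, union_box([w.box for w in window])))
    return matches


# Example data (invented for illustration only).
page_text = "Sale ends Friday. Big sale today."
ocr_results = {
    "banner.png": [
        OcrWord("SUMMER", (10, 5, 90, 30)),
        OcrWord("SALE", (100, 5, 150, 30)),
    ]
}
text_hits = find_in_text(page_text, "sale")
image_hits = find_in_images(ocr_results, "sale")
```

In this sketch, `text_hits` holds character ranges that a browser could highlight in the page text, and `image_hits` holds per-image boxes that could be drawn as overlays on the corresponding images. Substring or fuzzy matching, which would tolerate OCR errors, is left out to keep the example short.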
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.
Sharifi, Matthew, "In-page search over text and images in a web page", Technical Disclosure Commons, (February 05, 2019)