Certain features such as browser tab previews, low-memory frame representations, etc., use low-fidelity snapshots of webpages. Low-fidelity snapshots can be created by rasterizing the webpage or by converting it to PDF format. Screenshots can use substantial memory, lose the hyperlinks on a page, and do not provide the ability to scroll and zoom without pixelation artifacts. Typical print-to-PDF applications do not support scrolling screenshots and sub-frames.
This disclosure describes techniques to record content of the frames of a webpage, including multi-origin iframes, such that each frame (or sub-frame) of a webpage can be scrolled independently. For security, hyperlink information, e.g., the hit regions and the URLs, are stored separately from webpage recordings, and frame recording and rasterization are carried out using sandboxed processes. The techniques produce a low-memory representation of a webpage with minimal pixelation effects, with full text fidelity regardless of zoom level, and active hyperlinks and scrollable sub-frames.
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.
Vollick, Ian; Kitagawa, Calder; Kroeger, Robert; Friedman, Yaron; Birch, Sam; and Ozuysal, Yusuf, "Lightweight Representation of Multi-origin Web Content", Technical Disclosure Commons, (March 19, 2020)