Abstract

This disclosure aims to allow the user the possibility to extract the template of a document using only a picture of it. Our method can be described as follows: 1. The document image is identified in the scene (when mobile scanning, for instance); 2. The document image is cropped from the scene; 3. The document image is segmented into its different regions: e.g., title, image, graphic, content text; 4. For each region, an algorithm (e.g., a ML model) will be used to extract the region template features and an OCR engine will extract the text from the region; 5. The template features will be processed by another algorithm capable of a. matching those “unknown” features to known ones, or b. return a template code format (e.g., Latex format); 6. The template regions and extracted texts are joined in a customizable software (e.g., Microsoft Word or TeXstudio), so the user can modify it.

Creative Commons License

This work is licensed under a Creative Commons Attribution-Share Alike 4.0 License.

Recommended Citation

INC, HP, "A WORKFLOW FOR DOCUMENT TEMPLATE EXTRACTION USING MACHINE LEARNING", Technical Disclosure Commons, (September 01, 2022)
https://www.tdcommons.org/dpubs_series/5351

Download

COinS

Technical Disclosure Commons

Defensive Publications Series

A WORKFLOW FOR DOCUMENT TEMPLATE EXTRACTION USING MACHINE LEARNING

Abstract

Creative Commons License

Recommended Citation

Browse

Search

Submit

Additional Information

Technical Disclosure Commons

Defensive Publications Series

A WORKFLOW FOR DOCUMENT TEMPLATE EXTRACTION USING MACHINE LEARNING

Inventor(s)

Abstract

Creative Commons License

Recommended Citation

Share

Browse

Search

Submit

Additional Information