Abstract
Systems and methods are described for transforming unstructured cooking-related multimedia into actionable outputs. A client provides a video link, web link, screenshot, or photo. One or more models extract ingredients and preparation actions using combinations of speech recognition, optical character recognition, and visual recognition. A structured recipe is generated with ingredients and ordered steps, and additional attributes such as serving size, nutrition or macronutrients, and dietary tags are derived. A meal plan may be produced over a time window, and required ingredients are aggregated. Optionally, the user captures images of pantry and/or refrigerator contents via a guided flow. Inventory items are detected from the images, optionally with quantity estimates. A differential shopping list is generated by comparing required ingredients to detected inventory, and may be mapped to retailer products for cart creation and checkout.
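As a rough illustration of the aggregation and differential comparison described above, the following minimal sketch sums required ingredients across a meal plan and subtracts detected inventory. It is not the disclosed implementation. The names `Ingredient`, `aggregate_required`, and `differential_list` are hypothetical, and the sketch assumes ingredient names have already been normalized and that each ingredient uses a single unit. It also assumes that an inventory item detected without a quantity estimate covers the requirement; a real system might instead flag such items for user confirmation.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, Iterable, List, Optional


@dataclass(frozen=True)
class Ingredient:
    """Hypothetical record for one ingredient line of a structured recipe."""
    name: str
    quantity: float  # assumed to be in one shared unit per ingredient


def aggregate_required(meal_plan: Iterable[List[Ingredient]]) -> Dict[str, float]:
    """Sum required quantities per ingredient across all recipes in the plan."""
    totals: Dict[str, float] = defaultdict(float)
    for recipe in meal_plan:
        for item in recipe:
            # Simple normalization (assumption): case-insensitive, trimmed names.
            totals[item.name.strip().lower()] += item.quantity
    return dict(totals)


def differential_list(
    required: Dict[str, float],
    inventory: Dict[str, Optional[float]],
) -> Dict[str, float]:
    """Return the quantity still to buy for each ingredient that falls short.

    An inventory value of None means the item was detected without a quantity
    estimate; this sketch assumes such an item is sufficient.
    """
    shortfall: Dict[str, float] = {}
    for name, need in required.items():
        if name not in inventory:
            shortfall[name] = need          # nothing detected: buy full amount
            continue
        have = inventory[name]
        if have is None:
            continue                        # detected, quantity unknown
        if need > have:
            shortfall[name] = need - have   # buy only the difference
    return shortfall


if __name__ == "__main__":
    plan = [
        [Ingredient("Egg", 2), Ingredient("flour", 200)],
        [Ingredient("egg", 3), Ingredient("milk", 250)],
    ]
    detected = {"egg": 4.0, "milk": None}
    print(differential_list(aggregate_required(plan), detected))
```

Keying both mappings by a normalized ingredient name keeps the comparison a simple dictionary lookup. Mapping the resulting entries to retailer products would be a separate step that this sketch does not cover.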
Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.
Recommended Citation
Anonymous, "Systems and Methods for Automated Recipe Extraction from Multimedia Content with Visual Pantry Inventory Detection and Differential Shopping List Generation", Technical Disclosure Commons, ()
https://www.tdcommons.org/dpubs_series/10759