Abstract

A method for resolving misspelled and phonetically variant product titles in ecommerce by applying phonetic encoding as a pre-normalization step before downstream product identity resolution. The system converts product title tokens into phonetic representations using Double Metaphone encoding, matches phonetic codes against a known-good product vocabulary, and substitutes corrected tokens before the title enters any downstream normalization or matching pipeline. This enables cross-store product resolution to handle common misspellings such as brand name variants without requiring retraining or vocabulary expansion in the downstream system.

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.

Share

COinS