Abstract

 

The present disclosure relates to a method and a system for generating canonical features for normalized entity keys in distributed transaction aggregation. Transaction data comprising a plurality of transaction records is received, each transaction record being associated with an entity identifier derived from one or more entity attributes. The transaction records are aggregated using a normalized merchant entity key, which consolidates multiple entity identifiers corresponding to the same merchant entity. An intermediate data structure is then generated for each normalized merchant entity key and each selected attribute column, and is stored in an intermediate provenance table. A selection function is applied to the intermediate provenance table to deterministically select a canonical attribute value, and a final canonical data structure comprising one record per normalized merchant entity key is generated. The disclosed approach thereby reduces data redundancy and computational overhead while providing a consistent, scalable, and reliable framework for normalized entity-level transaction aggregation across distributed systems.
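The flow described above can be illustrated with a minimal single-process sketch. It is not the disclosed implementation. The abstract does not specify the key-normalization logic, the layout of the intermediate data structure, or the selection function, so the following are assumptions made only for illustration: the hypothetical lookup `ENTITY_TO_KEY`, the attribute columns `name` and `city`, a per-value occurrence count as the intermediate structure, and a "most frequent value, ties broken alphabetically" selection rule.

```python
from collections import Counter, defaultdict

# Assumption: a precomputed lookup from raw entity identifiers to a
# normalized merchant entity key. The abstract does not say how it is built.
ENTITY_TO_KEY = {"e1": "m-001", "e2": "m-001", "e3": "m-002"}

# Assumption: the attribute columns selected for canonicalization.
ATTRIBUTE_COLUMNS = ("name", "city")


def build_provenance_table(records):
    """Count observed values per (normalized key, attribute column).

    Each Counter stands in for the intermediate data structure; the
    returned mapping stands in for the intermediate provenance table.
    """
    table = defaultdict(Counter)
    for rec in records:
        key = ENTITY_TO_KEY[rec["entity_id"]]
        for col in ATTRIBUTE_COLUMNS:
            table[(key, col)][rec[col]] += 1
    return table


def select_canonical(value_counts):
    """Assumed selection rule: highest count wins, ties go to the
    lexicographically smallest value, so the result does not depend
    on the order in which records arrive."""
    return min(value_counts.items(), key=lambda kv: (-kv[1], kv[0]))[0]


def build_canonical_records(records):
    """Produce one record per normalized merchant entity key."""
    table = build_provenance_table(records)
    totals = defaultdict(float)
    for rec in records:
        totals[ENTITY_TO_KEY[rec["entity_id"]]] += rec["amount"]
    final = {}
    for (key, col), counts in table.items():
        row = final.setdefault(key, {"total_amount": totals[key]})
        row[col] = select_canonical(counts)
    return final


# Illustrative records only; not data from the disclosure.
RECORDS = [
    {"entity_id": "e1", "name": "Acme Coffee", "city": "Austin", "amount": 4.5},
    {"entity_id": "e2", "name": "ACME COFFEE #12", "city": "Austin", "amount": 3.0},
    {"entity_id": "e1", "name": "Acme Coffee", "city": "Austin", "amount": 5.0},
    {"entity_id": "e3", "name": "Bolt Hardware", "city": "Dallas", "amount": 20.0},
]

CANONICAL = build_canonical_records(RECORDS)
```

In this sketch the per-value counts are additive, so partial provenance tables computed on separate partitions could be merged by summing counters before the selection step. That is one plausible reading of how the intermediate table would support distributed aggregation; the abstract itself does not state the merge strategy.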

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.
