Abstract
Various embodiments of the present invention provide a method and system to automate data parity and retention checks between data centers with parallel data load pipelines. The method comprises, receiving one or more data entities; identifying one or more key metrics to be used; identifies one or more time dimension attribute within which the validation needs to be performed; creating one or more text files based on one or more inputs received; identification of key metrics and time dimension attribute; generates dynamically a Structured Query Language (SQL) for each of the rows from the text files as well as executes the generated scripts separately to capture the values for the corresponding datacenters; and compares the text files based on one or more metric values to get its difference and percentage of difference and updates the captured results.
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.
Recommended Citation
Ramesh, Rohini Ms, "DATA PARITY & RETENTION CHECKS IN DIFFERENT DATA CENTERS FOR FINDING DATA LEVEL MISMATCH", Technical Disclosure Commons, (November 17, 2022)
https://www.tdcommons.org/dpubs_series/5505