De-identify sensitive information from ETL Job design and diagnostic information
Publication Date: 2014-Jul-25
The IP.com Prior Art Database
Extract-Transform-Load (ETL) job designs and diagnostic information, such as run-time logs, may contain sensitive information: the name of the developer who created the job, the IP addresses of the source and target database machines, user names and passwords (in encrypted form), the OS user who created the job, and the host name of the machine on which the job was created. This disclosure gives an example of collecting the sensitive information from the ETL job design and diagnostic information and de-identifying it. To keep the job design and the related run-time logs consistent and understandable, every occurrence of a given sensitive value is replaced with the same substitute value wherever it appears. The identified sensitive information is also persisted in a dynamic data dictionary for organization-wide reuse.
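The consistency requirement can be illustrated with a minimal sketch. It is not the disclosed implementation: the class `DataDictionary`, the function `deidentify`, the placeholder format (`USER_1`, `IP_2`), the JSON file used for persistence, and the IPv4-only detection pattern are all assumptions made for illustration. Values that cannot be detected by pattern (developer names, host names) are assumed to be supplied by the caller as `known_values`.

```python
import json
import re
from pathlib import Path

# Assumption: only IPv4 addresses are detected by pattern; all other
# sensitive values must be listed explicitly by the caller.
IPV4 = r"\b(?:\d{1,3}\.){3}\d{1,3}\b"


class DataDictionary:
    """Hypothetical data dictionary: one stable placeholder per sensitive value."""

    def __init__(self, path=None):
        self.path = Path(path) if path else None
        self.mapping = {}
        # Reload earlier entries so placeholders stay stable across runs.
        if self.path and self.path.exists():
            self.mapping = json.loads(self.path.read_text())

    def placeholder_for(self, value, kind):
        # A new value gets the next free number; a known value reuses its entry.
        if value not in self.mapping:
            self.mapping[value] = f"{kind}_{len(self.mapping) + 1}"
        return self.mapping[value]

    def save(self):
        if self.path:
            self.path.write_text(json.dumps(self.mapping, indent=2))


def deidentify(text, dictionary, known_values=None):
    """Replace known values and IPv4 addresses in a single pass over the text."""
    known_values = known_values or {}  # maps value -> kind, e.g. {"alice": "USER"}
    # Longer values first, so a short value never pre-empts a longer one.
    parts = [re.escape(v) for v in sorted(known_values, key=len, reverse=True)]
    parts.append(IPV4)
    pattern = re.compile("|".join(parts))

    def replace(match):
        value = match.group(0)
        kind = known_values.get(value, "IP")
        return dictionary.placeholder_for(value, kind)

    return pattern.sub(replace, text)


if __name__ == "__main__":
    dictionary = DataDictionary()  # pass a file path to persist the mapping
    known = {"alice": "USER"}
    print(deidentify("developer=alice source=10.0.0.5", dictionary, known))
    # developer=USER_1 source=IP_2
    print(deidentify("alice connected to 10.0.0.5", dictionary, known))
    # USER_1 connected to IP_2
```

Because both the job design and the log are processed against the same dictionary, the developer name and the source address receive the same placeholders in both texts, so the two artifacts can still be read side by side. Calling `save()` on a dictionary created with a file path keeps the mapping available for later runs.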