I had an interesting thought cross my mind today.
We are currently designing and building our first data warehouse. This is also the first warehouse project that I have been involved in.
In a few of our source systems, data is removed in two different ways. Some data is deleted, in the sense that we pretend it never existed in the first place. Other data is purged, which I am going to define as data that is still relevant but has been removed from the system to free up space (our purging routines include creating text reports of the purged data, which are then stored in our reports archive).
In an ideal world, data that is "deleted" would be tombstoned, i.e., marked as no longer relevant and, if needed, removed further down the track. As I am finding with a lot of our legacy systems, though, this is rarely the case.
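To make the distinction concrete, here is a rough sketch of what I mean by tombstoning with two separate classifications. Everything in it is hypothetical and only for illustration: the names `RemovalKind`, `WarehouseRow`, `tombstone` and `is_reportable` are made up, and the rule that purged rows remain reportable while deleted rows do not is just one possible policy, not something our systems do today.

```python
from dataclasses import dataclass, replace
from datetime import datetime
from enum import Enum
from typing import Optional


class RemovalKind(Enum):
    DELETED = "deleted"  # source system pretends the record never existed
    PURGED = "purged"    # still relevant, removed only to free up space


@dataclass(frozen=True)
class WarehouseRow:
    # Hypothetical warehouse/ODS row; field names are assumptions.
    key: str
    payload: dict
    removal_kind: Optional[RemovalKind] = None  # None: still live in the source
    removed_at: Optional[datetime] = None


def tombstone(row: WarehouseRow, kind: RemovalKind, when: datetime) -> WarehouseRow:
    """Return a copy of the row marked as removed; the payload is kept."""
    return replace(row, removal_kind=kind, removed_at=when)


def is_reportable(row: WarehouseRow) -> bool:
    """Assumed policy: live and purged rows are reportable, deleted rows are not."""
    return row.removal_kind is not RemovalKind.DELETED
```

With something like this, a purged row would still show up in historical reporting, while a deleted row would be kept but filtered out, and either could be physically removed later if needed.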
Based on what I have written above, is it common to classify data that is "deleted" differently to data that is "purged" in a warehouse and/or ODS?