Demo Data for Everyone

  • Steve Jones - SSC Editor

    SSC Guru

    Points: 715053

    Comments posted to this topic are about the item Demo Data for Everyone

  • David.Poole

    SSC Guru

    Points: 75108

    There are an increasing number of articles demonstrating that obfuscated data isn't as obfuscated as it might first appear.  The Netflix example being just one.

    Tools like Redgate SQL Data Generator can be used to generate sufficient quantities of realistic, but not real, data.  I had a lot of fun with the tool and wrote it up in https://www.red-gate.com/hub/product-learning/sql-data-generator/how-to-generate-various-forms-of-realistic-data-for-testing-development-and-prototypes

    I do agree that generating realistic data isn't easy once you step beyond "Here is a table.  Fill it". Getting the proportions and relationships right becomes ever more important as the sophistication of testing and demonstrations increases.  As the article points out, it isn't the job of an afternoon!  Setting it up correctly can be a project in its own right. The setup needs to be treated as a product and nurtured as such.
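
    To make "proportions and relationships" concrete, here is a minimal sketch in Python. It is not how SQL Data Generator works. The segment names, weights, and order limits are all hypothetical values chosen for illustration. It draws each customer's segment from a weighted distribution and only creates orders that point at a customer that already exists.

```python
import random

# Hypothetical segment mix and per-segment order limits (illustrative only).
SEGMENTS = ["retail", "small_business", "enterprise"]
WEIGHTS = [0.80, 0.15, 0.05]
MAX_ORDERS = {"retail": 3, "small_business": 10, "enterprise": 40}


def generate_customers_and_orders(n_customers, seed=0):
    """Return (customers, orders) with weighted segments and valid foreign keys."""
    rng = random.Random(seed)  # seeded so the dataset is reproducible
    customers = []
    orders = []
    for customer_id in range(1, n_customers + 1):
        # Proportions: pick the segment according to the weights above.
        segment = rng.choices(SEGMENTS, weights=WEIGHTS, k=1)[0]
        customers.append({"customer_id": customer_id, "segment": segment})
        # Relationships: every order references the customer just created,
        # and the number of orders depends on the customer's segment.
        for _ in range(rng.randint(0, MAX_ORDERS[segment])):
            orders.append({
                "order_id": len(orders) + 1,
                "customer_id": customer_id,
                "amount": round(rng.uniform(5, 500), 2),
            })
    return customers, orders
```

    Seeding the generator means the same dataset can be rebuilt on demand, which suits data that is refreshed only occasionally.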

    Putting large test datasets into some form of shared binary repository is a good strategy.  The need to refresh such data is likely to be infrequent in most cases.

  • Rod at work

    SSC-Dedicated

    Points: 33107

    Very interesting article. I never thought of automating the production of sample or test data. We never made sample/test data at my old job. In my current job, the creation of sample/test/demo data is done by my boss. I think he uses TOAD for that. Still though, I don't think the process he uses necessarily generates any edge cases. It's just random data. I appreciate your pointing out this possibility, Steve. As always, we could improve our process.
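
    One way to get edge cases into otherwise random data, sketched in Python under assumptions: the edge-case list, the ordinary names, and the 10% rate are hypothetical, and this is not a description of what TOAD does. The idea is to guarantee each edge case appears at least once and then fill the rest mostly with ordinary values.

```python
import random

# Hypothetical edge cases for a "name" column (illustrative only).
EDGE_CASE_NAMES = [
    "",            # empty string
    " ",           # whitespace only
    "O'Brien",     # embedded single quote
    "x" * 255,     # very long value
    "Ünïcödé",     # non-ASCII characters
    None,          # missing value (NULL)
]
ORDINARY_NAMES = ["Alice", "Bob", "Carol", "Dave"]


def generate_names(n, edge_case_rate=0.1, seed=0):
    """Return n names that include every edge case (when n allows) plus random filler."""
    rng = random.Random(seed)
    # Start with each edge case once so none is left to chance.
    names = list(EDGE_CASE_NAMES[:n])
    while len(names) < n:
        # Occasionally pick another edge case; otherwise pick an ordinary name.
        pool = EDGE_CASE_NAMES if rng.random() < edge_case_rate else ORDINARY_NAMES
        names.append(rng.choice(pool))
    rng.shuffle(names)  # avoid all edge cases clustering at the start
    return names
```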

    Thank you!

    Kindest Regards, Rod
