Demo Data for Everyone

As someone learning about DevOps, I follow a number of people, one of whom is Gene Kim. When I see him get excited about a post, I usually read it. That's how I found this post on Demo Data as Code. It's a short but interesting read. I think this is something more people ought to implement in their environments, and not just for demos.

DevOps is about reliability and repeatability, among other things, and both are tackled by automating a known process. We don't want simple, silly mistakes, or even complex errors, to undermine our ability to move forward and create value. We don't want simple errors eating up the time of expensive talent with unnecessary work. Part of ensuring both repeatability and reliability involves using data in our databases to evaluate our application. This isn't necessarily for demos, though it could be used for demos.

One of the areas that is often left out of the process is the data that we use in building our systems. We need some data for developers, for QA, and often for demos. In all of those cases, when humans need to repeatedly look at how well the software performs, and want to re-test things, they need some consistent data. I'd also argue that the need for agility means that we need a manageable data set. I think SQL Provision from Redgate is amazing, but I still don't want to always develop with 2TB of masked data. I certainly don't want to demo a data set that size for customers from a laptop, and I might not want to share it in the cloud.

At Redgate, we sell masking with SQL Provision, and it supports most of the process that's outlined in the Demo Data as Code article. What it needs, however, is a small set of data that can be masked in a deterministic fashion. What I recommend to most clients is that they build a known set of test data, which could be used for demos. This can include all your edge cases and show off new features. It's helpful for developers, testers, and salespeople, who will always have a known, useful set of data.
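To make "masked in a deterministic fashion" concrete, here is a minimal sketch in Python. It is not how SQL Provision works internally; it only illustrates the idea that the same real value always maps to the same fake value, so a rebuilt data set looks identical every time. The key, the name lists, and the function names are all hypothetical.

```python
import hashlib
import hmac

# Hypothetical key for illustration; a real one would live in a secrets store.
MASK_KEY = b"demo-data-masking-key"

# Hypothetical replacement values; any fixed lists would do.
FIRST_NAMES = ["Alex", "Blake", "Casey", "Devon", "Emery", "Finley"]
LAST_NAMES = ["Archer", "Brooks", "Chen", "Diaz", "Evans", "Foster"]


def _digest(value: str) -> bytes:
    """Keyed hash of the input: the same input always gives the same bytes."""
    return hmac.new(MASK_KEY, value.encode("utf-8"), hashlib.sha256).digest()


def mask_name(real_name: str) -> str:
    """Swap a real name for a fake one picked deterministically from the lists."""
    d = _digest(real_name)
    first = FIRST_NAMES[d[0] % len(FIRST_NAMES)]
    last = LAST_NAMES[d[1] % len(LAST_NAMES)]
    return f"{first} {last}"


def mask_email(real_email: str) -> str:
    """Swap a real address for a stable fake one on a reserved example domain."""
    d = _digest(real_email.lower())
    return f"user_{d.hex()[:10]}@example.com"


if __name__ == "__main__":
    # Running this twice prints the same masked values both times.
    print(mask_name("Jane Smith"), mask_email("Jane.Smith@contoso.com"))
```

Because the output depends only on the input and the key, a demo script written against the masked data keeps working after the data set is rebuilt.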

As the article emphasizes, this can't be a build-it-and-forget-it effort. The dataset will need to be altered over time. There ought to be a process to build it, likely from production data that gets sanitized. It can then be distributed through SQL Provision (or similar technology), with backups, or even as a set of scripts in your VCS. Ensure an environment can be hydrated instantly on any platform, from a developer workstation to a sales laptop to a QA server. Once you have this, everyone can work on evaluating your software from a known baseline.
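As a rough sketch of the "set of scripts in your VCS" option, the Python below rebuilds a tiny database from a schema script and a seed script. SQLite is used only so the example runs anywhere; the table names, rows, and the hydrate function are hypothetical, and in a real project the two scripts would be .sql files checked in next to the application code.

```python
import sqlite3

# Hypothetical schema script; in practice, a file in version control.
SCHEMA_SQL = """
CREATE TABLE customer (
    customer_id INTEGER PRIMARY KEY,
    name        TEXT NOT NULL,
    email       TEXT NOT NULL
);
CREATE TABLE purchase (
    purchase_id INTEGER PRIMARY KEY,
    customer_id INTEGER NOT NULL REFERENCES customer(customer_id),
    amount      NUMERIC NOT NULL
);
"""

# Hypothetical seed script: already-sanitized rows, including a few edge cases
# (a name with an apostrophe, a zero-value purchase, a very large purchase).
SEED_SQL = """
INSERT INTO customer (customer_id, name, email) VALUES
    (1, 'Alex Archer', 'user_1@example.com'),
    (2, 'Blake Brooks', 'user_2@example.com'),
    (3, 'Casey O''Chen', 'user_3@example.com');
INSERT INTO purchase (purchase_id, customer_id, amount) VALUES
    (1, 1, 19.99),
    (2, 1, 0),
    (3, 2, 250000.00);
"""


def hydrate(conn: sqlite3.Connection) -> None:
    """Drop and rebuild the demo tables so every run starts from the same baseline."""
    conn.executescript(
        "DROP TABLE IF EXISTS purchase; DROP TABLE IF EXISTS customer;"
    )
    conn.executescript(SCHEMA_SQL)
    conn.executescript(SEED_SQL)
    conn.commit()


def row_counts(conn: sqlite3.Connection) -> dict:
    """Return the number of rows in each demo table as a quick sanity check."""
    return {
        table: conn.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]
        for table in ("customer", "purchase")
    }


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    hydrate(connection)
    print(row_counts(connection))
```

Calling hydrate again on the same connection resets the tables, which is the property that lets a developer, tester, or salesperson return to the known baseline after experimenting.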

And if you find the need for more data, then just add it. You have a process, so add an additional step that will cover the holes you inevitably find.
