SQLServerCentral - www.sqlservercentral.com

A community of more than 1,600,000 database professionals and growing

Featured Contents

The Voice of the DBA

Anonymisation Confusion

The GDPR starts getting enforced in a few weeks. It's been law for a couple years, but the authorities have given companies time to comply. I know various entities are frantically working towards compliance as I keep getting updates to Terms of Service. My company is among them, and we are diligently ensuring we can prove that we aren't violating any rules. That's good because I'm sure fines will reduce any bonus we might earn this year.

As I've been reading over the law and talking with customers, I've learned quite a bit. Redgate Software builds products to help with compliance and we're updating guidance and information on how to work with data. As I've helped to update information and explain concepts I keep running into the term "pseudonymisation". If you listen to the podcast, you'll probably hear me struggle to pronounce it, but more importantly, I was initially confused about what this actually meant.

The Data Protection site from Ireland has a great description of how this differs from anonymisation. You can read through the document, but anonymisation means that the data can't be some how reverse engineered to find the original data. In terms of privacy, an anonymised set of my data wouldn't allow anyone to determine the data is about me.

If the data is pseudonumised, data about me would be replaces with a token, but there might be other means of discovering a data set is about me. As an example, in an eCommerce system, you might have an order key in a dev data set that's copied directly from production. However, my name would be replaced with something else, like Bob Smith. It's not apparent that it's my data, but the protection is limited. If the data were anonymised, the order key would be replaced as well to prevent reverse engineering.

Many of us have gotten used to being lax with dev and test data, often just restoring from production. It's handy, convenient, and allows you to find problems in production or verify changes using known values. The downside of this is that we have poor data security. There are no shortage of data breaches from dev and test systems. Certainly plenty of data has been lost from developer laptops as well. Even if you had your laptop encrypted, there's no real excuse for using real data in less secure environments. 

We need to learn to use anonymised data, and become comfortable with the idea of working on secure data sets. That also means we need the skills to ensure we build good, useful, valid datasets with production-like characteristics.

Steve Jones from SQLServerCentral.com

Join the debate, and respond to today's editorial on the forums


The Voice of the DBA Podcast

Listen to the MP3 Audio ( 4.2MB) podcast or subscribe to the feed at iTunes and Libsyn. feed

The Voice of the DBA podcast features music by Everyday Jones. No relation, but I stumbled on to them and really like the music.

ADVERTISEMENT
SQL Compare

The industry standard for comparing and deploying SQL Server database schemas

Trusted by 71% of Fortune 100 companies, SQL Compare is the fastest way to compare changes, and create and deploy error-free scripts in minutes. Plus you can easily find and fix errors caused by database differences. Download your free trial

Database DevOps

Database DevOps Demo Webinar

Learn how to automate your database deployments alongside your app code in this free demo webinar. Register now

Featured Contents

 

SQL Server Dynamic Online Catalog: Why Should You Care? Stairway to Exploring Database Metadata Level 1

Phil Factor from SQLServerCentral.com

In this first level, we look at the overview of what metadata is contained in each database, gain some understanding of the different types of information, and examine a few basic examples. More »


 

Free eBook: Troubleshooting SQL Server: A Guide for the Accidental DBA

Press Release from Redgate

Three SQL Server MVPs (Jonathan Kehayias, Ted Krueger and Gail Shaw) provide fascinating insight into the most common SQL Server problems, why they occur, and how they can be diagnosed using tools such as Performance Monitor, Dynamic Management Views and server-side tracing. The focus is on practical solutions for removing root causes of these problems, rather than "papering over the cracks". More »


 

Kafka Integration with HDInsight

Additional Articles from Database Journal

Microsoft HDInsight is the cloud service that deploys and provisions Hadoop clusters on the Azure cloud. It's a completely managed, open source analytics service to support enterprise needs and supports a wide variety of scenarios with the help of open source frameworks like Hadoop, Storm, Spark, R Server and Hive. More »


 

From the SQLServerCentral Blogs - Rotating encryption keys for Always Encrypted

Ed Leighton-Dick from SQLServerCentral Blogs

In my last post, I talked about the process of rotating your encryption keys. It’s just one of those routine... More »


 

From the SQLServerCentral Blogs - Changing the location of docker named volumes

Andrew Pruski from SQLServerCentral Blogs

A few weeks ago I was presenting at SQL Saturday Raleigh and was asked a question that I didn’t know... More »

Question of the Day

Today's Question (by Steve Jones):

I decide to build a short Python function from the command line that looks like this:

>>> def add99 (x):
...      x.append(99)
...      print(x)

I then execute this code from the REPL.

>>> y = [1,2,3]
>>> add99(y)
[1, 2, 3, 99]

If I now do this in the same REPL, what is returned?

y

Think you know the answer? Click here, and find out if you are right.


We keep track of your score to give you bragging rights against your peers.
This question is worth 1 point in this category: Python.

We'd love to give you credit for your own question and answer.
To submit a QOTD, simply log in to the Contribution Center.

ADVERTISEMENT

Exam Ref 70-765 Provisioning SQL Databases

Prepare for Microsoft Exam 70-765–and help demonstrate your real-world mastery of provisioning SQL Server databases both on premise and in SQL Azure. Designed for experienced IT professionals ready to advance their status, Exam Ref focuses on the critical thinking and decision-making acumen needed for success at the MCSA level. Get your copy from Amazon today.

Yesterday's Question of the Day

Yesterday's Question (by Steve Jones):

I want to create a database and set the maximum size for a database file in SQL Server 2017. I can use the MAXSIZE parameter to do this. What can I specify for the value of the parameter? (choose 3)

Answer:

  • A numerical value with "TB" appended
  • Use the keyword "unlimited"
  • Just a numerical value with no scale

Explanation:

The MAXSIZE parameter can take a numerical value with the optional sizes of KB, MB, GB, or TB. There is no percentage value. You can also choose to use "unlimited" instead of a numeric value.

Ref: CREATE DATABASE - click here


» Discuss this question and answer on the forums

Database Pros Who Need Your Help

Here's a few of the new posts today on the forums. To see more, visit the forums.

SQL Server 2017 : SQL Server 2017 - Administration

Has anyone installed SQL Server on Ubuntu 18.04? - Banging my head on this one as I cannot find a way to install SQL Server on one of my...

To use PowerBI in-premisses do I need 2017? or can I use 2016? - Hello, I was told in my company (one of the IT Architects that work with me)  that my team will have...


SQL Server 2017 : SQL Server 2017 - Development

SSIS Buffer Error - Hi, I am trying to execute a package and I am getting the following error messages: Error: A buffer failed while allocating...

Help with a simple TSQL script to open and save an Excel file. - I have an SSRS report that exports to a specific folder on a scheduled basis.  There's a problem with the...

Removing stored procedures to move to cloud - I got an interesting email today about cutting back on the reliance of the applications on databases and plans to...


SQL Server 2016 : SQL Server 2016 - Development and T-SQL

openrowset from sql server 2000 to sql server 2016 - Hello all, I have sql server 2000 :(  and i have a view that query sql server 2016 via linked server and...

DDL trigger causing inserts to fail - We've created a trigger as below to audit DDL events and now it is causing us frustrations as the following...

Odd behavior -- execution plan different when I execute in the target database vs not in the target database - So I have a query that's basically SELECT column FROM DB1.Schema.View.  The view is a bit of a mess, does...

Updating datetime2 column not working - Hello, Trying to update some datetime2 columns. UPDATE T_MYTABLE SET dateTransferred = '2018-03-26 05.00.00' where id = 1223 Causes an error.  So, I tried casting it. UPDATE...

Date Upto 28 days ahead, no weekends - Hi All, I have a statement returning data from today up to 28 days ahead, but it is including weekends. I...

Importing from XML to SQL 2014 - This is my 4th day on SQL so I am a newbie. I created a query to import from an...


SQL Server 2014 : Administration - SQL Server 2014

SQLJOBVIS - anyone still have a copy? - This is the nifty old utility talked about in this article. http://www.sqlservercentral.com/blogs/chadmiller/2010/05/15/sql-agent-visual-job-schedule-viewer/ It seems the company and the website no longer...


SQL Server 2012 : SQL 2012 - General

Query optimization - Hi Guys, I have to optimize a query that takes hours to run. I cancelled the query because it takes a...

FileStream Encryption - I am evaluating encryption methods for filestream as I am currently using TDE for non file data but filestream data...


SQL Server 2008 : T-SQL (SS2K8)

OUTPUT clause with INSERT - Hello, Is there anyway I can replicate this within an INSERT/OUTPUT statement: declare @ID as

Number of Days between two Day Names - I have a table with a Start Day and a End Day Column ID StartDay EndDay 1 Monday Friday 2 Tuesday Wednesday 3 Friday...


SQL Server 2008 : SQL Server Newbies

SSIS error: version number in the package is not valid - I installed SSIS 2008 (SSIS, SSRS, SSAS....not the database engine) on a server that already had SQL Server 2005 installed...

Frequently used DMV's & DBCC commands..... - Hi Guys I'm looking FOR DMVs & DBCC commands that day to day DBA needs.... which are more frequently used.... Please help...


Data Warehousing : Integration Services

SSIS - Decrypting PGP files from a child package - Hi All Not sure if this is the most appropriate place to post but wondering if someone can assist with this...


SQL Server 2005 : Backups

log_reuse_wait_desc = replication but there's no replication - I have a database whose log file keeps growing. I checked the sys.databases log_reuse_wait_desc column and it shows "replication" as...

This email has been sent to {user_email}. To be removed from this list, please click here.
If you have any problems leaving the list, please contact the webmaster@sqlservercentral.com.
This newsletter was sent to you because you signed up at SQLServerCentral.com.
Feel free to forward this to any colleagues that you think might be interested.
If you have received this email from a colleague, you can register to receive it here.
This transmission is ©2018 Redgate Software Ltd, Newnham House, Cambridge Business Park, Cambridge, CB4 0WZ, United Kingdom. All rights reserved.
Contact: webmaster@sqlservercentral.com