Inserting missing rows

I can think of three ways to insert just the missing rows from one table or query into another table.

The first way is so ugly that I hate to even bring it up, yet I know that there are developers and (gasp) DBAs using this very technique. Basically, you create a cursor to loop through each row of the first table and check, one row at a time, to see if the row exists in the second table before inserting it. This is so inefficient, I'm not giving a code example.

The second way is better: Use a sub-query in the WHERE clause along with the NOT IN operator. Several years ago when I was working on my Master's degree, I took a course in the SQL language using Oracle. I'm not sure if sub-queries and especially correlated sub-queries are better in Oracle than SQL Server, but the professor taught us to use this method to solve almost every problem. Needless to say, a lot of my old code uses this approach as well. This method gets a little more complicated with multi-valued primary keys. I recently ran across an example that had two sub-queries in the WHERE clause.

Here is an example of what it looked like and it does work (Note: after posting this and riding into work, I realized that this will not work in all cases. I have seen it used in production; I suspect the primary key was actually defined incorrectly):

insert into tableB(PK_Col1, PK_Col2, Col3, Col4)
select a.PK_Col1, a.PK_Col2, a.Col3, a.Col4
from tableA as a
where a.PK_Col1 not in (select PK_Col1 from tableB)
and a.PK_Col2 not in (select PK_col2 from tableB)

The "not in" method does work with a one-column primary key.

The third method takes advantage of the OUTER JOIN syntax. By joining the two tables with this syntax, you can retrieve all of the rows in tableA regardless of whether there is a match in tableB. For those rows where there isn't a match, all of the columns in tableB will be null. By selecting only those rows where the values in tableB are null you have found your missing rows.

Here is an example:

insert into tableB (PK_Col1, PK_Col2, Col3, Col4)
select a.PK_Col1, a.PK_Col2, a.Col3, a.Col4
from tableA as a left outer join tableB as b
on a.PK_Col1 = b.PK_Col1 and a.PK_Col2 = a.PK_Col2
where b.PK_Col1 is null

While I didn't find much difference in the estimated query plans between the two with just a small amount of sample data, it would be interesting to see what would happen with millions of rows. Both methods did two clustered index scans. I think the LEFT OUTER JOIN method would be much more efficient with a large amount of data.

Book Review: Big Red - Voyage of a Trident Submarine

by Andy Warren

SQLServerCentral.com

Blogs

I've grown up reading Tom Clancy and probably most of you have at least seen Red October, so this book caught my eye when browsing used books for a recent trip. It's a fairly human look at what's involved in sailing on a Trident missile submarine...

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-03-10

1,439 reads

Database Mirroring FAQ: Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup?

by Robert Davis

SQLServerCentral.com

Blogs

Question: Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup? This question was sent to me via email. My reply follows. Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup? Databases to be mirrored are currently running on 2005 SQL instances but will be upgraded to 2008 SQL in the near future.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-23

1,567 reads

Inserting Markup into a String with SQL

by Phil Factor

SQLServerCentral.com

T-SQL

In which Phil illustrates an old trick using STUFF to intert a number of substrings from a table into a string, and explains why the technique might speed up your code...

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-18

1,631 reads

Networking - Part 4

by Andy Warren

SQLServerCentral.com

Blogs

You may want to read Part 1 , Part 2 , and Part 3 before continuing. This time around I'd like to talk about social networking. We'll start with social networking. Facebook, MySpace, and Twitter are all good examples of using technology to let...

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-17

1,530 reads

Speaking at Community Events - More Thoughts

by Andy Warren

SQLServerCentral.com

Blogs

Last week I posted Speaking at Community Events - Time to Raise the Bar?, a first cut at talking about to what degree we should require experience for speakers at events like SQLSaturday as well as when it might be appropriate to add additional focus/limitations on the presentations that are accepted. I've got a few more thoughts on the topic this week, and I look forward to your comments.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-13

360 reads

Inserting missing rows

Rate

Share

Share

Rate

Inserting missing rows

Rate

Share

Share

Rate

Related content

Book Review: Big Red - Voyage of a Trident Submarine

Database Mirroring FAQ: Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup?

Inserting Markup into a String with SQL

Networking - Part 4

Speaking at Community Events - More Thoughts