Deleting Duplicate Rows

Deleting duplicate rows in SQL Server is a tedious task. Duplicate rows can exist in a table because of poor database design or because constraints were never applied. Unlike Oracle, where every row has a unique row ID (ROWID) that can be used to delete duplicate records, SQL Server offers no equally simple way to delete them.

One method is to write a stored procedure that uses a cursor to delete the rows one by one. There is a simpler, interactive way to delete duplicate rows in SQL Server that needs neither stored procedures nor cursors.

I will use Enterprise Manager for the explanation, but the same steps can be scripted if duplicate records have to be deleted from a table on a regular basis. Create a table OrderDetail with the columns ShipmentId, OrderId, ArticleId and Quantity, where the combination of ShipmentId, OrderId and ArticleId should be unique.

Add a few duplicate records to this table.
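
For readers who prefer to script the setup, a minimal sketch follows. The column types and the sample values are assumptions made for illustration; the article only names the columns.

```sql
-- Assumed column types; the article specifies only the column names.
CREATE TABLE OrderDetail (
    ShipmentId INT NOT NULL,
    OrderId    INT NOT NULL,
    ArticleId  INT NOT NULL,
    Quantity   INT NOT NULL
);

-- Illustrative sample data (hypothetical values).
INSERT INTO OrderDetail (ShipmentId, OrderId, ArticleId, Quantity) VALUES (1, 100, 10, 5);
INSERT INTO OrderDetail (ShipmentId, OrderId, ArticleId, Quantity) VALUES (1, 100, 10, 5); -- exact duplicate
INSERT INTO OrderDetail (ShipmentId, OrderId, ArticleId, Quantity) VALUES (1, 100, 11, 2);
INSERT INTO OrderDetail (ShipmentId, OrderId, ArticleId, Quantity) VALUES (2, 101, 10, 7);
INSERT INTO OrderDetail (ShipmentId, OrderId, ArticleId, Quantity) VALUES (2, 101, 10, 7); -- exact duplicate
```

Single-row INSERT statements are used so that the sketch does not depend on the multi-row VALUES syntax of newer SQL Server versions.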

Create an empty copy of the table (the WHERE 1 = 0 condition copies the column definitions but no rows) using:

Select * into OrderDetailCopy from OrderDetail where 1 = 0

In Enterprise Manager, create a unique index on the columns ShipmentId, OrderId and ArticleId of OrderDetailCopy, and check the Ignore duplicate key checkbox. Save the table.
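
If the procedure is to be scripted rather than done through Enterprise Manager, the index step can probably be expressed with CREATE UNIQUE INDEX and the IGNORE_DUP_KEY option. This is a sketch, not the article's own script; the index name IX_OrderDetailCopy_Key is a hypothetical name chosen for illustration.

```sql
-- Unique index that skips duplicate keys on insert instead of failing the statement.
-- IX_OrderDetailCopy_Key is a hypothetical name.
CREATE UNIQUE INDEX IX_OrderDetailCopy_Key
    ON OrderDetailCopy (ShipmentId, OrderId, ArticleId)
    WITH (IGNORE_DUP_KEY = ON);

-- On SQL Server 2000 the option is written in the older form:
--     WITH IGNORE_DUP_KEY
```

Rows count as duplicates on the three key columns only. If two rows share a key but differ in Quantity, only one of them is kept, and which one should not be relied on.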

Copy the records from OrderDetail into OrderDetailCopy using:

insert into OrderDetailCopy Select * from OrderDetail

The insert succeeds, skipping the duplicate rows, and you will get a warning message:

Server: Msg 3604, Level 16, State 1, Line 1
Duplicate key was ignored.

The OrderDetailCopy table now contains no duplicate rows.
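
Before dropping the original table, it may be worth confirming the result. The query below is a simple check, assuming the key columns named earlier; it lists any key that still occurs more than once.

```sql
-- Lists keys that appear more than once; an empty result means no duplicates remain.
SELECT ShipmentId, OrderId, ArticleId, COUNT(*) AS Copies
FROM OrderDetailCopy
GROUP BY ShipmentId, OrderId, ArticleId
HAVING COUNT(*) > 1;
```

Running the same query against OrderDetail beforehand shows which keys were duplicated in the first place.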

Drop the original OrderDetail table using:

DROP TABLE OrderDetail

Rename OrderDetailCopy to OrderDetail using:

EXEC sp_rename 'OrderDetailCopy', 'OrderDetail'
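
When these last two steps are scripted, one option is to wrap them in a transaction so that the table is not left dropped if the rename fails. This is a sketch under that assumption, not part of the original procedure.

```sql
-- Swap the de-duplicated copy in for the original table as one unit of work.
SET XACT_ABORT ON;  -- roll back the whole transaction on a run-time error

BEGIN TRANSACTION;

DROP TABLE OrderDetail;
EXEC sp_rename 'OrderDetailCopy', 'OrderDetail';

COMMIT TRANSACTION;
```

SELECT ... INTO copies columns and data only, so any constraints, triggers, permissions or other indexes defined on the original OrderDetail table have to be recreated on the renamed table.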
