Find and Remove Duplicate Records SQL Server

Daniel Matthee Mr or Mrs. 500 Points: 579 More actions · Answer 1

andy_111 (2/1/2016)

Adrian_1 (2/1/2016)
The OP is intending that only 1 row will be deleted (because SET ROWCOUNT = 1) and therefore on the 2nd pass will not delete because there is now only 1 left...

see previous post

According to MSDN, "Using SET ROWCOUNT will not affect DELETE, INSERT, and UPDATE statements in a future release of SQL Server. Avoid using SET ROWCOUNT with DELETE, INSERT, and UPDATE statements in new development work, and plan to modify applications that currently use it. For a similar behavior, use the TOP syntax. For more information, see TOP (Transact-SQL)."

https://msdn.microsoft.com/en-us/library/ms188774.aspx

Going by the nested comment: if SET ROWCOUNT is 1 and you have 10k dups, all I will say is good luck! :-P
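For anyone wanting to see what the MSDN note's "use the TOP syntax" looks like in practice, here is a minimal sketch. The #Customers temp table and its rows are made up for illustration (the column names follow the ones used later in this thread), and it assumes SQL Server 2008 or later for the multi-row VALUES clause.

```sql
-- Hypothetical sample data, not from the original article.
CREATE TABLE #Customers (CustID int PRIMARY KEY, CustName varchar(50));
INSERT INTO #Customers (CustID, CustName)
VALUES (1, 'Ann'), (2, 'Ann'), (3, 'Bob');

-- Deprecated pattern, shown only for comparison:
--   SET ROWCOUNT 1;
--   DELETE #Customers WHERE CustName = 'Ann';
--   SET ROWCOUNT 0;

-- TOP-based equivalent: removes at most one matching row.
-- Which of the two 'Ann' rows goes is not defined.
DELETE TOP (1) FROM #Customers WHERE CustName = 'Ann';

SELECT CustID, CustName FROM #Customers ORDER BY CustID;

DROP TABLE #Customers;
```

Either way this is still one row per statement, so with thousands of duplicates it needs thousands of executions, which is the point being made above.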

--------------------------------------------------------------------------------------------------------------------------------------------------------
To some it may look best but to others it might be the worst, but then again to some it might be the worst but to others the best.
http://www.sql-sa.co.za

INCREDIBLEmouse SSC Eights! Points: 835 More actions · Answer 2

I agree with the general forum sentiment: SET ROWCOUNT, being an older, deprecated method, should obviously be avoided. I agree the windowing methods are more appropriate. I am glad it was posted though, from a general trivia perspective. Without noticing the role SET ROWCOUNT plays, one would assume from the delete query that an incorrect number of records would be removed. :w00t:

awais9981 SSC Enthusiast Points: 186 More actions · Answer 3

It is a weird solution, but I am glad to have seen a new way to remove duplicates 10 years after the introduction of ROW_NUMBER in SQL Server 2005.

Luis Cazares SSC Guru Points: 183706 More actions · Answer 4

Even the method posted on Microsoft Support is better than this ugly RBAR option with a deprecated feature.

Luis C.
General Disclaimer:
Are you seriously taking the advice and code from someone from the internet without testing it? Do you at least understand it? Or can it easily kill your server?

How to post data/code on a forum to get the best help: Option 1 / Option 2

bob_chang Grasshopper Points: 12 More actions · Answer 5

Nice post! I have a problem looking for duplicates in a similar table with "text" fields. What would that SQL look like?

Thank you,

Bob

JustASQLGuy SSC-Addicted Points: 447 More actions · Answer 6

Wouldn't this be a simpler solution and be a bit less confusing?

delete Customers
where CustID in (
    select max(CustID) -- or change to min() if wanting to keep latest
    from Customers
    group by CustName
    having count(*) > 1
)

Daniel Matthee Mr or Mrs. 500 Points: 579 More actions · Answer 7

vopipari (2/1/2016)
Wouldn't this be a simpler solution and be a bit less confusing?
delete Customers
where CustID in (
select max(CustID) -- or change to min() if wanting to keep latest
from Customers
group by CustName
having count(*) > 1
)

This will still remove ALL customer records that have a duplicate record

--------------------------------------------------------------------------------------------------------------------------------------------------------
To some it may look best but to others it might be the worst, but then again to some it might be the worst but to others the best.
http://www.sql-sa.co.za

JustASQLGuy SSC-Addicted Points: 447 More actions · Answer 8

It only picks the min or max CustID for the pair of duplicates.
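A quick way to check this is to run the query against a small throwaway table. The sketch below assumes a hypothetical #Customers temp table with the CustID and CustName columns used in the thread; the sample rows are made up.

```sql
-- Hypothetical sample data: one duplicated name, one unique name.
CREATE TABLE #Customers (CustID int PRIMARY KEY, CustName varchar(50));
INSERT INTO #Customers (CustID, CustName)
VALUES (1, 'Ann'), (2, 'Ann'), (3, 'Bob');

-- Same shape as the query under discussion, pointed at the temp table.
DELETE #Customers
WHERE CustID IN (
    SELECT MAX(CustID)      -- one CustID per duplicated name
    FROM #Customers
    GROUP BY CustName
    HAVING COUNT(*) > 1
);

SELECT CustID, CustName FROM #Customers ORDER BY CustID;
-- Remaining rows: (1, 'Ann') and (3, 'Bob')

DROP TABLE #Customers;
```

The subquery returns a single CustID for each duplicated name, so only that one row per name is deleted and the other copy survives.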

pchoiniere Newbie Points: 1 More actions · Answer 9

Microsoft said "Using SET ROWCOUNT will not affect DELETE, INSERT, and UPDATE statements in a future release of SQL Server. Avoid using SET ROWCOUNT with DELETE, INSERT, and UPDATE statements in new development work, and plan to modify applications that currently use it. For a similar behavior, use the TOP syntax."

INCREDIBLEmouse SSC Eights! Points: 835 More actions · Answer 10

vopipari (2/1/2016)
It only picks the min or max CustID for the pair of duplicates.

Adrian_1 SSC Veteran Points: 280 More actions · Answer 11

February 1, 2016 at 8:23 am (#1855696)

but it won't tidy up triplicates....

JustASQLGuy SSC-Addicted Points: 447 More actions · Answer 12

Adrian_1 (2/1/2016)
but it won't tidy up triplicates....

True, I was only thinking of duplicates. This fixes that though:

delete Customers
where CustName in (
    select CustName
    from Customers
    group by CustName
    having count(*) > 1
)
and CustID not in (
    select max(CustID) -- keeps latest, change to min() if wanting to keep the first
    from Customers
    group by CustName
    having count(*) > 1
);
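To see how this version behaves with more than two copies, here is a small test run. The #Customers temp table and its rows are hypothetical; only the CustID and CustName column names come from the thread.

```sql
-- Hypothetical sample data: a pair, a unique name, and a triplicate.
CREATE TABLE #Customers (CustID int PRIMARY KEY, CustName varchar(50));
INSERT INTO #Customers (CustID, CustName)
VALUES (1, 'Ann'), (2, 'Ann'), (3, 'Bob'), (4, 'Cy'), (5, 'Cy'), (6, 'Cy');

DELETE #Customers
WHERE CustName IN (
    SELECT CustName          -- names that occur more than once
    FROM #Customers
    GROUP BY CustName
    HAVING COUNT(*) > 1
)
AND CustID NOT IN (
    SELECT MAX(CustID)       -- the one row per name to keep
    FROM #Customers
    GROUP BY CustName
    HAVING COUNT(*) > 1
);

SELECT CustID, CustName FROM #Customers ORDER BY CustID;
-- Remaining rows: (2, 'Ann'), (3, 'Bob'), (6, 'Cy')

DROP TABLE #Customers;
```

One thing to watch with NOT IN: if the key column could contain NULLs the predicate can behave unexpectedly. Here CustID is assumed to be a non-nullable key, so it is not an issue.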

hjp Default port Points: 1437 More actions · Answer 13

bob_chang (2/1/2016)
Nice post! I have a problem looking for duplicates in a similar table with "text" fields. What would that SQL look like?
Thank you,
Bob

The same... The answers given here are all generic. It doesn't matter what data types you are dealing with.
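One possible caveat: if "text" here means the legacy text or ntext data types (an assumption; the question may simply mean character columns), SQL Server will not compare or sort those types directly, so they cannot go straight into GROUP BY or PARTITION BY. Casting to varchar(max) or nvarchar(max) is one way around that. The sketch below uses a hypothetical #Notes table with made-up rows.

```sql
-- Hypothetical table with a legacy text column.
CREATE TABLE #Notes (NoteID int PRIMARY KEY, Body text);
INSERT INTO #Notes (NoteID, Body)
VALUES (1, 'same'), (2, 'same'), (3, 'other');

-- Cast the text column so it can be used for partitioning.
WITH cte AS (
    SELECT NoteID,
           ROW_NUMBER() OVER (PARTITION BY CAST(Body AS varchar(max))
                              ORDER BY NoteID) AS rn
    FROM #Notes
)
DELETE FROM cte WHERE rn > 1;

SELECT NoteID FROM #Notes ORDER BY NoteID;
-- Remaining NoteID values: 1 and 3

DROP TABLE #Notes;
```

For ordinary char, varchar, or nvarchar columns no cast is needed and the queries in this thread apply as posted.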

Adrian_1 SSC Veteran Points: 280 More actions · Answer 14

Personally I prefer

WITH cte AS (
    SELECT a.*,
           ROW_NUMBER() OVER (PARTITION BY field1, field2 ORDER BY field1, field2) AS rrn
    FROM dbo.file_with_duplicates a
)
DELETE FROM cte WHERE rrn > 1;
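Because the ORDER BY uses the same columns as the PARTITION BY, every row in a group ties, so which copy ends up with rrn = 1 is arbitrary. That is fine when the rows are identical in every column. If the table has a distinguishing column, ordering by it makes the survivor predictable. The sketch below assumes a hypothetical id column and made-up rows; field1 and field2 are the names from the post.

```sql
-- Hypothetical sample data with an id column to break ties.
CREATE TABLE #file_with_duplicates (
    id     int PRIMARY KEY,
    field1 varchar(20),
    field2 varchar(20)
);
INSERT INTO #file_with_duplicates (id, field1, field2)
VALUES (1, 'a', 'x'), (2, 'a', 'x'), (3, 'a', 'x'), (4, 'b', 'y');

-- Number the rows within each (field1, field2) group, lowest id first.
WITH cte AS (
    SELECT id, field1, field2,
           ROW_NUMBER() OVER (PARTITION BY field1, field2 ORDER BY id) AS rrn
    FROM #file_with_duplicates
)
DELETE FROM cte WHERE rrn > 1;   -- keeps the lowest id in each group

SELECT id, field1, field2 FROM #file_with_duplicates ORDER BY id;
-- Remaining rows: (1, 'a', 'x') and (4, 'b', 'y')

DROP TABLE #file_with_duplicates;
```

Switching to ORDER BY id DESC would keep the highest id instead. This handles pairs, triplicates, and larger groups in a single statement.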

Kenneth Igiri SSCertifiable Points: 5004 More actions · Answer 15

JustASQLGuy (2/1/2016)
Wouldn't this be a simpler solution and be a bit less confusing?
delete Customers
where CustID in (
select max(CustID) -- or change to min() if wanting to keep latest
from Customers
group by CustName
having count(*) > 1
)

Once I read the post, the issue of deleting ALL records hit me, because I have been in this kind of situation before. Take care to back up the table as an exported copy before trying this out on production. The above query by JustASQLGuy comes closer to a solution and, if run multiple times in a WHILE loop, could clear all entries occurring multiple times. I haven't tested this theory though.
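A rough sketch of that WHILE loop idea, for anyone who wants to try it on scratch data first. The #Customers temp table, its rows, and the @deleted variable are hypothetical, and the inline DECLARE initialisation assumes SQL Server 2008 or later.

```sql
-- Hypothetical sample data: a pair, a unique name, and a triplicate.
CREATE TABLE #Customers (CustID int PRIMARY KEY, CustName varchar(50));
INSERT INTO #Customers (CustID, CustName)
VALUES (1, 'Ann'), (2, 'Ann'), (3, 'Bob'), (4, 'Cy'), (5, 'Cy'), (6, 'Cy');

DECLARE @deleted int = 1;

-- Repeat the single-pass delete until a pass removes nothing.
WHILE @deleted > 0
BEGIN
    DELETE #Customers
    WHERE CustID IN (
        SELECT MAX(CustID)
        FROM #Customers
        GROUP BY CustName
        HAVING COUNT(*) > 1
    );
    SET @deleted = @@ROWCOUNT;   -- rows removed by this pass
END;

SELECT CustID, CustName FROM #Customers ORDER BY CustID;
-- Remaining rows: (1, 'Ann'), (3, 'Bob'), (4, 'Cy')

DROP TABLE #Customers;
```

Each pass removes one row per duplicated name, so the loop runs roughly as many times as the largest group has copies, plus a final pass that deletes nothing. On a large table a single set-based delete is likely the cheaper option.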

Br. Kenneth Igiri
https://kennethigiri.com
All nations come to my light, all kings to the brightness of my rising
