RE: Is there any obvious performance difference between updating 1 column in a 4 column wide table vs 1 columns in a 1000 column wide table ?

SSC Guru

Points: 1003863

December 18, 2017 at 12:55 pm

#1972579

Luis Cazares - Monday, December 18, 2017 11:01 AM
I thank you for the test, Jeff. However, I still don't believe that someone will be updating 10 million rows at a time. The most common scenario would be to update one row at a time (no RBAR just different operations).
When updating one row the reads, writes and times are very similar and only vary for less than 5ms (which I feel is insignificant).

Understood. The whole table update was meant to demonstrate that table width does matter. As for there being "less than 5ms" difference for a singleton, that's fine but only if you're not updating thousands of rows per hour. The other problem is that the table might not be used only for that one "thing".

While "It Depends" is certainly true, I've found that (most of the time) super-wide tables cause more trouble than they're worth. Another example of problems I've seen (and had to repair) is that someone thought they were doing the right thing by making a CLR for "field" level audits that could be copied to any table. It would automatically figure things out based on the content of the INSERTED and DELETED logical tables. It worked absolutely great... until they applied it to an insane 137 column wide table. The people that installed it didn't understand the two logical trigger tables had to be fully materialized to work in the CLR because they were out of scope for the CLR. It was taking more than 4 minutes to update just 4 columns on just 10,000 rows.

I'm also not trying to convince anyone to totally avoid super-wide tables (we fixed the trigger problem rather than "sister" the table because of other requirements)... Rather I'm trying to make folks understand that they can be quite the problem and that they need to test "insitu" to make sure the problems that can be present, aren't. For example, a lot of folks never test for the inevitable flush to disk. Although some would consider it to be some form of "death by pre-optimization", this type of thing is a really important consideration at design time because it's either a huge pain or an impossibility to change further down the road.

--Jeff Moden

RBAR is pronounced "ree-bar" and is a "Modenism" for Row-By-Agonizing-Row.
First step towards the paradigm shift of writing Set Based code:
________Stop thinking about what you want to do to a ROW... think, instead, of what you want to do to a COLUMN.

Change is inevitable... Change for the better is not.

Helpful Links:
How to post code problems
How to Post Performance Problems
Create a Tally Function (fnTally)