Reducing the size of the distribution database & improving transactional replication performance: Part 2

The impact of having data sitting in the distribution database that doesn’t need to be there can be significant. The aim of this series of articles is show you how to keep the distribution database as small (and efficient) as possible, and to keep the impact of the clean-up job to a minimum. Each of the steps outlined will help reduce the data that is unnecessarily held in the distribution database, and free up resources for other things.

Part 1 of this article focussed on the intricacies of SQL Server transactional replication, and made the observation that the clean-up mechanism treats publications as a collective rather than individuals. Part 1 looked into publication settings, and part 2 looks at how distribution job schedules can cause unnecessary bloating in the distribution database.

When the distribution clean-up job runs it executes the sp_MSdistribution_cleanup procedure. This procedure then executes a number of other procedures as part of the clean-up process. The most important procedure in terms of what will actually be removed is sp_MSmaximum_cleanup_seqno. This procedure finds the maximum transaction sequence number to be removed from the distribution DB, and uses the following process to find this:

Find the most recent synchronisation for each publication in the MSdistribution_history table.*
Find the oldest of all of the synchronisations from step 1.
Find the most recent transaction from MSrepl_transactions that is older than the synchronisation from step 2. This is the latest transaction to remove from the distribution DB.

* The MSdistribution_history table holds details of all synchronisations within the transaction retention period (default of 72 hours). If any distribution agent has not synchronised during this time then the oldest value in MSsubscriptions is used for that distribution agent.

Differing distribution schedules

If, for example, you have two distributions, one with a schedule of once a minute, and one with a schedule of once a day you will see something similar to the following:

Find the most recent synchronisation for each publicationin the MSdistribution_history table.
- You will have one synchronisation up to one minute old, and one synchronisation up to one day old.
Find the oldest of the synchronisations from step 1.
- This will most likely be the synchronisation up to one day old.
Find the most recent transaction that is older than the synchronisation from step 2. This is the latest transaction to remove from the distribution DB.

In this specific example the MSrepl_transactions and MSrepl_commands tables will only be cleaned up once per day. For the rest of the day these tables will be filling up and the clean-up job will be running, reading more data and taking more resources each time it runs, but deleting nothing.

The resolution to this is simple; just set all of the distribution job schedules to be the same or similar.

Part 3 of the series focuses on replicating intensive stored procedure executions rather than the outcome of the execution.

The post Reducing the size of the distribution database & improving transactional replication performance: Part 2 appeared first on BI Design.

Book Review: Big Red - Voyage of a Trident Submarine

by Andy Warren

SQLServerCentral.com

Blogs

I've grown up reading Tom Clancy and probably most of you have at least seen Red October, so this book caught my eye when browsing used books for a recent trip. It's a fairly human look at what's involved in sailing on a Trident missile submarine...

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-03-10

1,439 reads

Database Mirroring FAQ: Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup?

by Robert Davis

SQLServerCentral.com

Blogs

Question: Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup? This question was sent to me via email. My reply follows. Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup? Databases to be mirrored are currently running on 2005 SQL instances but will be upgraded to 2008 SQL in the near future.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-23

1,567 reads

Inserting Markup into a String with SQL

by Phil Factor

SQLServerCentral.com

T-SQL

In which Phil illustrates an old trick using STUFF to intert a number of substrings from a table into a string, and explains why the technique might speed up your code...

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-18

1,631 reads

Networking - Part 4

by Andy Warren

SQLServerCentral.com

Blogs

You may want to read Part 1 , Part 2 , and Part 3 before continuing. This time around I'd like to talk about social networking. We'll start with social networking. Facebook, MySpace, and Twitter are all good examples of using technology to let...

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-17

1,530 reads

Speaking at Community Events - More Thoughts

by Andy Warren

SQLServerCentral.com

Blogs

Last week I posted Speaking at Community Events - Time to Raise the Bar?, a first cut at talking about to what degree we should require experience for speakers at events like SQLSaturday as well as when it might be appropriate to add additional focus/limitations on the presentations that are accepted. I've got a few more thoughts on the topic this week, and I look forward to your comments.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-13

360 reads

Reducing the size of the distribution database & improving transactional replication performance: Part 2

Differing distribution schedules

Rate

Share

Share

Rate

Reducing the size of the distribution database & improving transactional replication performance: Part 2

Differing distribution schedules

Rate

Share

Share

Rate

Related content

Book Review: Big Red - Voyage of a Trident Submarine

Database Mirroring FAQ: Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup?

Inserting Markup into a String with SQL

Networking - Part 4

Speaking at Community Events - More Thoughts

Cookies on SQLServerCentral