Monitoring and Alerting

Monitoring your systems is important. It's not just me that thinks so, as plenty of experienced DBAs and developers know the value of monitoring. Heck, most people have learned to build some sort of metric collection into their software. Azure makes it easy to instrument your application and gather lots of data on how well things are working. Perhaps too easy to gather too much data and then you pay for it, or can't find time to analyze it. High performing software development shops use monitoring in their Continuous Integration (CI) and Continuous Delivery (CD) pipelines to better understand the health of their code and speed of their workflow, in addition to instrumenting the actual application.

For those of us that need to ensure our database servers are running well, we not only need monitoring, but also alerting. I ran across a couple articles that have thoughts about monitoring and the difference between monitoring and alerting. While I don't completely agree with all the items in the second piece, I do think that it's important that you get alerting working well.

I've had more than my share of un-actionable alerts, or even unnecessary alerts in my career. These days I've learned to better classify those items that matter to me. Most of the time what I find myself doing is downgrading most alerts because very few are actually mission critical. Far too often I've worried about 100% CPU or slow log writes or even zero sales in an hour or some other metric that "seems" critical. However, since few of these alerts stop business from flowing, I've learned to lower their priority or just remove them as alerts and allowing monitoring to track the values. I do need to watch the monitoring and fix issues, but I don't need to get up at 3am.

The other thing I've worked to do is automate responses to problems. If I know there are ways a computer can respond, let it. Don't get a human involved if the system can manage itself. Certainly the automated solutions don't always work, but have some escalation built in that only alerts a human after the system has exhausted its own responses. After all, we don't want to exhaust humans if we don't need to do so.

The Devil is in the Monitoring Details

by Steve Jones

SQLServerCentral.com

Editorial

Monitoring is both simple and hard.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2019-01-28

130 reads

Discuss

Mitigate Issues Early

by Steve Jones

SQLServerCentral.com

Editorial

When you know there's a problem, it's better to solve it early rather than late.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

(1)

You rated this post out of 5. Change rating

2018-11-28

59 reads

Discuss

Monitoring Costs

by Steve Jones

SQLServerCentral.com

Editorial

There are lots of costs to monitoring your systems that many people ignore.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2018-09-18

66 reads

Discuss

Be Prepared with Baselines

by Steve Jones

SQLServerCentral.com

Editorial

Analyzing performance often requires you to understand what is normal and what is not. Steve talks about the importance of baselines.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

(1)

You rated this post out of 5. Change rating

2018-06-11

76 reads

Discuss

What is the Future of Monitoring?

by Steve Jones

SQLServerCentral.com

Editorial

Monitoring your systems is critical to ensuring security and stability. Steve Jones shares a few thoughts on where monitoring might go with more computer assistance in the future.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2016-09-28

96 reads

Discuss

Monitoring and Alerting

Rate

Share

Categories

Share

Rate

Monitoring and Alerting

Rate

Share

Categories

Share

Rate

Related content

The Devil is in the Monitoring Details

Mitigate Issues Early

Monitoring Costs

Be Prepared with Baselines

What is the Future of Monitoring?