• It takes a little more time, but changing the exception handling process for a job from "wait for users to notify the right person" to "have SQL Server notify the right person" is definitely worthwhile.

    I have checks for jobs that haven't run or haven't finished in a reasonable period of time but we can always use more. It's the irregular processes that run once a month that are hard to pin down.