• I would set up a query of the session and request DMVs to run at the same time as your troubled agent job. Capture data every 5 to 15 seconds and spill it to a table or a file. Then examine that data the next morning after an issue. This will help you see if something is happening in the environment that is causing the issue. You could use something like sp_whoisactive if you don't want to write your own query