SQL Agent job terminated unexpectedly

  • Hi,

    We have 2 node active/passive cluster setup for SQL Server 2005 SP3 on Windows 2003 x64 with SP2

    A scheduled SQL Agent backup job has failed and after investigation, I found the below information and errors from system event logs & Application event logs:

    Here, the SQL Service is restarted but it's NOT failover to passive node:hehe:

    System Event viewer info & errors:

    Event Type:Information

    Event Source:b06bdrv

    Event Category:None

    Event ID:26

    Date:8/16/2010

    Time:3:11:32 PM

    User:N/A

    Computer:SQL1

    Description:

    \Device\NTPNP_PCI0035: Ndis device unbound successfully.

    Event Type:Warning

    Event Source:DnsApi

    Event Category:None

    Event ID:11197

    Date:8/16/2010

    Time:3:11:32 PM

    User:N/A

    Computer:SQL1

    Description:

    The system failed to update and remove host (A) resource records (RRs) for network adapter

    with settings:

    Adapter Name : {AFD706B1-1159-4C90-B45E-B96890E67F30}

    Host Name : SQL1

    Primary Domain Suffix : abc.com

    DNS server list :

    10.10.10.10, 20.20.20.20

    Sent update to server : <?>

    IP Address(es) :

    10.10.10.10

    The reason the update request failed was because of a system problem.

    Event Type:Error

    Event Source:ClusSvc

    Event Category:IP Address Resource

    Event ID:1077

    Date:8/16/2010

    Time:3:11:34 PM

    User:N/A

    Computer:SQL1

    Description:

    The TCP/IP interface for Cluster IP Address 'SQL IP Address 1 (SQLVS)' has failed

    Event Type:Error

    Event Source:ClusSvc

    Event Category:IP Address Resource

    Event ID:1077

    Date:8/16/2010

    Time:3:11:34 PM

    User:N/A

    Computer:SQL1

    Description:

    The TCP/IP interface for Cluster IP Address 'Cluster IP Address' has failed.

    Event Type:Error

    Event Source:ClusSvc

    Event Category:Failover Mgr

    Event ID:1069

    Date:8/16/2010

    Time:3:11:34 PM

    User:N/A

    Computer:SQL1

    Description:

    Cluster resource 'SQL IP Address 1 (SQLVS)' in Resource Group 'SQLGroup ' failed.

    For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.

    Event Type:Error

    Event Source:ClusSvc

    Event Category:Failover Mgr

    Event ID:1069

    Date:8/16/2010

    Time:3:42:12 PM

    User:N/A

    Computer:SQL1

    Description:

    Cluster resource 'Cluster IP Address' in Resource Group 'Cluster Group' failed.

    Application Event viewer errors:.

    Event Type:Error

    Event Source:MSSQL$INS1

    Event Category:(3)

    Event ID:19019

    Date:8/16/2010

    Time:3:11:32 PM

    User:N/A

    Computer:SQL1

    Description:

    [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed

    Event Type:Error

    Event Source:MSSQL$INS1

    Event Category:(3)

    Event ID:19019

    Date:8/16/2010

    Time:3:11:32 PM

    User:N/A

    Computer:SQL1

    Description:

    [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

    Event Type:Error

    Event Source:MSSQL$INS1

    Event Category:(3)

    Event ID:19019

    Date:8/16/2010

    Time:3:11:32 PM

    User:N/A

    Computer:SQL1

    Description:

    [sqsrvres] OnlineThread: QP is not online.

    Same errors are logged in Cluster log file in C:\Windows\Cluster

    This was happened only once so far. Please advice what will be cause for this? Is it something to do with SQL Server or Cluster?

    Thanks

  • gmamata7 (8/18/2010)


    Hi,

    We have 2 node active/passive cluster setup for SQL Server 2005 SP3 on Windows 2003 x64 with SP2

    A scheduled SQL Agent backup job has failed and after investigation, I found the below information and errors from system event logs & Application event logs:

    Here, the SQL Service is restarted but it's NOT failover to passive node:hehe:

    System Event viewer info & errors:

    Event Type:Information

    Event Source:b06bdrv

    Event Category:None

    Event ID:26

    Date:8/16/2010

    Time:3:11:32 PM

    User:N/A

    Computer:SQL1

    Description:

    \Device\NTPNP_PCI0035: Ndis device unbound successfully.

    Event Type:Warning

    Event Source:DnsApi

    Event Category:None

    Event ID:11197

    Date:8/16/2010

    Time:3:11:32 PM

    User:N/A

    Computer:SQL1

    Description:

    The system failed to update and remove host (A) resource records (RRs) for network adapter

    with settings:

    Adapter Name : {AFD706B1-1159-4C90-B45E-B96890E67F30}

    Host Name : SQL1

    Primary Domain Suffix : abc.com

    DNS server list :

    10.10.10.10, 20.20.20.20

    Sent update to server : <?>

    IP Address(es) :

    10.10.10.10

    The reason the update request failed was because of a system problem.

    Event Type:Error

    Event Source:ClusSvc

    Event Category:IP Address Resource

    Event ID:1077

    Date:8/16/2010

    Time:3:11:34 PM

    User:N/A

    Computer:SQL1

    Description:

    The TCP/IP interface for Cluster IP Address 'SQL IP Address 1 (SQLVS)' has failed

    Event Type:Error

    Event Source:ClusSvc

    Event Category:IP Address Resource

    Event ID:1077

    Date:8/16/2010

    Time:3:11:34 PM

    User:N/A

    Computer:SQL1

    Description:

    The TCP/IP interface for Cluster IP Address 'Cluster IP Address' has failed.

    Event Type:Error

    Event Source:ClusSvc

    Event Category:Failover Mgr

    Event ID:1069

    Date:8/16/2010

    Time:3:11:34 PM

    User:N/A

    Computer:SQL1

    Description:

    Cluster resource 'SQL IP Address 1 (SQLVS)' in Resource Group 'SQLGroup ' failed.

    For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.

    Event Type:Error

    Event Source:ClusSvc

    Event Category:Failover Mgr

    Event ID:1069

    Date:8/16/2010

    Time:3:42:12 PM

    User:N/A

    Computer:SQL1

    Description:

    Cluster resource 'Cluster IP Address' in Resource Group 'Cluster Group' failed.

    Application Event viewer errors:.

    Event Type:Error

    Event Source:MSSQL$INS1

    Event Category:(3)

    Event ID:19019

    Date:8/16/2010

    Time:3:11:32 PM

    User:N/A

    Computer:SQL1

    Description:

    [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed

    Event Type:Error

    Event Source:MSSQL$INS1

    Event Category:(3)

    Event ID:19019

    Date:8/16/2010

    Time:3:11:32 PM

    User:N/A

    Computer:SQL1

    Description:

    [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

    Event Type:Error

    Event Source:MSSQL$INS1

    Event Category:(3)

    Event ID:19019

    Date:8/16/2010

    Time:3:11:32 PM

    User:N/A

    Computer:SQL1

    Description:

    [sqsrvres] OnlineThread: QP is not online.

    Same errors are logged in Cluster log file in C:\Windows\Cluster

    This was happened only once so far. Please advice what will be cause for this? Is it something to do with SQL Server or Cluster?

    Thanks

    Did all sql agent jobs failed or just this one?

    In the cluster administrator check if you have resources setup properly for failover.

    Alex S
  • Only one job is failed. Because that is the only job running at that time and there are NO other jobs running

    and all settings are fine. Thanks

  • gmamata7 (8/18/2010)


    Only one job is failed. Because that is the only job running at that time and there are NO other jobs running

    And all settings are fine. Thanks

    Can you recreate the job and execute it?

    Alex S

Viewing 4 posts - 1 through 4 (of 4 total)

You must be logged in to reply to this topic. Login to reply