TCP/IP Error while restoring a Database through network

  • Hi, I'm unable to restore a database through network.

    Server A: Host of backup copy.Hosted in 8 node Cluster 1.

    Server B: Destination. Hosted in 8 node Cluster 2

    The process is one of our sequential jobs which places the backup copy on Server A and next step is to restore it on Server B.

    We see that database goes to restoring mode but has no progress in restoration which obviously keeps the job in running state and never finishes until someone manually kills the SPID and have to restore manually.

    Also when I manually restore, After 30% or 40% of progress TCP/IP error pops up. This job ran successfully for 8 months and suddenly failing.

    Both source and destination are in the same network and same data Centers.

    Please suggest and let me know if you need any info on the same.

    Thanks in advance.

  • Can you copy the files to the remote server & then restore? Or restore successfully on Server A?

    Sounds like a network problem rather than a SQL issue - can you or any network specialists check the NIC's & connections between the servers?

  • How can I check the NIC Connections between the server?

    If I copy the backup file on to the local machine , I can restore it successfully. But that now how our jobs are designed. This restoration is one of the steps in the jobs. We have 9 other steps apart from this. We typically cannot change the step because it has dependencies.

    Today i have tried 8 times to restore it through network. It failed 7 times. But I observed that server behaves weird when the restore is in progress. The CPU usage spikes up to 80%, Task manager does not responds and sometime SSMS does not responds.

    Our server horse power is pretty good. 16 processors with default parallelism. 74 GB RAM.

    Can anyone suggest few test scenarios so that I can test in here Like perf monitor or something. This is happening in our PROD so I need to move a little quick here.

  • Mac1986 (8/30/2012)


    If I copy the backup file on to the local machine , I can restore it successfully. But that now how our jobs are designed. This restoration is one of the steps in the jobs. We have 9 other steps apart from this. We typically cannot change the step because it has dependencies.

    No problem, just wanted to check the backup files themselves are ok - this would suggest they are.

    From what you've described, almost certain the problem is at the network level, but I'm not a networking expert so can't help you diagnose further, sorry.

    Does the NIC have any diagnostic software? Not sure perfmon can help you here.

Viewing 4 posts - 1 through 3 (of 3 total)

You must be logged in to reply to this topic. Login to reply