One node cannot come up in a SQL Server failover cluster

  • We have a two-node cluster running Windows Server 2012 R2 and SQL Server 2012 Enterprise Edition.

    We can move roles from one node to the other and back again without any issues.

    But we have a problem when one node is restarted: we cannot start the cluster service on that node from Failover Cluster Manager.

    The error details are shown below in double quotes.

    "Cluster node NODE1 could not join the cluster because it failed to communicate over the network with any other node in the cluster. Verify the network connectivity and configuration of any network firewalls."

    I checked Windows Firewall. It is turned off on Node1, Node2, the SAN and the DC.

    I have disabled and re-enabled the internal and private network on Node1. I have also validated the cluster, and it shows no errors.

    Node1:

    Public IP: 10.10.0.11

    Subnet Mask: 255.255.255.0

    Default Gateway: 10.10.0.1

    Preferred DNS: 10.10.0.10 (IP of DNS)

    Private IP: 10.10.0.5

    Subnet Mask: 255.0.0.0

    Node2:

    Public IP: 10.10.0.12

    Subnet Mask: 255.255.255.0

    Default Gateway: 10.10.0.1

    Preferred DNS: 10.10.0.10 (IP of DNS)

    Private Network

    IP: 10.10.0.6

    Subnet Mask: 255.0.0.0

    SAN:

    Public IP: 10.10.0.13

    Subnet Mask: 255.255.255.0

    Default Gateway: 10.10.0.1

    Preferred DNS: 10.10.0.10 (IP of DNS)

    Private Network: Not configured.

    Pinging each IP from one node to the other is successful.

    Could you please suggest how to solve this issue?

  • Is any antivirus software installed and running?

    Are the IP addresses static and reserved, or dynamic?

    Make sure that the IP addresses are statically configured and reserved for the SQL cluster.

    Please attach a screenshot of the affected node and its network settings if possible.

    Is the cluster service running on both nodes?
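
A successful ping only shows that ICMP gets through between the nodes. The cluster service communicates over port 3343, so it may be worth testing that port as well. The following is a rough Python sketch, not a definitive diagnostic: it checks TCP only (the cluster service also uses UDP 3343, which this does not cover), and `tcp_port_open` is a hypothetical helper name chosen for illustration.

```python
import socket

# Port used by the Windows failover cluster service (TCP and UDP).
CLUSTER_PORT = 3343


def tcp_port_open(host, port=CLUSTER_PORT, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection raises OSError on refusal, timeout or no route.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


# Example use from Node1, with the Node2 addresses posted in this thread:
#   tcp_port_open("10.10.0.12")  # public network
#   tcp_port_open("10.10.0.6")   # private network
```

If the check fails on one network but succeeds on the other, that would point at the adapter, routing or filtering on the failing network rather than at the cluster configuration itself.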

  • Please attach the cluster validation report.

  • Dear Sir,

    I have attached snapshots of the IP configuration of Node1, Node2 and the DC.

    Node1 and Node2 each have both a private and a public IP.

    I have now recreated the cluster and reinstalled the active/active SQL Server cluster on the two nodes. I can move roles and services from one node to the other. But if I restart either node, or stop the cluster service on either node, I cannot start the cluster service on that node again, even after the node is back up.

    I hope the information given is sufficient. If you need further information, please let me know. I am waiting for a solution. Thanks in advance.

  • Hi

    Thanks for the feedback.

    I see that the subnet masks for your public and private IPs are different. Can you please use the same subnet mask, 255.255.255.0, for both public and private on both nodes, then restart each node separately and test again?

  • May I also ask whether you can use different IP ranges? It shouldn't have any impact on your underlying problem, but it's recommended to have the public IPs in the 192.***.*** range and the private (heartbeat) IPs in the 10.0.0.1 range.
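
To make the subnet and IP range points concrete, here is a minimal Python sketch using only the standard-library `ipaddress` module. It checks whether the public and private networks posted for Node1 overlap. The function name `networks_overlap` is made up for illustration; the addresses and masks are the ones posted earlier in this thread.

```python
import ipaddress


def networks_overlap(ip_a, mask_a, ip_b, mask_b):
    """Return True if the two interfaces' networks share any addresses."""
    net_a = ipaddress.ip_interface(f"{ip_a}/{mask_a}").network
    net_b = ipaddress.ip_interface(f"{ip_b}/{mask_b}").network
    return net_a.overlaps(net_b)


# Node1 values as posted in the thread.
public_ip, public_mask = "10.10.0.11", "255.255.255.0"
private_ip, private_mask = "10.10.0.5", "255.0.0.0"

# 10.10.0.5 with mask 255.0.0.0 is in 10.0.0.0/8, which contains 10.10.0.0/24.
print(networks_overlap(public_ip, public_mask, private_ip, private_mask))  # prints True
```

With the posted values the two networks overlap, so both adapters on each node effectively sit on the same subnet. Failover clustering generally expects each adapter on a node to be on its own subnet, so this overlap may be worth ruling out before looking elsewhere.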

  • keshab.basnet (7/24/2015)


    Dear Sir,

    I have attached snapshots of the IP configuration of Node1, Node2 and the DC.

    Node1 and Node2 each have both a private and a public IP.

    I have now recreated the cluster and reinstalled the active/active SQL Server cluster on the two nodes. I can move roles and services from one node to the other. But if I restart either node, or stop the cluster service on either node, I cannot start the cluster service on that node again, even after the node is back up.

    I hope the information given is sufficient. If you need further information, please let me know. I am waiting for a solution. Thanks in advance.

    "reinstalled the active/active SQL Server cluster"

    "if I restart either node, or stop the cluster service on either node"

    Technically speaking, if you stop the cluster service it affects SAN storage availability on both nodes. If the SAN storage is unmanaged, it can prevent the SQL services from starting. Never stop the cluster service without first initiating a failover to the other node. Please run cluster validation for both storage and network devices and then send us the report. I suspect that there might be storage issues in your cluster.

  • keshab.basnet (7/23/2015)


    But we have a problem when one node is restarted: we cannot start the cluster service on that node from Failover Cluster Manager.

    The cluster service should automatically restart on the rebooted node. Are you saying this isn't happening, and you're trying to start it remotely via Failover Cluster Manager?

    keshab.basnet (7/23/2015)


    "Cluster node NODE1 could not join the cluster because it failed to communicate over the network with any other node in the cluster. Verify the network connectivity and configuration of any network firewalls."

    I checked Windows Firewall. It is turned off on Node1, Node2, the SAN and the DC.

    I have disabled and re-enabled the internal and private network on Node1. I have also validated the cluster, and it shows no errors.

    What errors, if any, do you see in the cluster event log?

    keshab.basnet (7/23/2015)


    Node1:

    Public IP: 10.10.0.11

    Subnet Mask: 255.255.255.0

    Default Gateway: 10.10.0.1

    Preferred DNS: 10.10.0.10 (IP of DNS)

    Private IP: 10.10.0.5

    Subnet Mask: 255.0.0.0

    Node2:

    Public IP: 10.10.0.12

    Subnet Mask: 255.255.255.0

    Default Gateway: 10.10.0.1

    Preferred DNS: 10.10.0.10 (IP of DNS)

    Private Network

    IP: 10.10.0.6

    Subnet Mask: 255.0.0.0

    SAN:

    Public IP: 10.10.0.13

    Subnet Mask: 255.255.255.0

    Default Gateway: 10.10.0.1

    Preferred DNS: 10.10.0.10 (IP of DNS)

    Private Network: Not configured.

    Pinging each IP from one node to the other is successful.

    Could you please suggest how to solve this issue?

    Are you using iSCSI storage devices?

    Is this a test system?

    -----------------------------------------------------------------------------------------------------------

    "Ya can't make an omelette without breaking just a few eggs" 😉

  • Yes sir, I am using iSCSI storage devices. I have a separate Windows server acting as the SAN. During cluster validation there was not a single warning related to the SAN storage drives. I will generate the cluster validation report tomorrow and provide it to you. In the meantime, could you please suggest any possible solutions?

  • Let's see a cluster validation report first, please.


  • Also supply us with the error logs from Failover Cluster Manager; look for the option called Critical Events.

  • keshab.basnet, what was the solution? I am also facing the same issue.

  • Did someone add the registry key settings to disable the old TLS and/or SSL protocols (and not reboot the node at that time)?

    Johan

