We have a SQL Server setup with Availaibility Groups, in the cluster we have four nodes, two in one datacenter and two in another datacenter.
There are four AGs on these cluster nodes, each primarially on one of the four nodes, with one other node in the same datacenter for syncronous HA and the other two for asyncronous in the other datacenter for DR.
We have one listener for each AG.
One AG seems to automatically failover at certain times due to various Network issues, however the AG seems to failover fine - the HA secondary becomes primary and all databases come online however applications seem unable to connect and we see various connection error type messages such as,
'"Cannot connect to SQL Server instance 'HH-SQL-D11' :
A transport-level error has occurred when receiving results from the server. (provider: TCP Provider, error: 0 -
The semaphore timeout period has expired.) : The semaphore timeout period has expired  (requires acknowledgement)"
'named pipes provider error 40 the network path was not found'
Yet when we failover manually the connection is fine and works as expected. Other AGs on the cluster have automatically failed over and have worked fine. I have checked various things like the SQL Instance is up, TCP/IP and Named pipes enabled, remote connections enabled. I have asked the networks team to check things like firewalls and they say they are all ok.
I have tried to read the cluster logs but I am not sure what to look for in here and I am at a lost as to what to look at to try and fix this issue. Its a problem as it means that HA is not operating and if it does failover automatically applications connecting will not work.
Any help would be greatly appreciated.