Storage Network Partitioned - S2D

Luke Jackson 0 Reputation points
2025-08-20T20:16:58.94+00:00

We have deployed an Azure Local cluster (three nodes, HPE DL380 Gen11)

All of a suddent the storage network in Failover Cluster Manager is reporting as "Partitioned" and we're seeing errors in the Windows Admin Centre console related to a loss of quorum.

If I disallow the cluster to use the storage network the error disappears but we also lose a good amount of performance, when I enable it, we get some performance back (not all) and it very quickly transitions to partitioned.

The storage network is two different physical Mellanox NIC's connected to an Aruba switch via 25Gb DAC cables. All communication would appear to be fine, pings are fine between the IP's on the storage network, the interfaces are trunk ports allowing both Storage VLAN's to communicate accordingly.

On Windows Admin Centre we get these errors:

The cluster detected network connectivity issues that prevent Storage Spaces Direct from working properly.

If a maintenance operation is ongoing, please suspend it and restore access to all storage until the storage stabilizes.

The switch configuration is matched across all storage NIC ports and the settings across the nodes look to be identical for internal PFC/RDMA etc.

Azure Local
{count} votes

1 answer

Sort by: Most helpful
  1. Ankit Yadav 410 Reputation points Microsoft External Staff Moderator
    2025-08-26T17:17:07.0633333+00:00

    It turned out that the IP addresses assigned to the two storage networks on node 2 and node 3 were misconfigured (ie NIC was attached to VLAN with IP (10.71.2.X) but the IP address was assigned from VLAN with IP (10.71.1.X)). This caused the loss in communication when using storage network.

    To fix the issue, the customer had to fix the IP address misconfiguration.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.