CARP works fine for weeks at a time, then seemingly at random goes split brain
HA pair in VMware. It originally worked fine for roughly 3 years, but for the last 6 months or so it will every so often go split brain, and only on some interfaces. Hardware is 2 Dell R440s and HPE/Aruba 2920s.
Server 1 and switch 1 are in comms room 1. Switch 2 and server 2 are in a separate building, with an LACP trunk linking the two switches. ESXi 6.0u3 on both servers. Each has a single vSwitch with one port group for WAN and a separate port group for the VLAN trunk (multiple VLANs). The config is the same on both other than IPs.

When the issue occurs, if we reboot server 2 (it always seems to be the one showing master for some interfaces and backup for others), the issue is resolved until server 2 comes back up. If we reboot switch 2, then once it comes back up everything is fine for at least a few days (the shortest time before the issue reoccurred), but generally for at least 3 or 4 weeks. We have replaced switch 2 twice now, and HPE/Aruba have found no issue with any of them, but the same issue eventually returns.

It doesn't make much sense, so any guidance on where else to look would be appreciated. As I say, it originally worked without issue for years. The infrastructure has not changed except for additional VMs on both servers, and those are on separate port groups.
Sorry, correction: both hosts are on ESXi 6.5u3.