@Derelict said in HA Sync problems since updating to 2.4.4:
explicit-exit-notify does not apply in that case because the OpenVPN instance does not exit on failover. That means the clients will have to time out and when they attempt to reconnect they'll get whatever system holds the CARP VIP at that time.
Ah thanks for adding that. So would only work on a VPN configured on a failover gateway group type of WAN?
Edit: Also we got the problem to appear less now by adding the "no monitoring at all" switch to all 15 VPN gateways. But:
that eliminates the possibility for a some grouping of VPN interfaces
that blocks monitoring and status/availability checks on those gateways
is irritating to the admin staff (as all VPNs are always "on" even if they won't work)
etc.
We can definetly track that down to something that changed since 2.4.4 as with 2.4.3 and ~6-7 VPNs active at the time (with their gateways configured active) that problem didn't happen at all.
So situation is:
with 2.4.4 - whysoever - VPN GWs assigned and configured are "tilting out" the standby node
even without gateway monitoring, the standby node spits out ~300 lines of syslog with every config save on the master as every single VPN interface is set down, then up, then reconfigured, then detected with a "new IP" (even if that didn't change at all) etc. etc.
all that starts after every simple config change
I understand the config sync to be non-trivial, but it strikes me odd, that changes to aliases or rules have to trigger that whole bunch of interface action just to reload and activate some simple ruleset or alias changes?
Thanks for any insight,
Jens