HA/CARP/VIPs



CARP/HA, SYNC and XMLRPC SYNC explained

• • mk6032

3

1
Votes

3
Posts

11.3k
Views


Thanks for the excellent reply. I've retested as you suggested by entering persistent maintenance mode, and there is no packet loss that way (enter persistent maintenance, reboot, leave persistent maintenance). I am still having a small problem with FreeRADIUS XMLRPC sync between the two nodes, but I posted that in a separate topic (see https://forum.pfsense.org/index.php?topic=135864.0).

Regards,
Matt

Primary does not auto fallback with pfsense 2.7.2

• • jypsilantis

5

0
Votes

5
Posts

45
Views


@SteveITS Thank you for this. I expected to see something similar with my primary's NICs. I did however set up CARP a number of times with the UI in 2.7.2, which may have triggered the problem in my case.

I have bi-directional pfsync set up, but XMLRPC sync is only from the primary to the secondary.

I will report the issue to the developers.

New Zealand for management and physical Netgear switch

• • VMlabman

6

0
Votes

6
Posts

49
Views


@VMlabman Well, you need VLAN 20 tagged on the port pfSense is connected to, which is port xg1 on your switch.

Also, what switch is that? You want to be using 802.1Q. It seems like maybe you're in port mode, and I'm not sure what VLAN type "static" means. What is the make and model of that switch?

Maybe that switch is calling it protocol mode? But pfSense is tagging VLAN 20, which is the 802.1Q protocol.

The port connected to your pfSense LAN should be VLAN 1 untagged and VLAN 20 tagged. The port your laptop is on should be VLAN 20 untagged, PVID 20.

Your port 8 looks fine, but I'm only seeing VLAN 1 on port 1, which your arrow shows as connected to the pfSense LAN.
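
To make the tagged/untagged/PVID advice above concrete, here is a minimal sketch of how an 802.1Q switch port classifies and forwards frames. It is a simplified model, not any vendor's implementation: the `SwitchPort` class, the port names, and the helper functions are all hypothetical, and it assumes untagged ingress frames are always placed in the port's PVID.

```python
from dataclasses import dataclass, field


@dataclass
class SwitchPort:
    """Simplified 802.1Q port: names and fields are illustrative only."""
    name: str
    pvid: int                                   # VLAN given to untagged ingress frames
    untagged: set = field(default_factory=set)  # VLANs sent out without a tag
    tagged: set = field(default_factory=set)    # VLANs sent out with a tag


def ingress_vlan(port, frame_tag=None):
    """Return the VLAN a received frame lands in, or None if it is dropped."""
    if frame_tag is None:
        return port.pvid                        # assumption: untagged frames use the PVID
    return frame_tag if frame_tag in port.tagged else None


def egress_tag(port, vlan):
    """Return 'tagged', 'untagged', or None if the port is not a member."""
    if vlan in port.tagged:
        return "tagged"
    if vlan in port.untagged:
        return "untagged"
    return None


def can_forward(src, dst, frame_tag=None):
    """True if a frame entering src can leave through dst."""
    vlan = ingress_vlan(src, frame_tag)
    return vlan is not None and egress_tag(dst, vlan) is not None


# Port facing pfSense as recommended: VLAN 1 untagged, VLAN 20 tagged.
uplink = SwitchPort("xg1", pvid=1, untagged={1}, tagged={20})
# Same port with only VLAN 1 configured (the suspected misconfiguration).
uplink_vlan1_only = SwitchPort("xg1", pvid=1, untagged={1})
# Laptop access port: VLAN 20 untagged, PVID 20.
laptop = SwitchPort("laptop", pvid=20, untagged={20})

print(can_forward(uplink, laptop, frame_tag=20))             # tagged frame from pfSense
print(can_forward(uplink_vlan1_only, laptop, frame_tag=20))  # same frame, VLAN 20 missing
```

In this model the tagged VLAN 20 frame from pfSense only reaches the laptop port when the uplink is a tagged member of VLAN 20, which matches the symptom described.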

HAProxy / Lets Encrypt / Postfix - Dovecot

• • RHLinux

8

0
Votes

8
Posts

1.6k
Views


@Gertjan The reason 465 is deprecated is that it uses encryption over SSL, which is weak, hence why it has been replaced by 587 over TLS. Just because cloud providers and ISPs still support 465 doesn't mean you should use it.

Port 25 is used for mail servers to exchange mail and should not be used for client-to-server roles.

Backup Node Normal Behavior

• • CaptainKeyboard

17

0
Votes

17
Posts

127
Views


@CaptainKeyboard
The hint to check the rule was in my first post.

But I'm glad that it's working now.

Renewing Self Signed WebConfigurator Cert Breaks HA Node Access

• • planedrop

6

0
Votes

6
Posts

61
Views


@SteveITS Gotcha, yeah I probably should be using a proper cert anyway.

I'll still see if I can replicate this with another HA setup though and post back here.

Virtual IP: from IP alias to CARP type

• • tosman06

5

0
Votes

5
Posts

88
Views


@viragomann thank you!

FRR BGP over IPsec , when HA happens (slave-> master, master ->slave)

• • vinns

14

0
Votes

14
Posts

112
Views


@michmoor said in FRR BGP over IPsec , when HA happens (slave-> master, master ->slave):

curious...
Is FRR running on the standby firewall?

Not at the moment, I'm about to build the slave to form the HA, only a single firewall running at the moment, just waiting for two NICs to arrive.

@michmoor said in FRR BGP over IPsec , when HA happens (slave-> master, master ->slave):

If it is there needs to be a way to have the process down and only running when it becomes active otherwise the standby is going to attempt peering with upstream.

If the state is slave, then pfSsh.php playback disable frr... perhaps that is good logic for a script that runs every second.
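
The "check the CARP state, then decide about FRR" logic could be sketched roughly as below. This is only an illustration under assumptions: it assumes `ifconfig` prints lines of the form `carp: MASTER vhid 1 advbase 1 advskew 0`, the sample text and function names are hypothetical, and it deliberately stops at the decision without starting or stopping any service, since the right command for that is not confirmed here. Note that CARP reports MASTER/BACKUP rather than master/slave.

```python
import re

# Assumed shape of the CARP status line in ifconfig output.
CARP_LINE = re.compile(r"carp:\s+(MASTER|BACKUP|INIT)\s+vhid\s+(\d+)")


def carp_states(ifconfig_text):
    """Map each VHID found in the text to its reported CARP state."""
    return {int(vhid): state for state, vhid in CARP_LINE.findall(ifconfig_text)}


def frr_should_run(states):
    """Run the routing daemon only if every CARP VIP on this node is MASTER."""
    return bool(states) and all(state == "MASTER" for state in states.values())


# Hypothetical ifconfig excerpts for illustration.
SAMPLE_BACKUP = """
igb0: flags=8943<UP,BROADCAST,RUNNING,PROMISC,SIMPLEX,MULTICAST> metric 0 mtu 1500
\tinet 203.0.113.3 netmask 0xffffff00 broadcast 203.0.113.255
\tcarp: BACKUP vhid 1 advbase 1 advskew 100
igb1: flags=8943<UP,BROADCAST,RUNNING,PROMISC,SIMPLEX,MULTICAST> metric 0 mtu 1500
\tcarp: BACKUP vhid 2 advbase 1 advskew 100
"""
SAMPLE_MASTER = SAMPLE_BACKUP.replace("BACKUP", "MASTER")

for label, text in (("backup node", SAMPLE_BACKUP), ("master node", SAMPLE_MASTER)):
    states = carp_states(text)
    print(label, states, "run FRR:", frr_should_run(states))
```

Requiring every VIP to be MASTER is a conservative choice for this sketch: during a partial failover the node would keep FRR down rather than peer upstream from a half-active firewall.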

Is "mass addition" of IP Aliases possible?

• • mnlipp

4

0
Votes

4
Posts

60
Views


So I edited config.xml (plus 63 IP Aliases) and held my breath...

The web interface of the secondary firewall became unresponsive for several minutes (the command line was still available). During this time, the secondary sent dozens of messages about assuming one CARP state or another.

Eventually, things settled down and I could access the web interface again. I found that both firewalls considered themselves master for the "interface" CARP IP and all Alias IPs associated with it.

I temporarily disabled CARP on both firewalls and enabled it again. Now things look okay.
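
For anyone wanting to generate such a batch of entries rather than type them by hand, a rough sketch follows. It is an assumption-heavy illustration: the element names (`vip`, `mode`, `interface`, `descr`, `type`, `subnet_bits`, `subnet`) are my guess at how IP alias entries look in config.xml, real entries may carry additional fields (for example a unique ID), and the interface value, addresses, and function name are hypothetical. Compare against an entry created through the GUI and back up the config before editing.

```python
import ipaddress
import xml.etree.ElementTree as ET


def build_alias_vips(first_ip, count, interface, subnet_bits, descr_prefix="alias"):
    """Build `count` consecutive IP alias <vip> elements starting at first_ip.

    Element names are assumed, not taken from official documentation.
    """
    start = ipaddress.ip_address(first_ip)
    vips = []
    for i in range(count):
        vip = ET.Element("vip")
        fields = (
            ("mode", "ipalias"),
            ("interface", interface),
            ("descr", f"{descr_prefix}-{i + 1}"),
            ("type", "single"),
            ("subnet_bits", str(subnet_bits)),
            ("subnet", str(start + i)),
        )
        for tag, text in fields:
            ET.SubElement(vip, tag).text = text
        vips.append(vip)
    return vips


# Hypothetical values: 63 aliases from a documentation address range.
vips = build_alias_vips("192.0.2.10", 63, interface="wan", subnet_bits=24)
print(ET.tostring(vips[0], encoding="unicode"))
print(len(vips), "entries, last address", vips[-1].find("subnet").text)
```

The output is meant to be reviewed and pasted into the virtual IP section by hand. As the post above shows, applying a large batch at once can leave CARP in an odd state, so a maintenance window is advisable.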

Full-mesh using 2×Netgate 7100 1U + 2×Dell S4148T-ON

• • nxsysop

3

0
Votes

3
Posts

330
Views


@nxsysop Hi, I know this is an old post, but I'm wondering if your solution worked. We are also trying to set this up using a pair of 8200s. We are going to use LACP, but weren't sure whether static or dynamic would work with the Dell switches, which are set up using VLT. Thanks

Stop IGMP Proxy Service with CARP in status Backup

• • Enrica_CH

2

0
Votes

2
Posts

51
Views


I haven't found a solution so far for running HA with IGMP Proxy.

Does anybody have a solution that works well?

No State Creator Host IDs visible

• • unico-dm

21

0
Votes

21
Posts

700
Views


As this problem has bitten me several times now when upgrading CARP clusters, I tried to troubleshoot it a bit more. Here is what I found out:

I read somewhere that there were protocol changes in pfsync between 2.7.0 and 2.7.2. So if you upgrade the nodes one by one, there is a period where one system is already on 2.7.2 and the other is still on 2.7.0. During this time 2.7.0 tries to sync states to 2.7.2 and vice versa. Due to the version mismatch, the state table seems to end up with invalid records, which cause the "too many creators" error and make the state sync fail. Even after both nodes reach 2.7.2, those invalid records remain in the state tables and still break the sync.

The resolution to this is:
Either reboot both nodes at the same time, so the broken records are not synced back and forth. Downtime is about 1-2 minutes on fast systems.
Or disable state sync via pfsync on both nodes (System > High Availability), reset the state tables on both nodes (Diagnostics > States, Reset States), then re-enable state sync. Downtime is a few seconds, as only the dropped connections need to be re-established.

Unfortunately both procedures cause a disruption of traffic, though with the second way of getting pfsync going again the disruption is rather short. An additional impact is that during the upgrade process state sync is broken from the time the first node reboots until the states are cleared on both nodes after both are upgraded.

Hope this helps some people that run into the same issues.

Regards
Holger

WAN link unplugged, but LAN does not fail over to Backup

• • leiw

15

0
Votes

15
Posts

458
Views


I have replicated the topology in a GNS3 lab and have the same issue:

Immagine 2024-03-27 172830.jpg

CARP - VLAN VIPS showing master on both

• • iptvcld

1

0
Votes

1
Posts

51
Views

No one has replied

Setup pfSync causes an instant crash pfsense 2.7

• • cwager990

9

0
Votes

9
Posts

160
Views


@kprovost I have now got this working. I have no idea what I did differently, but it works on two newly built virtual machines.

pfSense in Azure

pfsense • • minesh.patel

12

0
Votes

12
Posts

2.8k
Views


It’s generally recommended to avoid using the Virtual IP (VIP) to access the GUI for security reasons. The VIP is typically exposed to more traffic and potential attacks, so accessing the GUI through it could expose sensitive administrative interfaces. Instead, it’s safer to access the GUI from a management interface or VPN that’s not directly exposed to the internet. When you route all traffic from the Test subnet through the pfSense firewall using a specific LAN IP, you’re essentially creating a single point of failure. If you want to use the VIP (10.0.2.101) and still have the traffic appear to come from the load balancer’s public IP, you’ll need to ensure that the VIP is correctly configured for outbound NAT and that the load balancer is set up to handle outbound traffic from the VIP address.

IPSec taking long time to connect after CARP IP failover.

• • emmdee

7

0
Votes

7
Posts

204
Views


Are you using pfSense CE or Plus? That is my first follow-up question, since Plus is supposed to have some more "stuff" in it to help with IPsec failover delays, as mentioned in the docs.

It's been a while since I've had to failover a node for testing so I could be remembering wrong but I think it was near instant failover. But the docs do mention it could take until the timeout of the tunnel if the peer is the one initiating.

Do you have dead peer detection enabled and do you know if the other side of the tunnel does? That should in theory cause the peer to initiate the tunnel again quickly.

Also, as far as I can tell, the backup node in the HA cluster should become an initiator when its status changes to Master; I'm sure it is, but can you confirm (when in failover) that the primary says Backup and the secondary says Master? Just to be 100% sure that is working.

Finally, from what I am seeing, I think it should work just as well without XMLRPC, so that's the good news.

Right way to hardware HA for LANs,- LAGGr?

• • Sergei_Shablovsky

3

0
Votes

3
Posts

143
Views


Up

DNS resolution issue with High Availability

• • emc

11

0
Votes

11
Posts

110
Views


@viragomann

I watched all of Netgate's official tutorials.
In one of them they mention that if the setup is structured as a DMZ, outbound NAT should be left at the default:

https://www.youtube.com/watch?v=-UszV8qIaRw&t=2426s

My setup is set as a DMZ
COMCAST ROUTER -> DMZ WAN CARP IP (either pfsense1 or pfsense2)

I removed the custom NAT outbound rules pointing to the WAN CARP IP, and left it at hybrid default rules.
The DNS resolution is working now.

Besides this brief mention in a tutorial from 9 years ago, I do not see the DMZ case mentioned anywhere else in Netgate's documentation. Either way, it is working now. I hope this helps someone else in the future.

Thank you for your help!

New to HA -- questions about DHCP server on LAN interface

• • Defiling2063 0

2

0
Votes

2
Posts

67
Views


I checked the primary and secondary pfSense again last night. dhcpd was running on both. I guess that is probably the intended behaviour. I see the failover dhcpd in the DHCP status page. I think I am all good. Thanks.
