Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    PFSense Private network interface disable very frequetly

    Scheduled Pinned Locked Moved General pfSense Questions
    12 Posts 3 Posters 1.3k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • C
      chandranath
      last edited by

      We have a PFSense running on Baremetal and noticed that PFSense private interface is disabled automatically and this is happening very frequently. We had to restart baremetal to recover this issue. Both PFSense and Switch side interface status show UP but no traffic passing through an interface and even local gateway was not reachable. This is happening for Private interface only... We have enabled IPS Suricata

      NollipfSenseN 1 Reply Last reply Reply Quote 0
      • NollipfSenseN
        NollipfSense @chandranath
        last edited by

        @chandranath You realize that to help us understand what is happening with your network you'll need to provide relevant info such as screen shot of your configuration.

        pfSense+ 23.09 Lenovo Thinkcentre M93P SFF Quadcore i7 dual Raid-ZFS 128GB-SSD 32GB-RAM PCI-Intel i350-t4 NIC, -Intel QAT 8950.
        pfSense+ 23.09 VM-Proxmox, Dell Precision Xeon-W2155 Nvme 500GB-ZFS 128GB-RAM PCIe-Intel i350-t4, Intel QAT-8950, P-cloud.

        1 Reply Last reply Reply Quote 0
        • C
          chandranath
          last edited by chandranath

          We found this information when firewall interface is stop sending traffic , We have Baremetal in IBM Cloud and we manage only Baremetal and backend switch managed by IBM Cloud and they confirmed they did not find any issue.
          Error : laggport: ix0 flags=1c<COLLECTING>

          ifconfig lagg0

          lagg0: flags=8943<UP,BROADCAST,RUNNING,PROMISC,SIMPLEX,MULTICAST> metric 0 mtu 9000
          options=8500b8<VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU,VLAN_HWCSUM,VLAN_HWFILTER,VLAN_HWTSO>
          ether 0c:c4:7a:8f:7c:fc
          inet6 fe80::ec4:7aff:fe8f:7cfc%lagg0 prefixlen 64 scopeid 0xb
          inet 10.45.30.76 netmask 0xffffffc0 broadcast 10.45.30.127
          inet 10.45.30.67 netmask 0xffffffc0 broadcast 10.45.30.127 vhid 11
          nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
          media: Ethernet autoselect
          status: active
          carp: MASTER vhid 11 advbase 5 advskew 0
          groups: lagg
          laggproto lacp lagghash l2,l3,l4
          laggport: ix0 flags=8<COLLECTING>
          laggport: ix2 flags=8<COLLECTING>
          Collapse

          During an outage out secondary firewall became a master and was exchanging VRRP.
          10.45.30.76 . Primary and 10.45.30.85 secondary firewall

          tcpdump -l -i lagg0 -nn "vrrp"

          tcpdump: verbose output suppressed, use -v or -vv for full protocol decode
          listening on lagg0, link-type EN10MB (Ethernet), capture size 262144 bytes
          15:55:53.130271 IP 10.45.30.76 > 224.0.0.18: VRRPv2, Advertisement, vrid 11, prio 240, authtype none, intvl 5s, length 36
          15:55:53.477548 IP 10.45.30.85 > 224.0.0.18: VRRPv2, Advertisement, vrid 11, prio 100, authtype none, intvl 5s, length 36
          15:55:58.877258 IP 10.45.30.85 > 224.0.0.18: VRRPv2, Advertisement, vrid 11, prio 100, authtype none, intvl 5s, length 36
          15:55:59.109346 IP 10.45.30.76 > 224.0.0.18: VRRPv2, Advertisement, vrid 11, prio 240, authtype none, intvl 5s, length 36
          15:56:04.269434 IP 10.45.30.85 > 224.0.0.18: VRRPv2, Advertisement, vrid 11, prio 100, authtype none, intvl 5s, length 36
          Collapse


          Nov 26 13:52:32 firewall1 kernel: carp: demoted by 240 to 240 (send error 50 on lagg0)
          Nov 26 13:52:32 firewall1 kernel: carp: 10@lagg1: MASTER -> BACKUP (more frequent advertisement received)
          Nov 26 13:52:32 firewall1 kernel: carp: 13@lagg1.816: MASTER -> BACKUP (more frequent advertisement received)
          Nov 26 13:52:32 firewall1 kernel: ifa_maintain_loopback_route: deletion failed for interface lagg1: 3
          Nov 26 13:52:32 firewall1 kernel: ifa_maintain_loopback_route: deletion failed for interface lagg1.816: 3

          Please let us if you need more information.

          1 Reply Last reply Reply Quote 0
          • C
            chandranath
            last edited by

            =================================================
            2 identical hardware/baremetal, used for pfSense HA pair.

            Intel(R) Xeon(R) CPU E3-1270 v3 @ 3.50GHz
            8 CPUs: 1 package(s) x 4 core(s) x 2 hardware threads

            pfSense version:
            2.4.4-RELEASE-p3 (amd64)
            FreeBSD 11.2-RELEASE-p10

            Setup has multiple 2 LACP bonds, VLANs, aliases, NAT, CARP, VPN tunnels, Suricata IPS.
            Bandwidth CPU utilization is around single digit.

            We see CARP being triggered on private interface post LACP bonding errors. Please find error on previous post.
            The CARP switches only private interface traffic, resulting split brain. Tweaking "net.inet.carp.senderr_demotion_factor" value affected complete switchover.
            However we are not able to the root cause of bonding failure.
            The frequency is quite regular and mostly during start of the day.
            The same hardware used to work with other firewall model with no issues for more than 24 months.
            The issue happens on both firewalls.
            Backend switch did not show any errors on interface.

            1 Reply Last reply Reply Quote 0
            • NollipfSenseN
              NollipfSense
              last edited by

              Hopefully, others more advance might be able to help you so I'll give it a bump!

              pfSense+ 23.09 Lenovo Thinkcentre M93P SFF Quadcore i7 dual Raid-ZFS 128GB-SSD 32GB-RAM PCI-Intel i350-t4 NIC, -Intel QAT 8950.
              pfSense+ 23.09 VM-Proxmox, Dell Precision Xeon-W2155 Nvme 500GB-ZFS 128GB-RAM PCIe-Intel i350-t4, Intel QAT-8950, P-cloud.

              C 2 Replies Last reply Reply Quote 0
              • C
                chandranath
                last edited by chandranath

                Two identical hardware/baremetal, used for pfSense HA pair.
                8 CPUs: 1 package(s) x 4 core(s) x 2 hardware threads
                pfSense version:
                2.4.4-RELEASE-p3 (amd64)
                FreeBSD 11.2-RELEASE-p10
                Setup has multiple VLANs, 2 LACP bonds, aliases, NAT, CARP, VPN tunnels, Suricata IPS.
                Bandwidth CPU utilization is around single digit.
                Network Connections Intel® i210 Gigabit Ethernet Controllers: The NIC card is integrated into the motherboard
                • Two (2) i210 LAN controllers for LAN1/LAN2
                • Two (2) RJ-45 rear I/O panel connectors with Link and Activity LEDs

                1 Reply Last reply Reply Quote 0
                • C
                  chandranath
                  last edited by chandranath

                  Please find an attached screen shot of both Primary and secondary firewall basic configuration.

                  Firewall 1: Primary
                  c290f32c-5fa7-4260-90bc-d587fa04758a-image.png
                  62f93df3-1a1a-49cd-9bc5-3ab164fdc01f-image.png
                  f42f4e3d-a69b-405e-8633-d67e2a0a4693-image.png
                  ![a184b5cc-bb7c-4b66-9bf8-5fcf898b153d-image.png]
                  beedf7fe-8e48-4ef1-a840-2cf1a8b75a5d-image.png (/assets/uploads/files/1577985712398-a184b5cc-bb7c-4b66-9bf8-5fcf898b153d-image.png)
                  e121ba5d-212e-4732-9ccd-a69aa90ac268-image.png When issue happens: Below is CARP status – Firewall1 -primary
                  We found this information when firewall interface is stop sending traffic , We have Baremetal in IBM Cloud and we manage only Baremetal and backend switch managed by IBM Cloud and they confirmed they did not find any issue.
                  Error : laggport: ix0 flags=1c<COLLECTING>
                  ifconfig lagg0
                  lagg0: flags=8943<UP,BROADCAST,RUNNING,PROMISC,SIMPLEX,MULTICAST> metric 0 mtu 9000
                  options=8500b8<VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU,VLAN_HWCSUM,VLAN_HWFILTER,VLAN_HWTSO>
                  ether 0c:c4:7a:8f:7c:fc
                  inet6 fe80::ec4:7aff:fe8f:7cfc%lagg0 prefixlen 64 scopeid 0xb
                  inet 10.45.30.76 netmask 0xffffffc0 broadcast 10.45.30.127
                  inet 10.45.30.67 netmask 0xffffffc0 broadcast 10.45.30.127 vhid 11
                  nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
                  media: Ethernet autoselect
                  status: active
                  carp: MASTER vhid 11 advbase 5 advskew 0
                  groups: lagg
                  laggproto lacp lagghash l2,l3,l4
                  laggport: ix0 flags=8<COLLECTING>
                  laggport: ix2 flags=8<COLLECTING>
                  Collapse
                  During an outage out secondary firewall became a master and was exchanging VRRP.
                  10.45.30.76 . Primary and 10.45.30.85 secondary firewall
                  tcpdump -l -i lagg0 -nn "vrrp"
                  tcpdump: verbose output suppressed, use -v or -vv for full protocol decode
                  listening on lagg0, link-type EN10MB (Ethernet), capture size 262144 bytes
                  15:55:53.130271 IP 10.45.30.76 > 224.0.0.18: VRRPv2, Advertisement, vrid 11, prio 240, authtype none, intvl 5s, length 36
                  15:55:53.477548 IP 10.45.30.85 > 224.0.0.18: VRRPv2, Advertisement, vrid 11, prio 100, authtype none, intvl 5s, length 36
                  15:55:58.877258 IP 10.45.30.85 > 224.0.0.18: VRRPv2, Advertisement, vrid 11, prio 100, authtype none, intvl 5s, length 36
                  15:55:59.109346 IP 10.45.30.76 > 224.0.0.18: VRRPv2, Advertisement, vrid 11, prio 240, authtype none, intvl 5s, length 36
                  15:56:04.269434 IP 10.45.30.85 > 224.0.0.18: VRRPv2, Advertisement, vrid 11, prio 100, authtype none, intvl 5s, length 36
                  Collapse
                  Nov 26 13:52:32 firewall1 kernel: carp: demoted by 240 to 240 (send error 50 on lagg0)
                  Nov 26 13:52:32 firewall1 kernel: carp: 10@lagg1: MASTER -> BACKUP (more frequent advertisement received)
                  Nov 26 13:52:32 firewall1 kernel: carp: 13@lagg1.816: MASTER -> BACKUP (more frequent advertisement received)
                  Nov 26 13:52:32 firewall1 kernel: ifa_maintain_loopback_route: deletion failed for interface lagg1: 3
                  Nov 26 13:52:32 firewall1 kernel: ifa_maintain_loopback_route: deletion failed for interface lagg1.816: 3
                  ca9c5f5d-ace5-4d19-b357-fd87333950fd-image.png

                  Firewall2 -Secondary
                  5c1cbc30-9b95-4527-8f59-e8a86c0c82e6-image.png
                  f1411b1b-26d2-4218-9e34-874b5e4edd31-image.png
                  e3c295c2-ec82-4370-9628-b39744e88fa6-image.png
                  22b37b4a-c631-49b2-a434-57f93492fbd9-image.png

                  When issue happens: Below is CARP status – Firewall2 -Secondary
                  59fb8ad2-6e22-4845-8768-2886ee5b3339-image.png

                  After few days faced similar issue on firewall2-secondary ( Current Master)
                  6f2532f8-721a-44aa-b2de-16354c8ad1d2-image.png

                  1 Reply Last reply Reply Quote 0
                  • C
                    chandranath @NollipfSense
                    last edited by

                    This post is deleted!
                    1 Reply Last reply Reply Quote 0
                    • C
                      chandranath @NollipfSense
                      last edited by chandranath

                      This post is deleted!
                      C 1 Reply Last reply Reply Quote 0
                      • C
                        chandranath @chandranath
                        last edited by

                        This post is deleted!
                        1 Reply Last reply Reply Quote 0
                        • C
                          chandranath
                          last edited by chandranath

                          This post is deleted!
                          1 Reply Last reply Reply Quote 0
                          • C
                            chetanwa
                            last edited by

                            Hello..
                            If someone gets similar issue, please try disabling LACP strict mode.
                            It worked in our case.

                            All the best

                            1 Reply Last reply Reply Quote 0
                            • First post
                              Last post
                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.