Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    CARP broken in nighly build

    Scheduled Pinned Locked Moved HA/CARP/VIPs
    6 Posts 3 Posters 1.1k Views 3 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • T Offline
      thesurf
      last edited by

      I installed 4 pc with nightly build from 21.8.2018.
      I then setup two ha cluster with 2 member.

      I can ping from cluaster a member 1 and 2 to the other cluster b member 1 and 2 without any problem.
      I then added a virtual IP with carp. Both cluster reconive master and backup state correct.

      When I ping from cluster a member 2 to meber 1 (carp IP - master) I get a package loss.
      When I ping from cluster a member 1 (master) to cluster b member 1 (master) I get package loss. It is so extrem that the ping stals when I to a ping at the same time the other way arround.

      0_1535133113396_a2baf142-c317-4aa6-b6c0-01017a171ad8-image.png

      0_1535133254781_a14144a2-bdae-45b8-8633-0dfab06b8f51-image.png

      Ping Cluster a member 1 to cluster b member 1 (carp master IP)
      0_1535133331562_0417dfac-3d64-48c8-9564-054ce35b286c-image.png

      Ping Cluster a member 1 to cluster b member 1 (interface IP)
      0_1535133385084_449a4f8c-5b83-4cfa-b6cc-511098c9722f-image.png

      als you can see cable switch everything is the same.

      1 Reply Last reply Reply Quote 0
      • T Offline
        thesurf
        last edited by

        I now did a full reinstall on 2.4.3 on the same hardware.
        To make it short. SAME PROBLEM! ☹

        I can ping all hosts and the switches without a problem. But I get lost packages and timeouts when I ping from one side to the other side carp ip.

        Ping cluster b member 2 to cluster a member 1 (is master)

        0_1535296282220_fe4d2bbc-abfe-42e4-9b01-3b9b60e296aa-image.png

        Tcpdump on the cluster a member 1 as you can see. It stops sudden and then I get the timeout packages.
        0_1535296295705_14c795e0-78b5-4020-a025-ddc4e1f7f66a-image.png

        I have no clue where here the Problem is! I have other pfSense installations with ha and lagg that work without a problem.

        1 Reply Last reply Reply Quote 0
        • T Offline
          thesurf
          last edited by

          @Moderador-PfSense : please move to HA/Carp section

          1 Reply Last reply Reply Quote 0
          • T Offline
            thesurf
            last edited by

            Mystery solved!

            The virtual IP in carp has the timeout and package loss due to configuration.

            In my test setup I build a vlan for each wan link between my offices. On both sides are pfsense Cluster. Now the router interfaces on both sides are in the same vlan.

            I setup the carp interfaces on both side with a different password but the same vhid.

            That seems to lead to the problem of package loss and totaly no traffic.

            Once the vhid in the vlan was different everything started working as expected.

            1 Reply Last reply Reply Quote 0
            • JeGrJ Offline
              JeGr LAYER 8 Moderator
              last edited by

              Just to mention it for others searching for CARP problems: Check the troubleshooting guides, that's what they are for. :)

              https://www.netgate.com/docs/pfsense/highavailability/troubleshooting-high-availability-clusters.html#conflicting-vhids

              The aforementioned problem is the first topic in this guide. And VHIDs in general should be a topic to get accustomed with when running HA setups not only with pfSense clusters on both sides. You could have easily had a couple of Juniper, Cisco or other L3 Switches in a VRRP/HSRP/other HA combination setup on the other side. That's what I had a few years ago. Upstream provider (ISP in a datacenter) had our uplinks on a HSRP setup and told me multiple times, they were using an ID >10 in their setup, so I could run it with vhid 1. Turned out the tech was wrong and it defaulted to - yes - vhid 1 on their side, too. So both had the same virtual mac address for both our VIPs on both sides. Easy to see, that such a thing will screw L2 up perfectly. ;)

              Don't forget to upvote 👍 those who kindly offered their time and brainpower to help you!

              If you're interested, I'm available to discuss details of German-speaking paid support (for companies) if needed.

              1 Reply Last reply Reply Quote 0
              • johnpozJ Offline
                johnpoz LAYER 8 Global Moderator
                last edited by

                Moved to HA/Carp section.

                An intelligent man is sometimes forced to be drunk to spend time with his fools
                If you get confused: Listen to the Music Play
                Please don't Chat/PM me for help, unless mod related
                SG-4860 24.11 | Lab VMs 2.8, 24.11

                1 Reply Last reply Reply Quote 0
                • First post
                  Last post
                Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.