Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    Unexpected alias behaviour - two ranges / size limits with FQDN

    Scheduled Pinned Locked Moved General pfSense Questions
    80 Posts 4 Posters 917 Views 4 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • tinfoilmattT Offline
      tinfoilmatt @stephenw10
      last edited by

      @stephenw10 Hey, at least you're getting paid to placate this behavior. I'm only here to have fun! 🤣

      1 Reply Last reply Reply Quote 1
      • P Online
        Patch @SteveITS
        last edited by Patch

        @SteveITS
        After more testing I suspect the root cause of the bug investigated in this thread is

        • An alias containing one or more FQDN is limited to a little over 512 entries (total across all such aliases), not the 5000 per alias limit suggested in the manual.
        • if you go over that limit further alias table updates are blocked for all aliases

        For the duplicate removal but not restored bug discovered again in this thread, and referenced in your thread, I can't see that being resolved any time soon. Learning how best to live with it is more sensible imo.

        Easy bug trigger

        Tested in a clean pfsense v2.8.1 install, enable WAN GUI access. Save that configuration as baseline. Done to ensure easy repeatability and reduce the validity of the shooting the messenger crap.

        The easiest way of triggering the bug is entering the following 1024 consecutive element alias -> 473 element are shown in the corresponding alias table.
        92 Combined FQDN x1 IPv4 x1024 consecutive.jpg

        Random IP addresses & lower bound

        To put a bound on the lower limit and confirm sequential IP addresses are irrelevant
        Reload the baseline configuration and enter the a 513 element alias such as the following, -> 514 elements are shown in the corresponding alias table
        91 Combined FQDN x1 IPv4 x512.jpg

        Reload the baseline configuration and enter a 1025 element alias (using the method illustrated above) -> 473 element are shown in the corresponding alias table. Which is exactly the same as the sequential IP initial test case. Waiting longer, Filter reload and "killall filterdns" all make no difference.

        Limit per alias or total record count

        To illustrate the limit depends on the load implied by other alias
        Restore the baseline configuration
        Enter Alias like IP_set1 with 50 IP + FQDN x1 as shown below -> 52 records in Alias table
        93 IP_set1 FQDN x1 IPv4 x50.jpg

        Enter Alias like IP_set2 with 50 IP + FQDN x1 using the same method (mine started 202.) -> 52 records shown in the alias table

        Now again try and enter IP_set3 with 512 IP + FQDN x1 as shown below -> zero records. After several filter reloads 271 records. Waiting longer, Filter reload and "killall filterdns" all make no difference.
        Which contrasts with the 514 records when no other aliases were entered.
        94 IP_set1 FQDN x1 IPv4 x512.jpg

        Limit for "mixed mode" alias or also pure FQDN

        To investigate if using explicit IP addresses vs FQDN addresses matter
        Reload the baseline configuration and enter a 1024 FQDN, I copied the first 1024 entires form pfblocker. -> 569 records after wait about 1 hour and lots of filter reloads. After which

        • "killall filterdns" -> No matching processes were found
        • create an Test_loaded containing a single FQDN -> empty alias table

        Which strongly suggests "mixed mode" alias are no different to pure FQDN aliases, just testing takes longer.

        95 Combined FQDN x1024.jpg

        P tinfoilmattT 2 Replies Last reply Reply Quote 0
        • P Online
          Patch @Patch
          last edited by Patch

          Updated above post to investigate if using explicit IP addresses vs FQDN records matters
          Updated title to more accurately describe root cause evident in hind sight

          1 Reply Last reply Reply Quote 0
          • tinfoilmattT Offline
            tinfoilmatt @Patch
            last edited by

            @Patch said in Unexpected alias behaviour - two ranges / size limits with FQDN:

            Easy bug trigger
            [ . . . ]
            The easiest way of triggering the bug is entering the following 1024 consecutive element alias -> 473 element are shown in the corresponding alias table.

            I can confirm this behavior.

            ecd46d8a-5569-45ac-b89e-a0000e4d2888-image.png

            9be71e3c-e06f-41ec-b9f3-686cab04606d-image.png

            1447f1d9-3d22-4501-8f9e-e5ff333dd837-image.png

            1 Reply Last reply Reply Quote 0
            • tinfoilmattT Offline
              tinfoilmatt
              last edited by

              I am holding here despite the...

              Aliases become Tables when loaded into the active firewall ruleset. The contents displayed on this page reflect the current addresses inside tables used by the firewall.

              ...advisory.

              Please let me know if there's something I can/should do next that would further assist the thread.

              1 Reply Last reply Reply Quote 0
              • tinfoilmattT Offline
                tinfoilmatt
                last edited by

                The 473 records are sequential from 123.123.120.0 through 123.123.121.216.

                1 Reply Last reply Reply Quote 0
                • stephenw10S Offline
                  stephenw10 Netgate Administrator
                  last edited by

                  Hmm, I don't think this is a numerical limit. It seems more like a function of timing to me. But either way I would bet the root issue is in filterdns.

                  1 Reply Last reply Reply Quote 1
                  • P Online
                    Patch
                    last edited by Patch

                    @bbcan177 You have been dealing with far larger sets of IP addresses and FQDN in pfsense. How have you been working around

                    1. The limitation in cumulative number of entries in aliases containing a FQDN as described above

                    2. The possibly related consequence of using incremental alias update with duplicate removal but then using incremental IP removal without restoration of he removed duplicate as per https://redmine.pfsense.org/issues/13792 and possibly https://redmine.pfsense.org/issues/9296 and https://redmine.pfsense.org/issues/13793

                    The short term fix

                    1. is relatively simple, just document the limit, add it to redmine and say it's being worked on. Something I can easily work with.

                    2. is more difficult as that bug essential requires the user ensure all alias containing FQDN are always non intersecting (never trigger duplicate removal). Which severely limits the value of alias for me.

                    The longer term fix maybe more difficult as it maybe a program architecture limit rather than a coding bug.

                    • The limit for "mixed mode" aliases may be resolvable by internally processing the constants (explicitly specified IP addresses) separately to the variable entries (FQDN) then combining the two "sub aliases". The constant portion only need to be evaluated at alias editing or program boot up. Optimisation to group longer runs of consecutive addresses into CIDR prefix notation would be possible.

                    • Regression analysis I suspect would show the progressive slowing of alias update and eventual lock up investigated in this thread is a result of earlier work to try and avoid the limitations of implicit duplicate removal addressed in https://redmine.pfsense.org/issues/9296

                    • A solution maybe to pre process so addition to filterdns are only made when the duplicate count for that address goes from 0->1 and similalry deletion from filterdns are only made when the duplicate count for that address goes from 1->0. Other duplicate count transitions 1<->2<->3 etc result in no filterdns calls.

                    • Alternatively incremental deletes could be avoided completely by all updated being done by a full alias rebuild to a temporary variable then full replacement of the old with the new.

                    • or combine the above two. When updating an alias containing one or more FQDN fully rebuild the alias to a temporary variable. Then compare the active and new aliases calling filterdns to update the active alias only as required

                    None of which sound trivial to me.

                    1 Reply Last reply Reply Quote 0
                    • stephenw10S Offline
                      stephenw10 Netgate Administrator
                      last edited by

                      pfBlocker uses URL table aliases which are not limited to 3000 entries like other alias types. But as I said I don't think that's what we're hitting here.

                      P 1 Reply Last reply Reply Quote 0
                      • P Online
                        Patch @stephenw10
                        last edited by

                        @stephenw10 You may well be correct.
                        I agree it has features suggestive of timing or a race condition such as

                        • performance varies as measured by the total number of entries processed correctly in aliases containing at least one FQDN
                        • This performance is worse under heavier loads (greater total number entries in aliases containing a FQDN).

                        Other feature are not typical of a race condition

                        • For a given test vector the performance (total number of entries processed correctly) is exactly the same for each run
                        • For a given test vector the performance is exactly the same on different hardware (473 for one FQDN and /22 as per my and tinfoilmatt testing)
                        • For a given test vector the performance is exactly the same across pfsense v2.81 and v2.72 software. The 256 total is remarkable not because it happens to be a binary number but shows varying the software version changes how how many entries are processed in each of the 3 aliases but still results in exactly the same total entries processed / performance
                        • With lots of FQDN the system eventually reaches an identical end point but locks up gradually, getting very slow prior to actually locking up
                        • Looking at the Resolver logs, IPv4 address showing in the alias tables are processed once. The missing IPv4 addresses are processed twice not zero times.
                        • there is an old redmine to prevent duplicate processing resulting in removal of IP addresses. List size limits of that fix is most suspicious IMO
                        1 Reply Last reply Reply Quote 0
                        • First post
                          Last post
                        Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.