Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    error(s) loading the rules: pfctl: DIOCADDRULENV: No such file or directory

    Scheduled Pinned Locked Moved Firewalling
    17 Posts 4 Posters 933 Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • C
      coreybrett
      last edited by

      How would I troubleshoot this ?

      There were error(s) loading the rules: pfctl: DIOCADDRULENV: No such file or directory - The line in question reads [0]: @ 2024-06-07 08:57:48

      K 1 Reply Last reply Reply Quote 0
      • K
        kprovost @coreybrett
        last edited by

        @coreybrett Start by specifying the version you're seeing this on, and then confirming that pfctl -g -f /tmp/rules.debug also sees this problem.

        If it does, truss pfctl -g -f /tmp/rules.debug ought to show either an ioctl or a netlink read/write operation returning ENOENT.
        That error is very unexpected, because I can't find any place in the relevant code path that'd return that specific error.

        Finally, run dtrace -n 'fbt::pf_ioctl_addrule:return { printf("@%x => %d", arg0, arg1); stack(); }' and then re-apply the rules to trigger the error.

        Post the full output of all of those commands. Yes, that will be a lot of output.

        1 Reply Last reply Reply Quote 0
        • JonathanLeeJ
          JonathanLee
          last edited by JonathanLee

          @kprovost said in error(s) loading the rules: pfctl: DIOCADDRULENV: No such file or directory:

          dtrace -n 'fbt::pf_ioctl_addrule:return { printf("@%x => %d", arg0, arg1); stack(); }

          Screenshot 2024-07-23 at 14.50.19.png

          I also have this error

          truss pfctl.txt

          Make sure to upvote

          K 1 Reply Last reply Reply Quote 0
          • K
            kprovost @JonathanLee
            last edited by

            @JonathanLee I'm a little confused. That truss output shows no errors. It also doesn't show it printing anything.

            Did you see the error manually running pfctl -g -f /tmp/rules.debug?

            JonathanLeeJ 1 Reply Last reply Reply Quote 0
            • JonathanLeeJ
              JonathanLee @kprovost
              last edited by

              @kprovost I did the command directly after the error showed in the gui

              Make sure to upvote

              K 1 Reply Last reply Reply Quote 0
              • K
                kprovost @JonathanLee
                last edited by

                @JonathanLee That would mean that the error only happens intermittently, which is even stranger.

                Can you reproduce the error?
                If so, keep the above dtrace command running while you reproduce it and then supply that output.

                JonathanLeeJ 1 Reply Last reply Reply Quote 0
                • JonathanLeeJ
                  JonathanLee @kprovost
                  last edited by

                  @kprovost it only happens in 24 it does not occur on my other boot environments, it also only occurs directly after a reboot. It also occurs when I have access control lists that are marked both IPv6 and IPv4 if I do not have them set that way it does not occur.

                  Example block both IPv4 Ipv6 to OPT1 rules like that

                  Make sure to upvote

                  K 1 Reply Last reply Reply Quote 0
                  • K
                    kprovost @JonathanLee
                    last edited by

                    @JonathanLee I have a theory about what's happening here. Basically, the error message is misleading because we're not actually getting 'ENOENT'. The error handling code in pfctl is printing the wrong error.

                    The cause is likely to be a simple conflict between two processes trying to update rules at the same time. That's something the should PHP handle, but because the error doesn't match what it expects it doesn't.

                    If you're comfortable editing the PHP code it's a fairly simple thing to test:

                    --- /etc/inc/filter.inc.orig    2024-07-26 12:09:54.964680000 +0000
                    +++ /etc/inc/filter.inc 2024-07-26 12:10:15.221720000 +0000
                    @@ -624,7 +624,7 @@
                                            break;
                                    }
                                    if (strstr($_grbg, "DIOCADDALTQ: Device busy") ||
                    -                   strstr($_grbg, "DIOCADDRULE: Device busy") ||
                    +                   strstr($_grbg, "DIOCADDRULE") ||
                                        strstr($_grbg, "DIOCXCOMMIT: Device busy")) {
                                            // when busy status is returned retry after a short pause
                                            usleep(200000);//try again after 200 ms..unless it still fails after 10x
                    

                    So, in human terms, edit /etc/inc/filter.inc and on line 627 and change strstr($_grbg, "DIOCADDRULE: Device busy") to strstr($_grbg, "DIOCADDRULE") || (i.e. remove ': Device busy'). That ought to make the code match the error and retry in case of concurrent updates.

                    JonathanLeeJ 1 Reply Last reply Reply Quote 1
                    • JonathanLeeJ
                      JonathanLee @kprovost
                      last edited by

                      @kprovost should this also be a Redmine? This could be a patch also…

                      Make sure to upvote

                      K 1 Reply Last reply Reply Quote 0
                      • K
                        kprovost @JonathanLee
                        last edited by

                        @JonathanLee Let’s confirm first.

                        JonathanLeeJ 2 Replies Last reply Reply Quote 1
                        • JonathanLeeJ
                          JonathanLee @kprovost
                          last edited by

                          @kprovost give me a min I have to boot that I am doing testing with IPv6 static assignments and squid right now it is working well. Let me swap boot environments and use this config for 24 too

                          Make sure to upvote

                          1 Reply Last reply Reply Quote 0
                          • JonathanLeeJ
                            JonathanLee @kprovost
                            last edited by

                            @kprovost Done...
                            Version 24.03-RELEASE (arm64)
                            with ipv6 tunnel broker over functional ssl intercept squid

                            Before.....

                            Screenshot 2024-07-26 at 11.01.58.png

                            After

                            Screenshot 2024-07-26 at 11.02.35.png

                            Will update if error returns

                            Make sure to upvote

                            1 Reply Last reply Reply Quote 1
                            • JonathanLeeJ
                              JonathanLee
                              last edited by

                              This caused issues with rule creating and the ACL rules order would move around if you changed the busy condition. This is during configuration changes.

                              Make sure to upvote

                              K 1 Reply Last reply Reply Quote 0
                              • K
                                kprovost @JonathanLee
                                last edited by

                                @JonathanLee What sort of issues?

                                (I'm on holiday, so there will be no further progress for the next two weeks.)

                                JonathanLeeJ 1 Reply Last reply Reply Quote 2
                                • JonathanLeeJ
                                  JonathanLee @kprovost
                                  last edited by

                                  @kprovost have a great vacation.

                                  Make sure to upvote

                                  C 1 Reply Last reply Reply Quote 0
                                  • C
                                    clawsonn @JonathanLee
                                    last edited by clawsonn

                                    !_upload_to_netgate.txt

                                    Experienced same error issue recently. Attached the file of the output from putty cli commands requested in earlier posts. I do not believe I was able to reproduce the issue. The putty file output text was approx 6MB and the netgate file upload accepts only 2MB so I cutoff a large portion of the end and don't know if the good content is missing or not. A lot of the lines appear identical.

                                    Machine is:
                                    Boot Environment
                                    default
                                    Current Base System
                                    24.03
                                    Latest Base System
                                    24.03
                                    Status
                                    Up to date.

                                    C 1 Reply Last reply Reply Quote 0
                                    • C
                                      coreybrett @clawsonn
                                      last edited by

                                      @clawsonn In my case, I had a bad WAN connection that was triggering this issue. It was also making HAProxy crash. As soon as I disabled that WAN (it was a 4g backup), everything went back to normal.

                                      1 Reply Last reply Reply Quote 0
                                      • First post
                                        Last post
                                      Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.