Netgate Discussion Forum

    DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times

    DHCP and DNS
    176 Posts 6 Posters 21.6k Views
    • R
      RickyBaker @johnpoz
      last edited by RickyBaker

      @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

      If you set a debug you can get all kinds of info, the response who was asked, etc..

      27ce0d7f-96d0-4a14-b98e-8198773e32f8-image.png
      bd192b9c-f0f7-4a16-99d8-901f1b8ce678-image.png
      seems bad
      When I attempted it on my unraid server, the command wasn't found. When I ran it on pfsense itself and on my plex server, nslookup just seemed to hang, waiting for more input.

      @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

      What browser are you using?

      chrome but i can't imagine that's better

      @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

      If logs are delayed or time is off in them

      My logs were still stuck on 11:11:03 (last config update). i restarted the service and they are updating again.
      5218ab5f-392f-49a1-b307-4a71aeee0fd9-image.png
      a9e12ff4-6166-4ee8-af54-134445d17a5d-image.png
      mine doesn't seem to have an answer section like yours
      And this I believe is the corresponding failure in the log:
      c93e0ece-bbca-41a4-b62e-3b5b41742f1d-image.png
      the previous fail in the log was from the browser...

      I'm kind of swimming in all the different steps that were needed. Was this helpful? What have I discovered about my devices and their usage of the DNS?

      • johnpozJ
        johnpoz LAYER 8 Global Moderator @RickyBaker
        last edited by johnpoz

        @RickyBaker all those are failing.. You see servfail.. So no, it's never going to work.

        Is 10.10.10.1 your actual IP, or are you pointing to the vip of pfblocker?

        In your nslookup debug you never even asked for just www.msn.com - you just asked for www.msn.com.localdomain.

        Put a . on the end of the name in your nslookup (www.msn.com.).. You see how mine did the search with my home.arpa first, but then it dropped that and did my actual query. Yours never did that.
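To illustrate why the trailing dot matters, here is a minimal Python sketch of how a stub resolver might expand a name using its search list. It is a simplified model that assumes the search suffixes are tried first and the bare name last, which matches the nslookup debug behaviour described in this post; real resolvers apply extra rules (such as an ndots threshold), and the function name candidate_names is made up for illustration.

```python
def candidate_names(name, search_domains):
    """Return the fully qualified names a simple stub resolver would try, in order."""
    # A trailing dot marks the name as absolute: no search suffix is applied.
    if name.endswith("."):
        return [name]
    # Otherwise each search suffix is appended first (simplifying assumption)...
    candidates = [f"{name}.{domain.strip('.')}." for domain in search_domains]
    # ...and the name as typed is tried last.
    candidates.append(name + ".")
    return candidates


print(candidate_names("www.msn.com", ["localdomain"]))
print(candidate_names("www.msn.com.", ["localdomain"]))
```

With a search domain of localdomain, the first call lists www.msn.com.localdomain. before www.msn.com., while the second call, with the trailing dot, yields only www.msn.com. itself.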

        What is asking for an HTTPS record vs just an A record? You see in the queries from 10.10.10.10 that it's doing both an A record query and an HTTPS query?

        You might want to add these two options: the first makes it easier to see what is a query and what is a reply, and the servfail option might give some info as to why it failed.

        log-tag-queryreply: yes
        log-servfail: yes

        Add those to what you already have in your options box and save and apply.. This can give you more info..

        So you're not behind a vpn here, pfsense has no vpn client connection? You need to look in your debug at what you're actually asking for. www.msn.com.localdomain is never going to resolve.. Unless you had created that record locally.

        And didn't we go over that 127.0.0.53? You need to know exactly who that is asking.. If you're going to do a dig, do a directed query with @ipaddress...

        An intelligent man is sometimes forced to be drunk to spend time with his fools
        If you get confused: Listen to the Music Play
        Please don't Chat/PM me for help, unless mod related
        SG-4860 24.11 | Lab VMs 2.7.2, 24.11

        • R
          RickyBaker @johnpoz
          last edited by

          @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

          So no, it's never going to work.

          but it DOES SOMETIMES work! that's why it's so infuriating

          @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

          Is 10.10.10.1 your actual IP, or are you pointing to the vip of pfblocker?

          i don't have pfblocker installed 10.10.10.1 is the ip address of my pfsense router.

          @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

          In your nslookup debug you never even asked for just www.msn.com - you just asked for www.msn.com.localdomain.

          I def did not intend to ask for www.msn.com.localdomain and I def did not type the words localdomain when I was running the sample you suggested. I merely enacted the samples you suggested as well as pointing my browser at www.msn.com. I'm guessing the https request is a browser feature that forces https, but that's just conjecture

          @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

          So you're not behind a vpn here, pfsense has no vpn client connection? You need to look in your debug at what you're actually asking for. www.msn.com.localdomain is never going to resolve.. Unless you had created that record locally.

          I am not behind a VPN here (intentionally at least) and I have not created a record for msn.com locally (intentionally at least).

          @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

          And didn't we go over that 127.0.0.53? You need to know exactly who that is asking.. If you're going to do a dig, do a directed query with @ipaddress...

          yes, i knew there was a detail i forgot in that troubleshooting
          04d0af05-6d2f-4528-a854-eafa07af9277-image.png
          bfd227f7-19b4-446d-a21c-f1f1aad4186a-image.png
          8fdb80b5-6444-4ad6-a6fa-756a8afe64fa-image.png
          i'm not 100% sure of the middle one and i have no idea what 127.0.0.53 is. Is there another test i should run to get more color?

          @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

          log-tag-queryreply: yes
          log-servfail: yes

          1bb58492-7a7c-40a2-aafc-c2aecfaf329d-image.png

          • johnpozJ
            johnpoz LAYER 8 Global Moderator @RickyBaker
            last edited by

            @RickyBaker well, let's see what happens with logging of servfail details. Because clearly it's running, and it resolved your pfsense.localdomain name from 10.10.10.1 when you did your nslookup.

            Another thing I notice on your servfail: you're not getting the EDE (extended DNS error) back..

            You should be able to enable that with ede: yes in your custom box

            See here

            servfail.jpg

            • S
              SteveITS Galactic Empire @RickyBaker
              last edited by

              @RickyBaker

              127.0.0.53

              Your screenshot shows Ubuntu, that’s the local DNS resolver.
              https://unix.stackexchange.com/questions/612416/why-does-etc-resolv-conf-point-at-127-0-0-53
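As a rough illustration of the 127.0.0.53 point, the Python sketch below reads the text of a resolv.conf file and reports whether the only thing it lists is the local systemd-resolved stub. It only shows that the stub is in the path; which upstream server the stub forwards to (here presumably pfSense) is reported by resolvectl, not by this file. The function names are hypothetical.

```python
STUB_ADDRESS = "127.0.0.53"  # address of the systemd-resolved local stub listener


def nameservers(resolv_conf_text):
    """Return the nameserver addresses listed in resolv.conf-style text."""
    servers = []
    for line in resolv_conf_text.splitlines():
        line = line.strip()
        # Skip blank lines and comments (lines starting with '#' or ';').
        if not line or line[0] in "#;":
            continue
        fields = line.split()
        if len(fields) >= 2 and fields[0] == "nameserver":
            servers.append(fields[1])
    return servers


def uses_local_stub(resolv_conf_text):
    """True if the local stub resolver is among the configured nameservers."""
    return STUB_ADDRESS in nameservers(resolv_conf_text)


example = "# managed by systemd-resolved\nnameserver 127.0.0.53\nsearch localdomain\n"
print(nameservers(example), uses_local_stub(example))
```

On a real machine the input would be the contents of /etc/resolv.conf. A result of True means queries go to the local stub first, so a plain dig is answered by it; a directed query such as dig @10.10.10.1 bypasses it.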

              Pre-2.7.2/23.09: Only install packages for your version, or risk breaking it. Select your branch in System/Update/Update Settings.
              When upgrading, allow 10-15 minutes to restart, or more depending on packages and device speed.
              Upvote 👍 helpful posts!

              • R
                RickyBaker @SteveITS
                last edited by

                @SteveITS said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                Your screenshot shows Ubuntu, that’s the local DNS resolver.

                does this mean that my plex server isn't using pfsense for dns resolving?

                @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                well, let's see what happens with logging of servfail details.

                tbc you want me to simply rerun those dig/nslookup sample tests you listed earlier right?
                f8db3a4e-fa6e-4200-ba4f-9e5e40136dee-image.png
                DNSKEY Missing? also apparently took way longer to complete
                ab74b3b0-df98-4d67-8da7-39375242ab28-image.png
                25792f22-3f94-40c9-9491-8c0bf15e4d7c-image.png
                37d0f840-9b6d-458e-8fa1-b6c2d24f5723-image.png
                8695d7a7-c1d5-4af5-8146-bea66547fcd0-image.png
                "unfortunately" i was not experiencing an outage at this time
                26664837-5c86-4da2-b3d1-eb0d26b99468-image.png
                9fc63e2c-a4c1-4b7e-8853-744550f6f165-image.png
                68f7b089-32df-4c10-9df5-72808f326f91-image.png

                • johnpozJ
                  johnpoz LAYER 8 Global Moderator @RickyBaker
                  last edited by

                  @RickyBaker there you go - some actual useful info

                  2024-05-12_105937.jpg

                  So you're having some sort of issue with dnssec.. I would expect that to fail with that query - that fqdn is a test fqdn for making sure dnssec is working.. But we are seeing the servfail reason..

                  So now when normal queries fail we might get to the bottom of why you're getting servfail vs an answer to what you ask for.

                  • johnpozJ
                    johnpoz LAYER 8 Global Moderator @RickyBaker
                    last edited by

                    @RickyBaker said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                    does this mean that my plex server isn't using pfsense for dns resolving?

                    No, what it means is it's asking the local cache at 127.0.0.53; your command shows that it points to 10.10.10.1

                    Clearly went over this already like 6 days ago...

                    • GertjanG
                      Gertjan @johnpoz
                      last edited by

                      @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                      So your having some sort of issue with dnssec.

                      When looking up dnssec-failed.org, what would you expect ?

                      https://www.internetsociety.org/resources/deploy360/2013/dnssec-test-sites/

                      https://dnsviz.net/d/www.dnssec-failed.org/dnssec/

                      No "help me" PM's please. Use the forum, the community will thank you.
                      Edit : and where are the logs ??

                      • johnpozJ
                        johnpoz LAYER 8 Global Moderator @Gertjan
                        last edited by

                        @Gertjan exactly - like I said ;)

                        • R
                          RickyBaker @johnpoz
                          last edited by

                          First, I would like to again apologize for my lack of knowledge. I promise I'm not trying to be difficult or annoying. This is all foreign terminology and concepts to me, but I'm trying my best and can't quantify how much I appreciate the time you're taking

                          @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                          So your having some sort of issue with dnssec.. I would expect that to fail with that query - that fqdn is test fqdn for making sure dnssec is working.. But we are seeing the servfail reason..

                          So now when normal queries fail we might get to the bottom of why your getting servfail vs an answer to what you ask for.

                          What do you mean by a normal query? How is this NOT a normal query? (ducks:)) What's the next step you'd like to see to further clarify?

                          @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                          No what it means is its asking the local cache at 127.0.0.53, your command shows that points to 10.10.10.1

                          Clearly went over this already like 6 days ago...

                          ahh that makes sense, sorry I missed that earlier. so does this mean i should be constantly trying new websites i don't ever visit to avoid it falling back to local cache? or is that a fundamental misunderstanding of the steps

                          @Gertjan said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                          When looking up dnssec-failed.org, what would you expect ?

                          thank you for the links. It somehow moved me closer AND farther away from understanding. I have AT&T fiber, so why did it attempt a Comcast-run dnssec fail website? Is going to this website something built into the dig command? Also, correct me if I'm wrong, but I believe y'all had me re-enable DNSSEC just cause it was good practice. I can see how this failing is symptomatic of my greater problems, but it's odd to me that what's manifesting itself is something I've been told is really optional and best practice, not required.

                          @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                          that fqdn is test fqdn for making sure dnssec is working.. But we are seeing the servfail reason..

                          All of this leaves me a little lost as to next steps. I keep going back to this line. I know what fqdn stands for, but this collection of words together just doesn't make sense to me, and I believe it's the key to understanding what I need to do next. as always, thanks for everything and further guidance would be greatly appreciated.

                          • S
                            SteveITS Galactic Empire @Gertjan
                            last edited by

                            dnssec-failed.org

                            Just for reference I see SERVFAIL for it via Google or others.

                            >dig dnssec-failed.org @8.8.8.8
                            
                            ; <<>> DiG 9.16.44 <<>> dnssec-failed.org @8.8.8.8
                            ;; global options: +cmd
                            ;; Got answer:
                            ;; ->>HEADER<<- opcode: QUERY, status: SERVFAIL, id: 64906
                            ;; flags: qr rd ra; QUERY: 1, ANSWER: 0, AUTHORITY: 0, ADDITIONAL: 1
                            
                            ;; OPT PSEUDOSECTION:
                            ; EDNS: version: 0, flags:; udp: 512
                            ; EDE: 9 (DNSKEY Missing): (No DNSKEY matches DS RRs of dnssec-failed.org)
                            ;; QUESTION SECTION:
                            ;dnssec-failed.org.             IN      A
                            
                            ;; Query time: 120 msec
                            ;; SERVER: 8.8.8.8#53(8.8.8.8)
                            ;; WHEN: Mon May 13 10:38:02 Central Daylight Time 2024
                            ;; MSG SIZE  rcvd: 97
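For anyone collecting many of these outputs, here is a small Python sketch that pulls the two fields that matter out of dig's text: the response status and, when present, the Extended DNS Error. It assumes the line layout shown in the output above; the function name summarize_dig is hypothetical, and the sample lines are copied from this post.

```python
import re


def summarize_dig(output):
    """Extract the response status and any Extended DNS Error from dig text output."""
    status = re.search(r"status:\s*([A-Z]+)", output)
    ede = re.search(r"EDE:\s*(\d+)\s*\(([^)]*)\)", output)
    return {
        "status": status.group(1) if status else None,
        "ede_code": int(ede.group(1)) if ede else None,
        "ede_text": ede.group(2) if ede else None,
    }


sample = """\
;; ->>HEADER<<- opcode: QUERY, status: SERVFAIL, id: 64906
;; flags: qr rd ra; QUERY: 1, ANSWER: 0, AUTHORITY: 0, ADDITIONAL: 1
; EDE: 9 (DNSKEY Missing): (No DNSKEY matches DS RRs of dnssec-failed.org)
"""
print(summarize_dig(sample))
```

For the sample this reports status SERVFAIL with EDE code 9 (DNSKEY Missing). Output without an EDE line simply leaves the two ede fields as None.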
                            

                            https://bluecatnetworks.com/blog/the-top-four-dns-response-codes-and-what-they-mean/
                            "a SERVFAIL is the DNS server telling you, “Hey, I can’t give you the answer for that query.”"

                            • johnpozJ
                              johnpoz LAYER 8 Global Moderator @SteveITS
                              last edited by johnpoz

                              @SteveITS well yeah, forwarding and trying to do dnssec is going to be problematic.. But that dnssec-failed.org should always fail.. It's meant to fail.. As a way to validate your dnssec is working..

                              So yeah, if you query any NS that is doing dnssec (google, quad9, etc.) then it would fail.. But if you query some NS that isn't doing dnssec then it would pass..

                              example

                              $ dig @8.8.8.8 dnssec-failed.org
                              
                              ; <<>> DiG 9.16.50 <<>> @8.8.8.8 dnssec-failed.org
                              ; (1 server found)
                              ;; global options: +cmd
                              ;; Got answer:
                              ;; ->>HEADER<<- opcode: QUERY, status: SERVFAIL, id: 3602
                              ;; flags: qr rd ra; QUERY: 1, ANSWER: 0, AUTHORITY: 0, ADDITIONAL: 1
                              
                              ;; OPT PSEUDOSECTION:
                              ; EDNS: version: 0, flags:; udp: 512
                              ; EDE: 9 (DNSKEY Missing): (No DNSKEY matches DS RRs of dnssec-failed.org)
                              ;; QUESTION SECTION:
                              ;dnssec-failed.org.             IN      A
                              
                              ;; Query time: 95 msec
                              ;; SERVER: 8.8.8.8#53(8.8.8.8)
                              ;; WHEN: Mon May 13 10:54:20 Central Daylight Time 2024
                              ;; MSG SIZE  rcvd: 97
                              

                              But if you ask something not doing dnssec..

                              $ dig @4.2.2.2 dnssec-failed.org
                              
                              ; <<>> DiG 9.16.50 <<>> @4.2.2.2 dnssec-failed.org
                              ; (1 server found)
                              ;; global options: +cmd
                              ;; Got answer:
                              ;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 39041
                              ;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 0, ADDITIONAL: 1
                              
                              ;; OPT PSEUDOSECTION:
                              ; EDNS: version: 0, flags:; udp: 8192
                              ;; QUESTION SECTION:
                              ;dnssec-failed.org.             IN      A
                              
                              ;; ANSWER SECTION:
                              dnssec-failed.org.      300     IN      A       96.99.227.255
                              
                              ;; Query time: 52 msec
                              ;; SERVER: 4.2.2.2#53(4.2.2.2)
                              ;; WHEN: Mon May 13 10:55:08 Central Daylight Time 2024
                              ;; MSG SIZE  rcvd: 62
                              

                              This is another example of why it makes no sense to check the dnssec option if you're forwarding.. Either where you forward is doing dnssec already (most of the major players do; some have different IPs you can query that don't, but pretty much all of them do dnssec), or it isn't. If where you forward does not do dnssec, asking for it in unbound settings isn't going to do anything other than, more than likely, cause failures..
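The comparison in the two dig runs above can be written down as a tiny Python sketch: given the status a resolver returns for dnssec-failed.org, guess whether that resolver validates DNSSEC. This is only a rule of thumb, since a SERVFAIL can have other causes; the function name and the wording of the labels are made up for illustration, and the statuses in the example are the ones shown in this post.

```python
def classify_resolver(status_for_test_domain):
    """Interpret the status a resolver returned for dnssec-failed.org."""
    if status_for_test_domain == "SERVFAIL":
        # The deliberately broken zone was rejected.
        return "likely validating"
    if status_for_test_domain == "NOERROR":
        # The broken zone was answered anyway.
        return "not validating"
    return "inconclusive"


# Statuses taken from the dig output in this post.
observed = {"8.8.8.8": "SERVFAIL", "4.2.2.2": "NOERROR"}
for server, status in observed.items():
    print(server, classify_resolver(status))
```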

                              • johnpozJ
                                johnpoz LAYER 8 Global Moderator @RickyBaker
                                last edited by

                                @RickyBaker said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                                All of this leaves me a little lost as to next steps.

                                The next step is to wait till you fail again.. You were seeing servfail - but we didn't know why or what was the reason for it. Now that you have enabled logging of servfail details.. Next time you have a problem - we can hope to see why.. And then address that..

                                Also have you updated to 2.7.2 yet? This should be your next step to be honest..

                                • R
                                  RickyBaker @johnpoz
                                  last edited by

                                  @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                                  Also have you updated to 2.7.2 yet? This should be your next step to be honest..

                                  no i have not but I can prioritize. i know it SHOULD be easy and smooth but i'm so nervous. especially with it not updating by itself.

                                  @SteveITS said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                                  https://bluecatnetworks.com/blog/the-top-four-dns-response-codes-and-what-they-mean/

                                  thanks this is a very useful article

                                  • R
                                    RickyBaker @johnpoz
                                    last edited by

                                    @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                                    The next step is to wait till you fail again..

                                    and what specific commands should i be running? I assume you don't need the resolvectl one, just "dig www.netgate.com" or www.msn.com?

                                    • johnpozJ
                                      johnpoz LAYER 8 Global Moderator @RickyBaker
                                      last edited by

                                      @RickyBaker yeah and looking in the log.. So we can see what it logs for failure if the dig output doesn't show us as much detail on it, etc.

                                      • R
                                        RickyBaker @johnpoz
                                        last edited by

                                        @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                                        yeah and looking in the log.. So we can see what it logs for failure if the dig output doesn't show us as much detail on it, etc.

                                        great thanks for the clarification

                                        • R
                                          RickyBaker @RickyBaker
                                          last edited by

                                          @RickyBaker spent all afternoon waiting with my computer for an outage, finally went to bed around 1130pm. Happened right away and resolved by the time i sprinted downstairs. stay tuned

                                          • R
                                            RickyBaker @RickyBaker
                                            last edited by

                                            been hunting non-stop but the network has "unfortunately" been very stable this week. This morning my wife said she was experiencing the DNS NX issue on her phone right when we woke up, but when i fired up my phone I wasn't experiencing the problem. Went about my morning and a few minutes later, while on the head without a laptop, it happened to me. I fired up my ssh app and ssh'ed into the plex server and got this:
                                            4ae8f116-5d79-4326-88ba-7db61ae9dde9-image.png

                                            ...but then i realized i was doing it on another machine that may not be experiencing the problem. I don't know why I didn't put that together before but the DNS issue USUALLY affects all devices at once but obviously not always. unfortunately my phone's local ssh session doesn't have the dig command. I'll look at installing it to increase my chance of catching it. Unfortunately I forgot that i only have a few minutes to screenshot the logs before they roll off and I missed it. I'm optimistic i'll catch it this weekend.
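One way to avoid having to be at the keyboard during an outage would be to leave a small watcher running on a machine that has dig installed. The Python sketch below is only an illustration of that idea, not something suggested in the thread: it runs a directed query against the resolver every 30 seconds and appends the full dig output to a file whenever the status is anything other than NOERROR. The function names, the file name and the interval are all assumptions.

```python
import subprocess
import time
from datetime import datetime


def run_dig(server, name):
    """Run a directed query (dig @server name) and return dig's text output."""
    try:
        result = subprocess.run(
            ["dig", f"@{server}", name],
            capture_output=True, text=True, timeout=30,
        )
    except subprocess.TimeoutExpired:
        return "dig did not finish within 30 seconds"
    return result.stdout


def check_once(server, name, query=run_dig):
    """Return a timestamped log entry if the reply is not NOERROR, else None."""
    output = query(server, name)
    if "status: NOERROR" in output:
        return None
    stamp = datetime.now().isoformat(timespec="seconds")
    return f"{stamp} {name} @{server}\n{output}\n"


def watch(server, names, logfile, interval=30):
    """Query each name every `interval` seconds and append failures to logfile."""
    while True:
        for name in names:
            entry = check_once(server, name)
            if entry is not None:
                with open(logfile, "a") as fh:
                    fh.write(entry)
        time.sleep(interval)
```

Calling watch("10.10.10.1", ["www.msn.com", "www.netgate.com"], "dns-failures.log") would then keep a timestamped record of each failure, which can be matched against the resolver log afterwards. Note that it only sees failures on the machine it runs on, so it would not catch an outage that affects just one phone.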

                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.