Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times

    Scheduled Pinned Locked Moved DHCP and DNS
    176 Posts 6 Posters 21.6k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • R
      RickyBaker @SteveITS
      last edited by

      @SteveITS said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

      Your screenshot shows Ubuntu, that’s the local DNS resolver.

      does this mean that my plex server isn't using pfsense for dns resolving?

      @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

      well lets see what happens with logging of servfail detaills.

      tbc you want me to simply rerun those dig/nslookup sample tests you listed earlier right?
      f8db3a4e-fa6e-4200-ba4f-9e5e40136dee-image.png
      DNSKEY MIssing? also apparently way longer to complete
      ab74b3b0-df98-4d67-8da7-39375242ab28-image.png
      25792f22-3f94-40c9-9491-8c0bf15e4d7c-image.png
      37d0f840-9b6d-458e-8fa1-b6c2d24f5723-image.png
      8695d7a7-c1d5-4af5-8146-bea66547fcd0-image.png
      "unfortunately" i was not experiencing an outage at this time
      26664837-5c86-4da2-b3d1-eb0d26b99468-image.png
      9fc63e2c-a4c1-4b7e-8853-744550f6f165-image.png
      68f7b089-32df-4c10-9df5-72808f326f91-image.png

      johnpozJ 2 Replies Last reply Reply Quote 0
      • johnpozJ
        johnpoz LAYER 8 Global Moderator @RickyBaker
        last edited by

        @RickyBaker there you go - some actual useful info

        2024-05-12_105937.jpg

        So your having some sort of issue with dnssec.. I would expect that to fail with that query - that fqdn is test fqdn for making sure dnssec is working.. But we are seeing the servfail reason..

        So now when normal queries fail we might get to the bottom of why your getting servfail vs an answer to what you ask for.

        An intelligent man is sometimes forced to be drunk to spend time with his fools
        If you get confused: Listen to the Music Play
        Please don't Chat/PM me for help, unless mod related
        SG-4860 24.11 | Lab VMs 2.7.2, 24.11

        GertjanG R 2 Replies Last reply Reply Quote 0
        • johnpozJ
          johnpoz LAYER 8 Global Moderator @RickyBaker
          last edited by

          @RickyBaker said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

          does this mean that my plex server isn't using pfsense for dns resolving?

          No what it means is its asking the local cache at 127.0.0.53, your command shows that points to 10.10.10.1

          Clearly went over this already like 6 days ago...

          An intelligent man is sometimes forced to be drunk to spend time with his fools
          If you get confused: Listen to the Music Play
          Please don't Chat/PM me for help, unless mod related
          SG-4860 24.11 | Lab VMs 2.7.2, 24.11

          1 Reply Last reply Reply Quote 1
          • GertjanG
            Gertjan @johnpoz
            last edited by

            @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

            So your having some sort of issue with dnssec.

            When looking up dnssec-failed.org, what would you expect ?

            https://www.internetsociety.org/resources/deploy360/2013/dnssec-test-sites/

            https://dnsviz.net/d/www.dnssec-failed.org/dnssec/

            No "help me" PM's please. Use the forum, the community will thank you.
            Edit : and where are the logs ??

            johnpozJ S 2 Replies Last reply Reply Quote 1
            • johnpozJ
              johnpoz LAYER 8 Global Moderator @Gertjan
              last edited by

              @Gertjan exactly - like I said ;)

              An intelligent man is sometimes forced to be drunk to spend time with his fools
              If you get confused: Listen to the Music Play
              Please don't Chat/PM me for help, unless mod related
              SG-4860 24.11 | Lab VMs 2.7.2, 24.11

              1 Reply Last reply Reply Quote 0
              • R
                RickyBaker @johnpoz
                last edited by

                First, I would like to again apologize for my lack of knowledge. I promise I'm not trying to be difficult or annoying. This is all foreign terminology and concepts to me, but I'm trying my best and can't quantify how much I appreciate the time you're taking

                @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                So your having some sort of issue with dnssec.. I would expect that to fail with that query - that fqdn is test fqdn for making sure dnssec is working.. But we are seeing the servfail reason..

                So now when normal queries fail we might get to the bottom of why your getting servfail vs an answer to what you ask for.

                What do you mean by a normal query? How is this NOT a normal query? (ducks:)) What's the next step you'd like to see to further clarify?

                @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                No what it means is its asking the local cache at 127.0.0.53, your command shows that points to 10.10.10.1

                Clearly went over this already like 6 days ago...

                ahh that makes sense, sorry I missed that earlier. so does this mean i should be constantly trying new websites i don't ever visit to avoid it falling back to local cache? or is that a fundamental misunderstanding of the steps

                @Gertjan said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                When looking up dnssec-failed.org, what would you expect ?

                thank you for the links. It somehow moved me closer AND farther away from understanding. I have AT&T fiber, why did it attempt a comcast run dnssec fail website. Is going to this website something built into the dig command? Also, correct me if I'm wrong, but I believe y'all had me re-enable DNSSEC just cause it was good practice. I can see how this failing is symptomatic of my greater problems but it's odd to me that whats manifesting itself is something I've been told is really optional and best practice, not required.

                @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                that fqdn is test fqdn for making sure dnssec is working.. But we are seeing the servfail reason..

                All of this leaves me a little lost as to next steps. I keep going back to this line. I know what fqdn stands for, but this collection of words together just doesn't make sense to me, and I believe it's the key to understanding what I need to do next. as always, thanks for everything and further guidance would be greatly appreciated.

                johnpozJ 1 Reply Last reply Reply Quote 0
                • S
                  SteveITS Galactic Empire @Gertjan
                  last edited by

                  dnssec-failed.org

                  Just for reference I see SERVFAIL for it via Google or others.

                  >dig dnssec-failed.org @8.8.8.8
                  
                  ; <<>> DiG 9.16.44 <<>> dnssec-failed.org @8.8.8.8
                  ;; global options: +cmd
                  ;; Got answer:
                  ;; ->>HEADER<<- opcode: QUERY, status: SERVFAIL, id: 64906
                  ;; flags: qr rd ra; QUERY: 1, ANSWER: 0, AUTHORITY: 0, ADDITIONAL: 1
                  
                  ;; OPT PSEUDOSECTION:
                  ; EDNS: version: 0, flags:; udp: 512
                  ; EDE: 9 (DNSKEY Missing): (No DNSKEY matches DS RRs of dnssec-failed.org)
                  ;; QUESTION SECTION:
                  ;dnssec-failed.org.             IN      A
                  
                  ;; Query time: 120 msec
                  ;; SERVER: 8.8.8.8#53(8.8.8.8)
                  ;; WHEN: Mon May 13 10:38:02 Central Daylight Time 2024
                  ;; MSG SIZE  rcvd: 97
                  

                  https://bluecatnetworks.com/blog/the-top-four-dns-response-codes-and-what-they-mean/
                  "a SERVFAIL is the DNS server telling you, “Hey, I can’t give you the answer for that query.”"

                  Pre-2.7.2/23.09: Only install packages for your version, or risk breaking it. Select your branch in System/Update/Update Settings.
                  When upgrading, allow 10-15 minutes to restart, or more depending on packages and device speed.
                  Upvote 👍 helpful posts!

                  johnpozJ 1 Reply Last reply Reply Quote 1
                  • johnpozJ
                    johnpoz LAYER 8 Global Moderator @SteveITS
                    last edited by johnpoz

                    @SteveITS well yeah forwarding and trying to do dnssec is going to be problematic.. But that dnssec-failed.org should always fail.. It meant to fail.. As a way to validate your dnssec is working..

                    So yeah if you query any NS that is doing dnssec, google, quad9, etc.. then it would fail.. But if you query some NS that isn't doing dnssec than it would pass..

                    example

                    ; <<>> DiG 9.16.50 <<>> @8.8.8.8 dnssec-failed.org
                    ; (1 server found)
                    ;; global options: +cmd
                    ;; Got answer:
                    ;; ->>HEADER<<- opcode: QUERY, status: SERVFAIL, id: 3602
                    ;; flags: qr rd ra; QUERY: 1, ANSWER: 0, AUTHORITY: 0, ADDITIONAL: 1
                    
                    ;; OPT PSEUDOSECTION:
                    ; EDNS: version: 0, flags:; udp: 512
                    ; EDE: 9 (DNSKEY Missing): (No DNSKEY matches DS RRs of dnssec-failed.org)
                    ;; QUESTION SECTION:
                    ;dnssec-failed.org.             IN      A
                    
                    ;; Query time: 95 msec
                    ;; SERVER: 8.8.8.8#53(8.8.8.8)
                    ;; WHEN: Mon May 13 10:54:20 Central Daylight Time 2024
                    ;; MSG SIZE  rcvd: 97
                    

                    But if say ask something not doing dnssec..

                    $ dig @4.2.2.2 dnssec-failed.org
                    
                    ; <<>> DiG 9.16.50 <<>> @4.2.2.2 dnssec-failed.org
                    ; (1 server found)
                    ;; global options: +cmd
                    ;; Got answer:
                    ;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 39041
                    ;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 0, ADDITIONAL: 1
                    
                    ;; OPT PSEUDOSECTION:
                    ; EDNS: version: 0, flags:; udp: 8192
                    ;; QUESTION SECTION:
                    ;dnssec-failed.org.             IN      A
                    
                    ;; ANSWER SECTION:
                    dnssec-failed.org.      300     IN      A       96.99.227.255
                    
                    ;; Query time: 52 msec
                    ;; SERVER: 4.2.2.2#53(4.2.2.2)
                    ;; WHEN: Mon May 13 10:55:08 Central Daylight Time 2024
                    ;; MSG SIZE  rcvd: 62
                    

                    This is another example where it makes no sense to check to use dnssec if your forwarding.. Either where you forward is doing dnssec already. Most of the major players do, some have some different IPs you can query that don't.. But pretty much all of them do dnssec. If where you forward does not do dnssec, asking for it in unbound settings isn't going to do anything other then more than likely cause failures..

                    An intelligent man is sometimes forced to be drunk to spend time with his fools
                    If you get confused: Listen to the Music Play
                    Please don't Chat/PM me for help, unless mod related
                    SG-4860 24.11 | Lab VMs 2.7.2, 24.11

                    1 Reply Last reply Reply Quote 0
                    • johnpozJ
                      johnpoz LAYER 8 Global Moderator @RickyBaker
                      last edited by

                      @RickyBaker said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                      All of this leaves me a little lost as to next steps.

                      The next step is to wait till you fail again.. You were seeing servfail - but we didn't know why or what was the reason for it. Now that you have enabled logging of servfail details.. Next time you have a problem - we can hope to see why.. And then address that..

                      Also have you updated to 2.7.2 yet? This should be your next step to be honest..

                      An intelligent man is sometimes forced to be drunk to spend time with his fools
                      If you get confused: Listen to the Music Play
                      Please don't Chat/PM me for help, unless mod related
                      SG-4860 24.11 | Lab VMs 2.7.2, 24.11

                      R 2 Replies Last reply Reply Quote 0
                      • R
                        RickyBaker @johnpoz
                        last edited by

                        @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                        Also have you updated to 2.7.2 yet? This should be your next step to be honest..

                        no i have not but I can prioritize. i know it SHOULD be easy and smooth but i'm so nervous. especially with it not updating by itself.

                        @SteveITS said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                        https://bluecatnetworks.com/blog/the-top-four-dns-response-codes-and-what-they-mean/

                        thanks this is a very useful article

                        1 Reply Last reply Reply Quote 0
                        • R
                          RickyBaker @johnpoz
                          last edited by

                          @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                          The next step is to wait till you fail again..

                          and what specific commands should i be running? I assume you don't need the resolvectl one, just "dig www.netgate.com" or www.msn.com?

                          johnpozJ 1 Reply Last reply Reply Quote 0
                          • johnpozJ
                            johnpoz LAYER 8 Global Moderator @RickyBaker
                            last edited by

                            @RickyBaker yeah and looking in the log.. So we can see what it logs for failure if the dig output doesn't show us as much detail on it, etc.

                            An intelligent man is sometimes forced to be drunk to spend time with his fools
                            If you get confused: Listen to the Music Play
                            Please don't Chat/PM me for help, unless mod related
                            SG-4860 24.11 | Lab VMs 2.7.2, 24.11

                            R 2 Replies Last reply Reply Quote 0
                            • R
                              RickyBaker @johnpoz
                              last edited by

                              @johnpoz said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                              yeah and looking in the log.. So we can see what it logs for failure if the dig output doesn't show us as much detail on it, etc.

                              great thanks for the clarification

                              R 1 Reply Last reply Reply Quote 0
                              • R
                                RickyBaker @RickyBaker
                                last edited by

                                @RickyBaker spent all afternoon waiting with my computer for an outage, finally went to bed around 1130pm. Happened right away and resolved by the time i sprinted downstairs. stay tuned

                                R 1 Reply Last reply Reply Quote 0
                                • R
                                  RickyBaker @RickyBaker
                                  last edited by

                                  been hunting non-stop but the network has "unfortunately" been very stable this week. This morning my wife said she was experienceng the DNS NX issue on her phone right when we woke up but when i fired up my phone I wasn't experiencing the problem. Went about my morning and a few minutes later, while on the head without a laptop, it happened to me. I fired up my ssh app and ssh'ed into the plex server and got this:
                                  4ae8f116-5d79-4326-88ba-7db61ae9dde9-image.png

                                  ...but then i realized i was doing it on another machine that may not be experiencing the problem. I don't know why I didn't put that together before but the DNS issue USUALLY affects all devices at once but obviously not always. unfortunately my phone's local ssh session doesn't have the dig command. I'll look at installing it to increase my chance of catching it. Unfortunately I forgot that i only have a few minutes to screenshot the logs before they roll off and I missed it. I'm optimistic i'll catch it this weekend.

                                  R 1 Reply Last reply Reply Quote 0
                                  • R
                                    RickyBaker @RickyBaker
                                    last edited by

                                    @RickyBaker Still hunting, frustratingly the problem has def gotten less frequent an shorter in duration (but still ever present, my wife agrees, i'm not crazy). It's also happening more on individual devices where other devices work fine more often than before. It happened on my PC and when I ran the dig command on my plex debian box it was fine
                                    35945eaa-e74d-4f92-a6fe-97ab9e08ff50-image.png
                                    In the log though I did find this around when I tried the dig command:
                                    64ca1ece-7294-49d1-9de5-c7c5e52f0999-image.png
                                    I also found this which looks shady to me:
                                    2cc00f36-7728-4eb0-b544-b9ecb6cae6c1-image.png

                                    Since it seems to be singular devices at a time now i'm slowing figuring out how to run dig commands on all the different OS's in my house. I have Android and linux and am following a tutorial for Windows now...

                                    R 2 Replies Last reply Reply Quote 0
                                    • R
                                      RickyBaker @RickyBaker
                                      last edited by

                                      @RickyBaker said in DNS_PROBE_FINISHED_NXDOMAIN sporadically for anywhere from 30secs to 10min. works flawlessly at all other times:

                                      following a tutorial for Windows now...
                                      So i was trying to follow this tutorial but when I went to install it said it was already installed (I used the legacy windows version). I had tried to install it before but then typing the dig command still returned a "command not found". The installer suggested i remove the old one from add/remove programs but I couldn't find anything under BIND or ISC and the last installed program was discord a LONG time ago.
                                      d2c25974-fd05-49ac-8c2d-cdc30f4ad623-image.png

                                      I tried to continue with the tutorial but it asked where BIND was installed to add it to the PATH (which I'm sure was my problem the first time around) but I don't know where it's installed and a windows search for BIND or ISC is expectedly noisy. any suggestions?

                                      I'll keep plugging at it but it's an annoying speed bump that's really slowing down the troublshooting...

                                      1 Reply Last reply Reply Quote 0
                                      • R
                                        RickyBaker @RickyBaker
                                        last edited by

                                        Finally got one!!!
                                        a11f2b0d-4e85-4142-a359-69d7e0f7f15c-image.png
                                        I pasted everything in the log back a few minutes here in case the totality of it is useful

                                        https://pastebin.com/w2SGh8P0

                                        d69e124e-ae98-4210-ac36-e018efc38964-image.png

                                        @johnpoz Sorry for the delay in getting this I swear i was trying the whole time. thanks for the patience.

                                        1 Reply Last reply Reply Quote 0
                                        • S SteveITS referenced this topic on
                                        • T The Party of Hell No referenced this topic on
                                        • R
                                          RickyBaker @johnpoz
                                          last edited by

                                          @johnpoz got another one! though it does seem to be happening with a lot less frequency for some reason, i've just gotten better at catching them during the quick window of opportunity:

                                          https://pastebin.com/WjyAukgm

                                          a961d400-b54b-4b2b-98bf-e2b48d4f5c1c-image.png

                                          4331c59c-e8ce-4017-b1c1-18b9bce53af8-image.png

                                          1 Reply Last reply Reply Quote 0
                                          • R
                                            RickyBaker
                                            last edited by

                                            This one got a NXDOMAIN error:

                                            b0fd4629-6122-4e55-9564-a02f4bcc6cd1-image.png

                                            cb230e77-8fdb-4c0b-ab79-9e21fd940514-image.png

                                            S GertjanG 2 Replies Last reply Reply Quote 0
                                            • First post
                                              Last post
                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.