• Categories
  • Recent
  • Tags
  • Popular
  • Users
  • Search
  • Register
  • Login
Netgate Discussion Forum
  • Categories
  • Recent
  • Tags
  • Popular
  • Users
  • Search
  • Register
  • Login

Major DNS Bug 23.01 with Quad9 on SSL

General pfSense Questions
27
185
149.4k
Loading More Posts
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • N
    N0m0fud @Gertjan
    last edited by May 12, 2023, 1:51 PM

    @gertjan Windows 11 after a certain version supports DOT and DOH

    1 Reply Last reply Reply Quote 0
    • J
      JonH @stephenw10
      last edited by May 12, 2023, 9:19 PM

      @stephenw10 The long waits to resolve have plagued me since upgrade to 23.01-Release with python mode & TLS. For the past week+ I've been using unbound/53 with no problems. I updated unbound as soon as I saw Chris's post. For past 2 days I've been back on python mode/853 and it's working well for me. Currently using localhost w/ fallback to dot1 & quad9. Hope this was the 'fix'.

      1 Reply Last reply Reply Quote 1
      • R
        RobbieTT @stephenw10
        last edited by RobbieTT May 14, 2023, 11:21 AM May 13, 2023, 6:38 PM

        @stephenw10 said in Major DNS Bug 23.01 with Quad9 on SSL:

        I would love to see anyone who was hitting this issue repeatedly confirm the ASLR workaround here.

        I don't know the syntax to reverse the ASLR command - anyone?

        I did a crude but repeatable test - hammered a load of name servers, including my pfSense resolver which is pointing at Quad9 using DoT:

        Before the ASLR hack:

        login-to-view

        After the ASLR hack:

        login-to-view

        • Uncached minimums down from 34ms to 9ms
        • Uncached maximums down from 663ms to 392ms
        • Uncached average down from 103ms to 67ms
        • Uncached SD down from 159ms to 90ms

        What's not to like?

        β˜•οΈ

        [NB capturing the random 'pauses' and 'fail to loads' suffered (as described earlier) is much harder to represent]

        J 1 Reply Last reply May 13, 2023, 6:46 PM Reply Quote 0
        • J
          jimp Rebel Alliance Developer Netgate @RobbieTT
          last edited by May 13, 2023, 6:46 PM

          @robbiett said in Major DNS Bug 23.01 with Quad9 on SSL:

          @stephenw10 said in Major DNS Bug 23.01 with Quad9 on SSL:

          I would love to see anyone who was hitting this issue repeatedly confirm the ASLR workaround here.

          I don't know the syntax to reverse the ASLR command - anyone?

          # elfctl /usr/local/sbin/unbound
          File '/usr/local/sbin/unbound' features:
          noaslr          'Disable ASLR' is unset.
          [...]
          # killall -9 unbound
          # elfctl -e +noaslr /usr/local/sbin/unbound
          # elfctl /usr/local/sbin/unbound
          File '/usr/local/sbin/unbound' features:
          noaslr          'Disable ASLR' is set.
          [...]
          # elfctl -e -noaslr /usr/local/sbin/unbound
          # elfctl /usr/local/sbin/unbound
          File '/usr/local/sbin/unbound' features:
          noaslr          'Disable ASLR' is unset.
          [...]
          

          Remember: Upvote with the πŸ‘ button for any user/post you find to be helpful, informative, or deserving of recognition!

          Need help fast? Netgate Global Support!

          Do not Chat/PM for help!

          R 1 Reply Last reply May 14, 2023, 7:57 AM Reply Quote 2
          • R
            RobbieTT @jimp
            last edited by May 14, 2023, 7:57 AM

            @jimp
            Thanks Jim πŸ‘

            1 Reply Last reply Reply Quote 0
            • R
              RobbieTT @stephenw10
              last edited by RobbieTT May 14, 2023, 12:35 PM May 14, 2023, 10:05 AM

              @stephenw10

              I should probably add that even with the ASLR unset I still get weird looking results when I attempt an individual DNS Lookup on a domain name that I know hasn't been cached:

              login-to-view

              If I understand the pfSense diagnostics screen, when the internal DNS resolver has to use forwarding to answer a query I would expect a similar time to answer the query as the fastest responding name server (2629:fe::fe at 7ms in this example) plus the almost negligible processing delay from checking the cache. Yet it actually takes a snooze-worthy 168ms.

              Why does the DNS resolver take 168ms for a simple forwarded (uncached) query when the forwarder itself has an answer from an upstream provider in just 7ms or, in other words, around 24 times slower than expected?

              β˜•οΈ

              M 1 Reply Last reply May 14, 2023, 5:22 PM Reply Quote 0
              • S SteveITS referenced this topic on May 14, 2023, 1:44 PM
              • M
                MoonKnight @RobbieTT
                last edited by MoonKnight May 14, 2023, 5:28 PM May 14, 2023, 5:22 PM

                @robbiett

                Have been wondering about the same for some time now. It doesn't make sense

                login-to-view

                And if you do the same lookup just seconds after the first time "The query time" is on 0.
                Wait 1 minute then back to 60 msec.

                I have been having this behavior since 23.01 and maybe on 22.05 also .

                --- 24.11 ---
                Intel(R) Xeon(R) CPU D-1518 @ 2.20GHz
                Kingston DDR4 2666MHz 16GB ECC
                2 x HyperX Fury SSD 120GB (ZFS-mirror)
                2 x Intel i210 (ports)
                4 x Intel i350 (ports)

                R J 2 Replies Last reply May 14, 2023, 5:37 PM Reply Quote 0
                • R
                  RobbieTT @MoonKnight
                  last edited by May 14, 2023, 5:37 PM

                  @moonknight said in Major DNS Bug 23.01 with Quad9 on SSL:

                  @robbiett
                  And if you do the same lookup just seconds after first time "The query time" is on 0.
                  Wait 1 minute then back to 60 msec.

                  I don't suffer the second part of your observation. Once my query is cached it stays cached until it is removed or reset - it obeys the settings I have given it.

                  If you stop the resolver for a moment and run the command:

                  unbound-control -c /var/unbound/unbound.conf dump_cache

                  ...you can poke around and see what is in your cache.

                  β˜•οΈ

                  M 1 Reply Last reply May 14, 2023, 5:45 PM Reply Quote 1
                  • M
                    MoonKnight @RobbieTT
                    last edited by May 14, 2023, 5:45 PM

                    @robbiett
                    Thanks for the command, I'm going to test I later.
                    But I did enable "Serve Expired" and now the lookup stays on 0 msec on 2nd lookup of the same domain.

                    login-to-view

                    --- 24.11 ---
                    Intel(R) Xeon(R) CPU D-1518 @ 2.20GHz
                    Kingston DDR4 2666MHz 16GB ECC
                    2 x HyperX Fury SSD 120GB (ZFS-mirror)
                    2 x Intel i210 (ports)
                    4 x Intel i350 (ports)

                    1 Reply Last reply Reply Quote 0
                    • J
                      johnpoz LAYER 8 Global Moderator @MoonKnight
                      last edited by May 14, 2023, 5:55 PM

                      @moonknight problem with cnn.com is they have the TTL set to 60 seconds..

                      ;; QUESTION SECTION:
                      ;cnn.com.                       IN      A
                      
                      ;; ANSWER SECTION:
                      cnn.com.                60      IN      A       151.101.67.5
                      cnn.com.                60      IN      A       151.101.195.5
                      cnn.com.                60      IN      A       151.101.131.5
                      cnn.com.                60      IN      A       151.101.3.5
                      

                      So if you forward to somewhere the ttl you can cache is going to be something shorter then 60 seconds, could be 59, could be 2..

                      There is no sane reason for them to have the ttl set so freaking low - other than they want lots of queries.. They charge their customers maybe by queries - that is hosted on aws dns..

                      ;; AUTHORITY SECTION:
                      cnn.com.                3600    IN      NS      ns-1086.awsdns-07.org.
                      cnn.com.                3600    IN      NS      ns-1630.awsdns-11.co.uk.
                      cnn.com.                3600    IN      NS      ns-47.awsdns-05.com.
                      cnn.com.                3600    IN      NS      ns-576.awsdns-08.net.
                      

                      So what you can do on your side is yeah allow for serving expired, and you could also set your min ttl.. I do both, have min ttl of 3600, and serve expired..

                      An intelligent man is sometimes forced to be drunk to spend time with his fools
                      If you get confused: Listen to the Music Play
                      Please don't Chat/PM me for help, unless mod related
                      SG-4860 24.11 | Lab VMs 2.7.2, 24.11

                      M 1 Reply Last reply May 14, 2023, 6:01 PM Reply Quote 2
                      • M
                        MoonKnight @johnpoz
                        last edited by MoonKnight May 14, 2023, 6:29 PM May 14, 2023, 6:01 PM

                        @johnpoz

                        Thanks for the information :)
                        I set "Minimum TTL for RRsets and Messages" to 3600 and seems to work :)

                        --- 24.11 ---
                        Intel(R) Xeon(R) CPU D-1518 @ 2.20GHz
                        Kingston DDR4 2666MHz 16GB ECC
                        2 x HyperX Fury SSD 120GB (ZFS-mirror)
                        2 x Intel i210 (ports)
                        4 x Intel i350 (ports)

                        R 1 Reply Last reply May 14, 2023, 6:28 PM Reply Quote 0
                        • R
                          RobbieTT @MoonKnight
                          last edited by May 14, 2023, 6:28 PM

                          @moonknight
                          Yep, that is the one. I have mine set at 2400 for reasons I read in a technical paper that I have long since forgotten.

                          M S 2 Replies Last reply May 14, 2023, 6:29 PM Reply Quote 1
                          • M
                            MoonKnight @RobbieTT
                            last edited by May 14, 2023, 6:29 PM

                            Thank you very much @robbiett :)

                            --- 24.11 ---
                            Intel(R) Xeon(R) CPU D-1518 @ 2.20GHz
                            Kingston DDR4 2666MHz 16GB ECC
                            2 x HyperX Fury SSD 120GB (ZFS-mirror)
                            2 x Intel i210 (ports)
                            4 x Intel i350 (ports)

                            1 Reply Last reply Reply Quote 0
                            • S
                              SteveITS Galactic Empire @RobbieTT
                              last edited by May 14, 2023, 6:32 PM

                              @robbiett said in Major DNS Bug 23.01 with Quad9 on SSL:

                              for reasons I read in a technical paper that I have long since forgotten

                              The hallmark of so many IT decisions carried forward decades into the future. πŸ˜†

                              Pre-2.7.2/23.09: Only install packages for your version, or risk breaking it. Select your branch in System/Update/Update Settings.
                              When upgrading, allow 10-15 minutes to restart, or more depending on packages and device speed.
                              Upvote πŸ‘ helpful posts!

                              R 1 Reply Last reply May 14, 2023, 7:19 PM Reply Quote 1
                              • R
                                RobbieTT @SteveITS
                                last edited by RobbieTT May 14, 2023, 7:28 PM May 14, 2023, 7:19 PM

                                @steveits said in Major DNS Bug 23.01 with Quad9 on SSL:

                                The hallmark of so many IT decisions carried forward decades into the future. πŸ˜†

                                Yeah, I resemble those remarks - especially as I am so many decades in. Started with a BBC Micro only to see ARM come around again. Next came the Apple IIe and off I went down that path, only to find myself down the line with BSD again.

                                I have forgotten so much along the way!

                                β˜•οΈ

                                1 Reply Last reply Reply Quote 0
                                • W
                                  w0w
                                  last edited by w0w May 15, 2023, 3:39 AM May 15, 2023, 3:37 AM

                                  I've been playing with these two settings for a few months now.
                                  "Minimum TTL for RRsets and Messages"
                                  "Serve Expired"
                                  They have no effect on this issue.
                                  Perhaps however there is a relationship with the use of IPv6. I previously said that I had no more problems on 23.05, however at that moment I did not check that IPv6 worked for me. Due to another bug, IPv6 did not work for me and maybe that's why there was no issue. The other day I restored IPv6 and without ASLR the same issues began as before. Now testing with ASLR disabled….

                                  R 1 Reply Last reply May 15, 2023, 10:37 AM Reply Quote 0
                                  • R
                                    RobbieTT @w0w
                                    last edited by May 15, 2023, 10:37 AM

                                    @w0w said in Major DNS Bug 23.01 with Quad9 on SSL:

                                    "Minimum TTL for RRsets and Messages"
                                    "Serve Expired"
                                    They have no effect on this issue.

                                    I don't think any of us were suggesting that. Of course, if you can get an answer from the DNS cache it does dodge the issue for that particular query and those 2 settings do improve cache performance and make a hit more likely.

                                    Regarding your thoughts on IPv6, that could indeed be part of the issue but unproven. I run IPv6 and most of my traffic tends to use it. I have not tried disabling it as part of the DNS diagnostics but happy to do so if NetGate thinks there is merit in it.

                                    β˜•οΈ

                                    G W 2 Replies Last reply May 15, 2023, 12:13 PM Reply Quote 0
                                    • G
                                      Gertjan @RobbieTT
                                      last edited by May 15, 2023, 12:13 PM

                                      @robbiett said in Major DNS Bug 23.01 with Quad9 on SSL:

                                      Regarding your thoughts on IPv6, that could indeed be part of the issue but unproven. I run IPv6 and most of my traffic tends to use it. I have not tried disabling it as part of the DNS diagnostics but happy to do so if NetGate thinks there is merit in it.

                                      Thing thoughts on my side.
                                      If a ISP delivers a broken IPv4, they wouldn't exist.
                                      If their IPv4 works fine, and they offer a broken IPv6, then thinks become complicated.

                                      Best choice : forget about pfSense, use the ISP router.
                                      IPv6 might be ok then, but "with Quad9 on SSL " will be a no no. Exception : set it up for every LAN device manually.

                                      Or use pfSense, and disable IPv6 on the WAN side.
                                      It's as easy as :

                                      login-to-view

                                      Your LAN devices will still use IPv6 among them, that's ok.

                                      Btw : when this thread started, I disables my perfectly working unbound resolve mode, to activate "Quad9 on SSL using IPv6 and IPv4".
                                      I apply the ASLR (see above) trick.
                                      Up until today (3 weeks now ?) : I did not see any issue what so ever.
                                      I've about 50 LAN devices connected, and a bunch of hotel clients on the captive portal.
                                      No one ever yelled : "Internet doesn't work".

                                      No "help me" PM's please. Use the forum, the community will thank you.
                                      Edit : and where are the logs ??

                                      R 1 Reply Last reply May 15, 2023, 1:26 PM Reply Quote 0
                                      • R
                                        RobbieTT @Gertjan
                                        last edited by May 15, 2023, 1:26 PM

                                        @gertjan said in Major DNS Bug 23.01 with Quad9 on SSL:

                                        Thing thoughts on my side.
                                        If a ISP delivers a broken IPv4, they wouldn't exist.
                                        If their IPv4 works fine, and they offer a broken IPv6, then thinks become complicated.

                                        Best choice : forget about pfSense, use the ISP router.
                                        IPv6 might be ok then, but "with Quad9 on SSL " will be a no no. Exception : set it up for every LAN device manually.

                                        Or use pfSense, and disable IPv6 on the WAN side.

                                        I would have a few issues with that:

                                        • My ISP does not provide a router; even if they did it would not have the CPU power, interfaces and features that I, along with many others, require
                                        • Forgetting about pfSense would be an odd thing to do with a NetGate router
                                        • I have no issue with IPv6 and it is provided universally to UK broadband users - I am also dependant on it
                                        • No link to IPv6 has been proven and NetGate has not offered it as a concern or potential cause of the forwarded DNS-over-TLS issues

                                        β˜•οΈ

                                        G 1 Reply Last reply May 15, 2023, 1:57 PM Reply Quote 0
                                        • G
                                          Gertjan @RobbieTT
                                          last edited by Gertjan May 15, 2023, 1:58 PM May 15, 2023, 1:57 PM

                                          @robbiett said in Major DNS Bug 23.01 with Quad9 on SSL:

                                          My ISP does not provide a router;

                                          That's actually a good thing. No need to to support one neither. They have to concentrate on the "IPv4 RFC"and the same way on "RFC IPv6" and you'll be a happy user.

                                          @robbiett said in Major DNS Bug 23.01 with Quad9 on SSL:

                                          Forgetting about pfSense would be an odd thing to do with a NetGate router

                                          Humm. Didn't thinks about that one, πŸ‘ for you
                                          Anyway : I can't remove my ISP router as nothing else exists to terminate their fiber cable. This means that this ISP router doesn't only have to hand out GUA's for its LAN devices, as pfSense is just another LAN device, but it should also hand over entire /64 prefixes to pfSense, and that's already much more an issue these days.

                                          @robbiett said in Major DNS Bug 23.01 with Quad9 on SSL:

                                          I have no issue with IPv6 and ....

                                          You, maybe not.
                                          Neither your ISP, maybe.
                                          For IPv6 to work, also the entire IPv6 peering needs to work also.

                                          @robbiett said in Major DNS Bug 23.01 with Quad9 on SSL:

                                          No link to IPv6 ....

                                          You're right.
                                          I'm probably 'out of subject'.
                                          I brought it up because, if I use the same unbound (pfSense 23.01) using a Netgate device (4100) and the same "forward" settings to Quad9 over TLS, for 3 weeks, why can't I see any issue ? While others do.

                                          No "help me" PM's please. Use the forum, the community will thank you.
                                          Edit : and where are the logs ??

                                          1 Reply Last reply Reply Quote 0
                                          145 out of 185
                                          • First post
                                            145/185
                                            Last post
                                          Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.