• Categories
  • Recent
  • Tags
  • Popular
  • Users
  • Search
  • Register
  • Login
Netgate Discussion Forum
  • Categories
  • Recent
  • Tags
  • Popular
  • Users
  • Search
  • Register
  • Login

Major DNS Bug 23.01 with Quad9 on SSL

General pfSense Questions
27
185
149.6k
Loading More Posts
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • J
    jimp Rebel Alliance Developer Netgate @Isotope1842
    last edited by Mar 15, 2023, 4:03 PM

    @isotope1842 said in Major DNS Bug 23.01 with Quad9 on SSL:

    @jimp Bullet number four of the previously included screenshot explicitly directs the user to check that setting. The pfSense documentation is actually more than advising. It is directing.

    The page you quoted is a document about configuring DNS over TLS -- it's saying to check that if you want DNS over TLS. That's what that entire document is for.

    It's not a part of the general setup or DNS resolver docs and so on. It's a recipe for users who are interested in that feature and want to know how to set it up.

    There is nothing saying users should be following all of those recipes, they're there for reference for things users may want to do.

    Remember: Upvote with the 👍 button for any user/post you find to be helpful, informative, or deserving of recognition!

    Need help fast? Netgate Global Support!

    Do not Chat/PM for help!

    I 1 Reply Last reply Mar 15, 2023, 4:10 PM Reply Quote 0
    • I
      Isotope1842 @jimp
      last edited by Isotope1842 Mar 15, 2023, 4:16 PM Mar 15, 2023, 4:10 PM

      @jimp The context in which the instructions are provided is exactly for users who want to use an upstream DNS over TLS provider. The original poster here was reporting a bug in using Quad9 and I reported the same bug when using Cloudflare.

      See the top of the same linked documentation:

      "Pick a DNS over TLS upstream provider, such as a private upstream DNS server or a public service like Cloudflare, Quad9, or Google public DNS."

      Following the instructions prior to 23.01 worked. Immediately after upgrading to 23.01, DNS fails until unchecking a single setting.

      1 Reply Last reply Reply Quote 0
      • J
        jimp Rebel Alliance Developer Netgate
        last edited by Mar 15, 2023, 4:16 PM

        Yes, and? They still work for that and for many providers and users who want to enable that feature.

        But you're trying to imply this is something the docs have told everyone they should be doing which isn't true. They don't advise everyone to do it, just people who are interested in that feature.

        But none of this is helpful. We still need more information about how and why it's failing. We have yet to be able to reproduce this in a lab environment, and there are plenty of us running DNS over TLS without problems.

        Remember: Upvote with the 👍 button for any user/post you find to be helpful, informative, or deserving of recognition!

        Need help fast? Netgate Global Support!

        Do not Chat/PM for help!

        I 1 Reply Last reply Mar 15, 2023, 4:19 PM Reply Quote 0
        • I
          Isotope1842 @jimp
          last edited by Mar 15, 2023, 4:19 PM

          @jimp Steps to reproduce the problem:

          1. Install pfSense 22.05 on a netgate device.
          2. Configure DNS over TLS with Cloudflare.
          3. Upgrade netgate device to pfSense 23.01.
          4. Observe broken DNS for downstream clients.
          J 1 Reply Last reply Mar 15, 2023, 4:23 PM Reply Quote 0
          • J
            jimp Rebel Alliance Developer Netgate @Isotope1842
            last edited by Mar 15, 2023, 4:23 PM

            @isotope1842 said in Major DNS Bug 23.01 with Quad9 on SSL:

            @jimp Steps to reproduce the problem:

            1. Install pfSense 22.05 on a netgate device.
            2. Configure DNS over TLS with Cloudflare.
            3. Upgrade netgate device to pfSense 23.01.
            4. Observe broken DNS for downstream clients.

            It is not that simple.

            I have multiple lab VMs using DNS over TLS to Cloudflare and Quad9 that successfully resolve and have no problems.

            Remember: Upvote with the 👍 button for any user/post you find to be helpful, informative, or deserving of recognition!

            Need help fast? Netgate Global Support!

            Do not Chat/PM for help!

            I 1 Reply Last reply Mar 16, 2023, 2:43 PM Reply Quote 0
            • S
              stephenw10 Netgate Administrator
              last edited by Mar 15, 2023, 4:28 PM

              Mmm, I assume you are seeing the same intermittent behaviour as other users? It's not failing for every query with that configuration?

              1 Reply Last reply Reply Quote 0
              • I
                Isotope1842 @jimp
                last edited by Mar 16, 2023, 2:43 PM

                @jimp I just re-checked that single setting. DNS appears to continue to work. Curious to see whether it starts to fail again at some point.

                J S 2 Replies Last reply Mar 16, 2023, 2:49 PM Reply Quote 0
                • J
                  jimp Rebel Alliance Developer Netgate @Isotope1842
                  last edited by Mar 16, 2023, 2:49 PM

                  @isotope1842 said in Major DNS Bug 23.01 with Quad9 on SSL:

                  @jimp I just re-checked that single setting. DNS appears to continue to work. Curious to see whether it starts to fail again at some point.

                  Since you hit it once it's likely to fail again at some point, but nobody has yet to be able to pinpoint exactly when/why it happens.

                  I've been periodically checking my lab systems and they all just keep resolving no matter what I do. But they are lab systems so the load is considerably lower than it would be in a live environment.

                  Remember: Upvote with the 👍 button for any user/post you find to be helpful, informative, or deserving of recognition!

                  Need help fast? Netgate Global Support!

                  Do not Chat/PM for help!

                  1 Reply Last reply Reply Quote 0
                  • S
                    SteveITS Galactic Empire @Isotope1842
                    last edited by Mar 16, 2023, 3:05 PM

                    @isotope1842 There are a few threads on this topic, or variations thereof, and in another one someone posted their problem seemed likely to happen when opening a group/folder of bookmarks/favorites at once...implying a higher number of simultaneous requests might trigger it.

                    I was also unable to replicate my issue by simply (re)checking the DNSSEC option, but I left it off as recommended.

                    Pre-2.7.2/23.09: Only install packages for your version, or risk breaking it. Select your branch in System/Update/Update Settings.
                    When upgrading, allow 10-15 minutes to restart, or more depending on packages and device speed.
                    Upvote 👍 helpful posts!

                    J 1 Reply Last reply Mar 21, 2023, 6:27 AM Reply Quote 0
                    • N
                      nononono
                      last edited by nononono Mar 16, 2023, 4:17 PM Mar 16, 2023, 4:16 PM

                      After playing alot more - it might be an issue with Quad9's TLS DNS limiting responses more than anything.

                      While there is nothing helpful in the pfsense logs, Quad9 just appears to stop replying and then start responding again - almost as if there is a limit being imposed by Quad9 on requests - but of course pfsense must have some role as it never occurred before 23.01

                      This network is a very high traffic network, so maybe others that see the same thing manage high traffic networks as well - either way the only long term solution has been doing TLS DNS through Cloudflare

                      As another point - DHCP lease registrations is definitely not fixed as claimed in 23.01, unbound still likes to reboot too much to consider enabling it - as such at the most I am still only registering the clients that need it via static mapping.

                      C J 2 Replies Last reply Mar 16, 2023, 6:08 PM Reply Quote 1
                      • S
                        stephenw10 Netgate Administrator
                        last edited by Mar 16, 2023, 4:27 PM

                        Hmm, that would explain why we haven't been able to replicate it in test setups without the loading a production box has.

                        1 Reply Last reply Reply Quote 0
                        • C
                          cmcdonald Netgate Developer @nononono
                          last edited by Mar 16, 2023, 6:08 PM

                          @nononono

                          There were memory leaks and a segfault that were fixed. There is still ongoing work to eliminate the unbound reloads associated with lease registration events. I'm hopeful it will make it in 23.05.

                          Need help fast? https://www.netgate.com/support

                          N 1 Reply Last reply Mar 16, 2023, 6:09 PM Reply Quote 3
                          • N
                            nononono @cmcdonald
                            last edited by Mar 16, 2023, 6:09 PM

                            @cmcdonald said in Major DNS Bug 23.01 with Quad9 on SSL:

                            @nononono

                            There were memory leaks and a segfault that were fixed. There is still ongoing work to eliminate the unbound reloads associated with lease registration events. I'm hopeful it will make it in 23.05.

                            That is great to hear! Hopefully that finally is solved

                            1 Reply Last reply Reply Quote 0
                            • J
                              johnpoz LAYER 8 Global Moderator @nononono
                              last edited by Mar 16, 2023, 6:13 PM

                              @nononono said in Major DNS Bug 23.01 with Quad9 on SSL:

                              DHCP lease registrations is definitely not fixed as claimed in 23.01

                              Where is that stated? What it says is a "crash" was fixed when unbound restarting

                              https://redmine.pfsense.org/issues/11316?#note-79

                              Where does it say that unbound is not going to restart on dhcp registrations.. The issue with unbound restarting every time there is something going on with dhcp is ok for some users.. but if you have clients renewing lever few minutes then your still going to have issues with unbound and dhcp - even if they fixed some crash that could happen.

                              An intelligent man is sometimes forced to be drunk to spend time with his fools
                              If you get confused: Listen to the Music Play
                              Please don't Chat/PM me for help, unless mod related
                              SG-4860 24.11 | Lab VMs 2.7.2, 24.11

                              S 1 Reply Last reply Mar 16, 2023, 6:25 PM Reply Quote 0
                              • S
                                SteveITS Galactic Empire @johnpoz
                                last edited by Mar 16, 2023, 6:25 PM

                                @johnpoz said in Major DNS Bug 23.01 with Quad9 on SSL:

                                What it says is a "crash" was fixed when unbound restarting

                                It may be referring to https://docs.netgate.com/pfsense/en/latest/releases/23-01.html "A long-standing difficult-to-reproduce crash in Unbound...It is now safe again to enable DHCP registration alongside Unbound Python mode in pfBlockerNG." ...which is talking about pfBlocker vs DHCP, not that DHCP registration won't restart Unbound anymore. (I've seen others post about this sentence as well...)

                                Pre-2.7.2/23.09: Only install packages for your version, or risk breaking it. Select your branch in System/Update/Update Settings.
                                When upgrading, allow 10-15 minutes to restart, or more depending on packages and device speed.
                                Upvote 👍 helpful posts!

                                1 Reply Last reply Reply Quote 0
                                • J
                                  JonH @SteveITS
                                  last edited by Mar 21, 2023, 6:27 AM

                                  @steveits said in Major DNS Bug 23.01 with Quad9 on SSL:

                                  someone posted their problem seemed likely to happen when opening a group/folder of bookmarks/favorites at once...implying a higher number of simultaneous requests might trigger it.

                                  Playing around with various settings, and I don't know if this has anything to do with the quoted comment, but I raised incoming and outgoing tcp buffers in resolver adv settings from default 10 to 20. I believe (or imagine) that it has reduced the frequency of unbound 'hangs' on my system.

                                  1 Reply Last reply Reply Quote 1
                                  • S SteveITS referenced this topic on Mar 21, 2023, 12:46 PM
                                  • J
                                    joedan @nononono
                                    last edited by Mar 26, 2023, 8:22 AM

                                    @nononono

                                    Just thought I would comment on this as I am seeing the same symptoms. It only just started happening recently and thought it was load related but not much has changed on my end except upgrading PFSense and associated packages.

                                    I have turned parameter off this for now... Use SSL/TLS for outgoing DNS Queries to Forwarding Servers as I need the stability.

                                    Summary:

                                    • Using Cloudflare SSL/TLS on 23.01 (ipv4 / ipv6 resolvers)
                                    • Build: 23.01-RELEASE (amd64) built on Fri Feb 10 20:06:33 UTC 2023
                                    • System Patches package with recommended fixes applied including a manual patch for redmine #13851
                                    • Machine Core(TM) i5-8365U, 16GB Ram, 128GB SSD
                                    • Internet Connection 100/20
                                    • Running a number of wireguard gateways (3 servers, both ipv4 and 6)
                                    • Running a number of packages including...
                                      e257f7ae-ab01-4aa5-91ef-1c449961926f-image.png

                                    According to PFBlockerNG graph, my dns reply stats are around 10-20K per hour during the busiest times.

                                    Over the last week or two, uptime Kuma randomly alerted me to dns issues which is the same time my family complained they couldn't access any websites or their game would fail stating their was no internet. This would happen randomly and would take a couple of minutes to clear up again.

                                    6aa79f4f-4ceb-4366-8bd1-ba147a3abf0b-image.png

                                    310ac4e7-91e3-4a68-951e-4ad8dc03ca93-image.png

                                    Fortunately I had graylog setup and unbound in debug mode for the most recent errors above.

                                    Looking at the logs I note the following messages around the time where DNS briefly cut out...

                                    5a24ac71-1017-42a2-bfec-b70abba94bc9-image.png

                                    Looks like PfSense is not processing DNS requests quick enough for me. The sample above occurred whilst the internet link wasn't even loaded, neither was my machine.

                                    I have turned this setting off and will monitor... Use SSL/TLS for outgoing DNS Queries to Forwarding Servers

                                    I haven't done any further debugging and I hope this helps someone out, if not I apologise for this long post.

                                    J 1 Reply Last reply Mar 26, 2023, 11:55 AM Reply Quote 0
                                    • J
                                      johnpoz LAYER 8 Global Moderator @joedan
                                      last edited by Mar 26, 2023, 11:55 AM

                                      @joedan said in Major DNS Bug 23.01 with Quad9 on SSL:

                                      10-20K per hour during the busiest times.

                                      In your home? how many devices / users - you have a lot shit banging its head trying to resolve stuff your blocking.. That seems really high for a home..

                                      That is what like 240K to 480K queries in 24 hours.. I see like 50k in 24 hours, and that is with 28 active devices doing dns.. Many of them banging their heads looking for shit that is blocked.

                                      queries.jpg

                                      An intelligent man is sometimes forced to be drunk to spend time with his fools
                                      If you get confused: Listen to the Music Play
                                      Please don't Chat/PM me for help, unless mod related
                                      SG-4860 24.11 | Lab VMs 2.7.2, 24.11

                                      J 1 Reply Last reply Apr 4, 2023, 5:40 AM Reply Quote 0
                                      • S
                                        skogs
                                        last edited by Mar 26, 2023, 7:08 PM

                                        I also had occasional dns failures when using quad9 dns. I simply turned off forwarding mode and have had no issues with root servers doing the work.
                                        I find this to be non-ideal, but at least functional.

                                        JeGrJ 1 Reply Last reply Apr 4, 2023, 1:41 PM Reply Quote 0
                                        • N
                                          n8rfe
                                          last edited by Apr 4, 2023, 5:23 AM

                                          Just reporting in here that I have the same issues with 23.01 and Quad 9 DNS when using unbound DNS

                                          My settings were as follows on 22.05 since 22.05 was released with no issues with internet and DNS resolution.

                                          • pfsense ubound DNS resolver enabled

                                          • Using forwarding to Quad 9 DNS servers

                                          • DNSSEC unticked

                                          • Use SSL/TLS for outgoing DNS Queries to Forwarding Servers TICKED

                                          Upgraded to 23.01 this weekend just gone 2nd April 2023. No other settings have changed on LAN network and no other firmware's have been upgraded on any devices on network so I am confident that the only change was the upgrade to pfsense from 22.05 TO 23.01.

                                          Everything worked fine yesterday 3rd April until this morning when I now get multiple DNS resolution issues and cannot reach websites.

                                          I have now changed the following single setting.

                                          • Use SSL/TLS for outgoing DNS Queries to Forwarding Servers UNTICKED

                                          Websites are now resolving correctly.

                                          I will continue to monitor DNS resolution over the coming days to see if the issue reappears and report back.

                                          N 1 Reply Last reply Apr 5, 2023, 10:12 AM Reply Quote 0
                                          • First post
                                            Last post
                                          Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.