Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    Unbound seems to be restarting frequently

    Scheduled Pinned Locked Moved DHCP and DNS
    178 Posts 43 Posters 96.5k Views 11 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • C Offline
      CiscoKid85
      last edited by

      This sounds like a bug to me.

      1 Reply Last reply Reply Quote 0
      • johnpozJ Online
        johnpoz LAYER 8 Global Moderator
        last edited by

        So your saying unbound restarts on a dhcp request… Yeah I have not seen this, but when I get home I will check.. If a bug then it would be very simple to duplicate.

        An intelligent man is sometimes forced to be drunk to spend time with his fools
        If you get confused: Listen to the Music Play
        Please don't Chat/PM me for help, unless mod related
        SG-4860 25.07.1 | Lab VMs 2.8.1, 25.07.1

        1 Reply Last reply Reply Quote 0
        • K Offline
          kejianshi
          last edited by

          With the proper settings you can break anything.

          1 Reply Last reply Reply Quote 0
          • johnpozJ Online
            johnpoz LAYER 8 Global Moderator
            last edited by

            Well I know for sure I have both those options enabled and have not seen or atleast noticed unbound restarting.. And I would assume I would see lots of dhcp requests.  When I get home I will take a double look and force a request, etc.

            To be honest if this was a real bug you would think the forums would be blowing up with this happening.

            An intelligent man is sometimes forced to be drunk to spend time with his fools
            If you get confused: Listen to the Music Play
            Please don't Chat/PM me for help, unless mod related
            SG-4860 25.07.1 | Lab VMs 2.8.1, 25.07.1

            1 Reply Last reply Reply Quote 0
            • C Offline
              CiscoKid85
              last edited by

              For me it was configuring DNS servers in System -> General Setup, regardless if I had Forwarding Mode disabled/enabled. Simply having external servers caused the restarts for me.

              dparring, what are your System -> General Setup DNS settings? If you remove them and only use local host, do the reboots stop?

              1 Reply Last reply Reply Quote 0
              • D Offline
                dparring
                last edited by

                CiscoKid85 is on to something.  I had Google DNS configured under general settings (8.8.8.8/8.8.4.4).  When I cleared those entries, saved, and redid the test, DHCP did NOT cause unbound to restart.  Even stranger is that when I put the entries back in, the test no longer causes unbound to restart.  However, when I ran my previous tests again, it started happening after I reanabled "Register DHCP leases in the DNS Resolver".  Here is the sequence:

                New DHCP causes unbound restart
                +Removed 8.8.8.8/8.8.4.4 from general settings DNS
                New DHCP does NOT cause unbound restart
                +Added 8.8.8.8/8.8.4.4 back to general settings
                New DHCP does NOT cause unbound restart
                +Rebooted pfSense
                New DHCP does NOT cause unbound restart
                +Unchecked "Register DHCP leases in the DNS Resolver"
                New DHCP does NOT cause unbound restart
                +Checked "Register DHCP leases in the DNS Resolver"
                New DHCP causes unbound restart
                +Removed 8.8.8.8/8.8.4.4 from general settings DNS
                New DHCP does NOT cause unbound restart
                +Unchecked "Register DHCP leases in the DNS Resolver"
                New DHCP does NOT cause unbound restart
                +Checked "Register DHCP leases in the DNS Resolver"
                New DHCP causes unbound restart

                Confusing right?  If I had to guess, there is a bug related to enabling "Register DHCP leases in the DNS Resolver" that causes unbound to restart when dhcpd issues leases, and that making other changes to the DNS system somehow fixes it.  There is another thread on the forum about unbound restarts where people seem to have resolved it by playing with similar settings, so it makes sense that this is only happening in certain difficult to reproduce scenarios.  Also, unless you're digging in the logs or experiencing DNS outages, most people wouldn't even notice this is happening.  I'd be interested to see if anyone else can produce similar findings by enabling and disabling the "Register DHCP leases in the DNS Resolver" setting.

                1 Reply Last reply Reply Quote 0
                • D Offline
                  dparring
                  last edited by

                  One more data point as a direct continuation from the sequence above:

                  +Left general settings exactly the same (dns servers are blank) and hit save
                  New DHCP does NOT cause unbound restart

                  My best guess of the bug at this point:

                  Enabling "Register DHCP leases in the DNS Resolver" causes it
                  Something on the general settings page save sequence fixes it (logs indicate the save kicks off a dhcpd and unbound restart, possibly other actions)

                  1 Reply Last reply Reply Quote 0
                  • D Offline
                    dparring
                    last edited by

                    Even more craziness when I cleared out some of the old DHCP leases from my testing earlier:

                    +Deleted leases using the delete button in DHCP Leases page (this seems to trigger a dhcpd and unbound restart with each delete request)
                    New DHCP causes unbound restart
                    +Left general settings exactly the same (dns servers are blank) and hit save
                    New DHCP does NOT cause unbound restart

                    1 Reply Last reply Reply Quote 0
                    • johnpozJ Online
                      johnpoz LAYER 8 Global Moderator
                      last edited by

                      Well dhcp doesn't seem to be restarting mine..  But now that take a closer look it does seem to have restarted a few time when seems odd that it did.  But must not be happening enough for me to notice.  Last time I looked at the log didn't see any craziness there.. But now there is more restarts than you would think should be there.

                      If its a combinations of things, and something removes the issue like save or no dns in general, etc.  Then sure that makes sense that less people would see it that had the specific settings and combinations of things.

                      Will keep an eye on it more, have not noticed any issue with resolving anything.  But does seem to have been restarting more than it should..

                      Mar 3 04:15:10 unbound: [26324:0] notice: Restart of unbound 1.5.1.
                      Mar 3 04:03:19 unbound: [26324:0] notice: Restart of unbound 1.5.1.
                      Mar 3 03:25:24 unbound: [26324:0] notice: Restart of unbound 1.5.1.
                      Mar 3 03:05:42 unbound: [26324:0] notice: Restart of unbound 1.5.1.
                      Mar 3 01:33:22 unbound: [26324:0] notice: Restart of unbound 1.5.1.
                      Mar 3 01:10:28 unbound: [26324:0] notice: Restart of unbound 1.5.1.
                      Mar 3 00:36:41 unbound: [26324:0] notice: Restart of unbound 1.5.1.
                      Mar 3 00:12:51 unbound: [26324:0] notice: Restart of unbound 1.5.1.

                      If I look in the dhcp log there is lots of dhcp stuff going on with renews and such at 2 in the morning but no restart that matches up to then, etc.  I don't see any dhcp traffic that matches up with these restart times.

                      An intelligent man is sometimes forced to be drunk to spend time with his fools
                      If you get confused: Listen to the Music Play
                      Please don't Chat/PM me for help, unless mod related
                      SG-4860 25.07.1 | Lab VMs 2.8.1, 25.07.1

                      1 Reply Last reply Reply Quote 0
                      • K Offline
                        kejianshi
                        last edited by

                        Hmmm - Strange.  Mine is also often showing "notice: Restart of unbound" now that I take a closer look, but I'm not noticing any performance issues.

                        1 Reply Last reply Reply Quote 0
                        • E Offline
                          edmund
                          last edited by

                          When I upgraded to the current 2.2 release from the previous version I switched to using unbound but I've been finding that it stops working ever now and then - suddenly nothing on the network resolves.  Manually restarting unbound fixes this … until the next time.  I've just switched back to dnsmasq on my work system and rebooted in the hope that this will fix it.

                          I'm seeing this both on my work system and on my home firewall too.  Looking at the status display it seems that unbound is still working - it doesn't show up as stopped, it just doesn't work.  Sorry if these notes aren't very helpful but there does seem to be an issue here.

                          Both firewalls are pretty much vanilla systems, different hardware (DELL and Netgate) with similar configurations.  The only non-standard thing about them both is that I have two WAN connections on each machine - other than that they are pretty boring configurations.

                          1 Reply Last reply Reply Quote 0
                          • D Offline
                            dparring
                            last edited by

                            edmund, that is basically the same behavior I see.  Unbound restarts frequently but it generally doesn't affect anything; only occasionally does it stop resolving.  When that happens, the service shows as running but it just doesn't resolve properly.  My nagios monitor reports it as "DNS CRITICAL - 0.129 seconds response time (No ANSWER SECTION found)" when it happens.  A manual restart via the webgui, or even just waiting for unbound to restart itself will fix it.  I don't know if this is directly related to the DHCP bug or if it is just a consequence of the service restarting so frequently.  I'm running pfSense in a VM on ESXi with a single WAN connection.

                            Another followup to my testing from yesterday:  Unbound eventually resumed its restart behavior a few hours after I had "solved" it by pushing the save button on the general settings page.  According to the logs, it looked like it resumed after dhcp did a routine write of the leases file to disk.  I still haven't changed any settings in the system 20 hours later, and unbound is restarting on some, but not all, DHCPREQUEST events.  Interestingly, in the current system state my test scenario (new MAC requesting DHCP) currently doesn't trigger an unbound restart like it did before.  So hitting save under general settings isn't a perfect fix but it seems to get it into a slightly more reliable state.

                            This is my best understanding of the issue so far:

                            • Enabling "Register DHCP leases in the DNS Resolver" reliably puts unbound into a state where it restarts on brand new DHCP leases
                            • Pressing the save button on the general settings screen seems to stop unbound from restarting on DHCP requests for a short time
                            • However, unbound still manages to get into a slightly unstable state in the course of normal dhcpd activities, possibly precipitated by dhcpd writing to the leases file
                            1 Reply Last reply Reply Quote 0
                            • dennypageD Offline
                              dennypage
                              last edited by

                              Just checking… did you did turn on "Harden Glue" and "Harden DNSSEC data"?

                              There have been a couple threads about Unbound ceasing to resolve if these were not enabled.

                              @edmund:

                              When I upgraded to the current 2.2 release from the previous version I switched to using unbound but I've been finding that it stops working ever now and then - suddenly nothing on the network resolves.  Manually restarting unbound fixes this … until the next time.  I've just switched back to dnsmasq on my work system and rebooted in the hope that this will fix it.

                              1 Reply Last reply Reply Quote 0
                              • E Offline
                                edmund
                                last edited by

                                @dennypage:

                                Just checking… did you did turn on "Harden Glue" and "Harden DNSSEC data"?

                                No - both unchecked, my general philosophy is not to check boxes unless there's a good reason and I didn't think that either of these were relevant.  So unbound was running with the defaults.  They get rather upset at work if the resolver goes walkabout so I'll leave unbound disabled here and see what the home configuration is doing when I get home tonight.

                                1 Reply Last reply Reply Quote 0
                                • K Offline
                                  kejianshi
                                  last edited by

                                  Those defaults are already being changed for next release I believe - because they matter…

                                  1 Reply Last reply Reply Quote 0
                                  • D Offline
                                    dparring
                                    last edited by

                                    I enabled both the Harden Glue and Harden DNSSEC data options (it looks like these are best practices that should be enabled by default).  However, this does not appear to have an effect on the unbound restart behavior.  I've observed that even if unbound is acting somewhat stable and not restarting on every DHCPREQUEST, the following sequence by dhcpd appears to always trigger a restart:

                                    Mar 4 12:25:17 dhcpd: Wrote 22 leases to leases file.
                                    Mar 4 12:25:17 dhcpd: Wrote 0 new dynamic host decls to leases file.
                                    Mar 4 12:25:17 dhcpd: Wrote 0 deleted host decls to leases file.

                                    This seems to happen on a regular basis, perhaps hourly based on the logs.  I'm guessing this is a routine operation of dhcpd, although I don't know if unbound's expected behavior is to also restart as part of this operation.

                                    1 Reply Last reply Reply Quote 0
                                    • E Offline
                                      edmund
                                      last edited by

                                      @kejianshi:

                                      Those defaults are already being changed for next release I believe - because they matter…

                                      That's interesting - so I can select Harden Glue and Harden DNSSEC data in Advanced settings without actually Enabling DNSSEC in General Settings?  Seems a little odd to me…

                                      Maybe I should also check Enable DNSSEC although it doesn't seem to be required by the interface?

                                      1 Reply Last reply Reply Quote 0
                                      • K Offline
                                        kejianshi
                                        last edited by

                                        I haven't tried without selecting DNSSEC since it doesn't make much sense, but if you can click that strange combo of buttons, I'd call that a bug.  haha

                                        Not sure if we should call it a user bug or interface bug (-;

                                        1 Reply Last reply Reply Quote 0
                                        • E Offline
                                          edmund
                                          last edited by

                                          I agree, it doesn't make any sense to do that but the interface does allow it - and probably shouldn't but I have not idea what the logic is behind the GUI.  From a human interface POV there's just too many boxes to check on unbound.

                                          1 Reply Last reply Reply Quote 0
                                          • K Offline
                                            kejianshi
                                            last edited by

                                            x.y.z versions of pfsense are usually very well sorted out.

                                            the ones with just x.y (2 digits) usually pretty solid but still being polished abit.

                                            1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post
                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.