Netgate Discussion Forum

    Unbound seems to be restarting frequently

    DHCP and DNS
    178 Posts 43 Posters 96.4k Views 11 Watching
    • johnpoz LAYER 8 Global Moderator

      Well, DHCP doesn't seem to be restarting mine. But now that I take a closer look, it does seem to have restarted a few times when it seems odd that it did. It must not be happening often enough for me to notice. The last time I looked at the log I didn't see any craziness there, but now there are more restarts than you would think should be there.

      If it's a combination of things, and something removes the issue, like a save or no DNS in general, etc., then sure, it makes sense that fewer people would see it: only those with the specific settings and combination of things.

      I will keep a closer eye on it. I have not noticed any issue with resolving anything, but it does seem to have been restarting more than it should.

      Mar 3 04:15:10 unbound: [26324:0] notice: Restart of unbound 1.5.1.
      Mar 3 04:03:19 unbound: [26324:0] notice: Restart of unbound 1.5.1.
      Mar 3 03:25:24 unbound: [26324:0] notice: Restart of unbound 1.5.1.
      Mar 3 03:05:42 unbound: [26324:0] notice: Restart of unbound 1.5.1.
      Mar 3 01:33:22 unbound: [26324:0] notice: Restart of unbound 1.5.1.
      Mar 3 01:10:28 unbound: [26324:0] notice: Restart of unbound 1.5.1.
      Mar 3 00:36:41 unbound: [26324:0] notice: Restart of unbound 1.5.1.
      Mar 3 00:12:51 unbound: [26324:0] notice: Restart of unbound 1.5.1.

      If I look in the DHCP log, there is lots of DHCP activity going on with renews and such at 2 in the morning, but no restart that matches up with it. I don't see any DHCP traffic that matches up with these restart times.
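For anyone who wants to put numbers on "more restarts than you would think", here is a rough Python sketch that pulls the restart timestamps out of the lines above and prints the gaps between them. The function names are made up for illustration, and since syslog lines carry no year, the `year` default is only an assumption.

```python
from datetime import datetime

# Restart lines copied from the log excerpt above (newest first).
LOG = """\
Mar 3 04:15:10 unbound: [26324:0] notice: Restart of unbound 1.5.1.
Mar 3 04:03:19 unbound: [26324:0] notice: Restart of unbound 1.5.1.
Mar 3 03:25:24 unbound: [26324:0] notice: Restart of unbound 1.5.1.
Mar 3 03:05:42 unbound: [26324:0] notice: Restart of unbound 1.5.1.
Mar 3 01:33:22 unbound: [26324:0] notice: Restart of unbound 1.5.1.
Mar 3 01:10:28 unbound: [26324:0] notice: Restart of unbound 1.5.1.
Mar 3 00:36:41 unbound: [26324:0] notice: Restart of unbound 1.5.1.
Mar 3 00:12:51 unbound: [26324:0] notice: Restart of unbound 1.5.1.
"""


def restart_times(text, year=2015):
    """Return the timestamps of all unbound restart lines, oldest first."""
    times = []
    for line in text.splitlines():
        if "unbound:" in line and "Restart of unbound" in line:
            month, day, clock = line.split()[:3]
            # The log has no year, so the assumed one is prepended here.
            stamp = f"{year} {month} {day} {clock}"
            times.append(datetime.strptime(stamp, "%Y %b %d %H:%M:%S"))
    return sorted(times)


def gaps_in_seconds(times):
    """Return the whole seconds between each pair of consecutive restarts."""
    return [int((b - a).total_seconds()) for a, b in zip(times, times[1:])]


if __name__ == "__main__":
    print(gaps_in_seconds(restart_times(LOG)))
```

On the eight lines above, the shortest gap works out to just under 12 minutes and the longest to roughly an hour and a half, so the restarts are not on any obvious fixed schedule.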

      An intelligent man is sometimes forced to be drunk to spend time with his fools
      If you get confused: Listen to the Music Play
      Please don't Chat/PM me for help, unless mod related
      SG-4860 25.07.1 | Lab VMs 2.8.1, 25.07.1

      • kejianshi

        Hmmm - Strange.  Mine is also often showing "notice: Restart of unbound" now that I take a closer look, but I'm not noticing any performance issues.

        • edmund

          When I upgraded to the current 2.2 release from the previous version I switched to using unbound, but I've been finding that it stops working every now and then - suddenly nothing on the network resolves.  Manually restarting unbound fixes this … until the next time.  I've just switched back to dnsmasq on my work system and rebooted in the hope that this will fix it.

          I'm seeing this both on my work system and on my home firewall too.  Looking at the status display it seems that unbound is still working - it doesn't show up as stopped, it just doesn't work.  Sorry if these notes aren't very helpful but there does seem to be an issue here.

          Both firewalls are pretty much vanilla systems, different hardware (DELL and Netgate) with similar configurations.  The only non-standard thing about them both is that I have two WAN connections on each machine - other than that they are pretty boring configurations.

          • dparring

            edmund, that is basically the same behavior I see.  Unbound restarts frequently but it generally doesn't affect anything; only occasionally does it stop resolving.  When that happens, the service shows as running but it just doesn't resolve properly.  My nagios monitor reports it as "DNS CRITICAL - 0.129 seconds response time (No ANSWER SECTION found)" when it happens.  A manual restart via the webgui, or even just waiting for unbound to restart itself will fix it.  I don't know if this is directly related to the DHCP bug or if it is just a consequence of the service restarting so frequently.  I'm running pfSense in a VM on ESXi with a single WAN connection.
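The "running but not resolving" state can be detected without Nagios. Below is a minimal Python sketch, not the Nagios plugin itself, that sends one A-record query over UDP and reports whether the reply has an answer section. The function names, the `example.com` test name and the 2-second timeout are assumptions chosen for illustration.

```python
import socket
import struct


def build_query(name, qtype=1, query_id=0x1234):
    """Build a minimal DNS query (qtype 1 = A record, class IN, recursion desired)."""
    header = struct.pack("!HHHHHH", query_id, 0x0100, 1, 0, 0, 0)
    labels = name.rstrip(".").split(".")
    qname = b"".join(bytes([len(p)]) + p.encode("ascii") for p in labels)
    return header + qname + b"\x00" + struct.pack("!HH", qtype, 1)


def answer_count(response):
    """Return (rcode, ancount) read from the 12-byte DNS response header."""
    _id, flags, _qd, ancount, _ns, _ar = struct.unpack("!HHHHHH", response[:12])
    return flags & 0x000F, ancount


def check_resolver(server, name="example.com", timeout=2.0):
    """Query `server` once and return a short status string."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.settimeout(timeout)
        try:
            sock.sendto(build_query(name), (server, 53))
            data, _ = sock.recvfrom(512)
        except OSError:
            # Covers timeouts as well as unreachable-port errors.
            return "CRITICAL: no response"
    rcode, ancount = answer_count(data)
    if rcode != 0:
        return f"CRITICAL: rcode {rcode}"
    if ancount == 0:
        return "CRITICAL: no answer section"
    return "OK"
```

Calling `check_resolver("192.168.1.1")` from a LAN host (with your firewall's LAN address substituted) every minute or so would show whether the resolver stops answering while the service still shows as running.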

            Another follow-up to my testing from yesterday:  Unbound eventually resumed its restart behavior a few hours after I had "solved" it by pushing the save button on the general settings page.  According to the logs, it looked like it resumed after dhcpd did a routine write of the leases file to disk.  I still haven't changed any settings in the system 20 hours later, and unbound is restarting on some, but not all, DHCPREQUEST events.  Interestingly, in the current system state my test scenario (a new MAC requesting DHCP) doesn't trigger an unbound restart like it did before.  So hitting save under general settings isn't a perfect fix, but it seems to get it into a slightly more reliable state.

            This is my best understanding of the issue so far:

            • Enabling "Register DHCP leases in the DNS Resolver" reliably puts unbound into a state where it restarts on brand new DHCP leases
            • Pressing the save button on the general settings screen seems to stop unbound from restarting on DHCP requests for a short time
            • However, unbound still manages to get into a slightly unstable state in the course of normal dhcpd activities, possibly precipitated by dhcpd writing to the leases file

            • dennypage

              Just checking… did you turn on "Harden Glue" and "Harden DNSSEC data"?

              There have been a couple threads about Unbound ceasing to resolve if these were not enabled.

              @edmund:

              When I upgraded to the current 2.2 release from the previous version I switched to using unbound, but I've been finding that it stops working every now and then - suddenly nothing on the network resolves.  Manually restarting unbound fixes this … until the next time.  I've just switched back to dnsmasq on my work system and rebooted in the hope that this will fix it.

              • edmund

                @dennypage:

                Just checking… did you turn on "Harden Glue" and "Harden DNSSEC data"?

                No, both are unchecked. My general philosophy is not to check boxes unless there's a good reason, and I didn't think that either of these was relevant, so unbound was running with the defaults.  They get rather upset at work if the resolver goes walkabout, so I'll leave unbound disabled here and see what the home configuration is doing when I get home tonight.

                • kejianshi

                  Those defaults are already being changed for next release I believe - because they matter…

                  • dparring

                    I enabled both the Harden Glue and Harden DNSSEC data options (it looks like these are best practices that should be enabled by default).  However, this does not appear to have an effect on the unbound restart behavior.  I've observed that even if unbound is acting somewhat stable and not restarting on every DHCPREQUEST, the following sequence by dhcpd appears to always trigger a restart:

                    Mar 4 12:25:17 dhcpd: Wrote 22 leases to leases file.
                    Mar 4 12:25:17 dhcpd: Wrote 0 new dynamic host decls to leases file.
                    Mar 4 12:25:17 dhcpd: Wrote 0 deleted host decls to leases file.

                    This seems to happen on a regular basis, perhaps hourly based on the logs.  I'm guessing this is a routine operation of dhcpd, although I don't know if unbound's expected behavior is to also restart as part of this operation.
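To check that pattern against a longer stretch of logs, something like the following Python sketch could pair each dhcpd "Wrote N leases" line with any unbound restart that follows within a few seconds. The function names, the 5-second window and the `year` default are assumptions for illustration; the line formats are the ones quoted in this thread.

```python
from datetime import datetime, timedelta


def _stamp(line, year):
    """Parse the leading 'Mon D HH:MM:SS' syslog timestamp of a line."""
    month, day, clock = line.split()[:3]
    return datetime.strptime(f"{year} {month} {day} {clock}", "%Y %b %d %H:%M:%S")


def restarts_after_lease_writes(dhcp_lines, unbound_lines, window_s=5, year=2015):
    """Map each lease-file write time to the unbound restarts within window_s after it."""
    writes = [_stamp(line, year) for line in dhcp_lines
              if "dhcpd:" in line and "leases to leases file" in line]
    restarts = [_stamp(line, year) for line in unbound_lines
                if "Restart of unbound" in line]
    window = timedelta(seconds=window_s)
    return {
        w: [r for r in restarts if timedelta(0) <= r - w <= window]
        for w in writes
    }
```

Feed it the lines from the DHCP log and the resolver log. If every write maps to a non-empty list, the lease-file write is a reliable trigger; writes with an empty list are the cases where unbound stayed up.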

                    • edmund

                      @kejianshi:

                      Those defaults are already being changed for next release I believe - because they matter…

                      That's interesting - so I can select Harden Glue and Harden DNSSEC data in Advanced settings without actually Enabling DNSSEC in General Settings?  Seems a little odd to me…

                      Maybe I should also check Enable DNSSEC although it doesn't seem to be required by the interface?

                      • kejianshi

                        I haven't tried without selecting DNSSEC since it doesn't make much sense, but if you can click that strange combo of buttons, I'd call that a bug.  haha

                        Not sure if we should call it a user bug or interface bug (-;

                        • edmund

                          I agree, it doesn't make any sense to do that, but the interface does allow it - and probably shouldn't, but I have no idea what the logic is behind the GUI.  From a human interface POV there's just too many boxes to check on unbound.

                          • kejianshi

                            x.y.z versions of pfSense are usually very well sorted out.

                            The ones with just x.y (2 digits) are usually pretty solid but still being polished a bit.

                            • cmb

                              @kejianshi:

                              I haven't tried without selecting DNSSEC since it doesn't make much sense, but if you can click that strange combo of buttons, I'd call that a bug.  haha

                              Not sure if we should call it a user bug or interface bug (-;

                              It doesn't make Unbound fail to function, so not really a big deal. Still, I added input validation to prevent enabling that option if DNSSEC support isn't enabled.
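The kind of check described here can be sketched in a few lines. This is only an illustration in Python, not the actual pfSense code; the function and parameter names are hypothetical.

```python
def validate_resolver_options(dnssec_enabled, harden_dnssec_data):
    """Return a list of input errors for a hypothetical resolver settings form."""
    errors = []
    # Hardening DNSSEC data only makes sense when DNSSEC support is on.
    if harden_dnssec_data and not dnssec_enabled:
        errors.append("Harden DNSSEC data requires DNSSEC support to be enabled.")
    return errors
```

An empty list means the combination is accepted; the form would refuse to save while the list is non-empty.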

                              • cmb

                                @kejianshi:

                                Those defaults are already being changed for next release I believe - because they matter…

                                The only default we're changing is hard coding harden-glue to yes. That checkbox is gone in 2.2.1, and the config.xml setting ignored if it exists.

                                • cmb

                                  @edmund:

                                  From a human interface POV there's just too many boxes to check on unbound.

                                  I agree there's a lot there. It has a lot of options. We either have a ton of boxes, or force people to use manual configuration in the advanced box which is error-prone and could break on upgrade where the checkboxes won't. We'll be putting out guidance on usage that should help clarify things.

                                  Make sure harden glue is enabled (on 2.2 and earlier), and defaults are otherwise fine. You don't have to be pushing buttons there unless you have atypical needs (mostly very large networks).

                                  • dparring

                                    I have only been observing for about 5 hours after upgrading to 2.2.1, but it appears the frequent unbound restarts triggered by DHCP may be resolved with the latest update. Unbound has not restarted since the update, even during routine DHCP events like writes to the leases file that previously triggered it.  Perhaps it is something that was updated in pfSense 2.2.1 or perhaps there was a change with unbound 1.5.3, but I would suggest that anyone who has been reporting unbound instability try the new version.

                                    • Inq

                                      It's still restarting as soon as you change anything in the resolver settings or change the DNS addresses from general setup. It's not restarting if you don't touch any settings after a reboot….go figure.

                                      The problem with making something idiot proof is that the world keeps making better idiots.

                                      • kejianshi

                                        I like the number of boxes and options.  I'd rather be intimidated by options than limited.

                                        • Evad

                                          @edmund:

                                          When I upgraded to the current 2.2 release from the previous version I switched to using unbound, but I've been finding that it stops working every now and then - suddenly nothing on the network resolves.  Manually restarting unbound fixes this … until the next time.  I've just switched back to dnsmasq on my work system and rebooted in the hope that this will fix it.

                                          I'm seeing this both on my work system and on my home firewall too.  Looking at the status display it seems that unbound is still working - it doesn't show up as stopped, it just doesn't work.  Sorry if these notes aren't very helpful but there does seem to be an issue here.

                                          edmund

                                          Thank you for posting that. It is very helpful to me.  ;D
                                          I was running 2.2 x64 on a Dell 780 and 620 for a few months without issue. After adding a LAN gateway and creating an asymmetric routing issue (which was resolved with forum help), I moved to two Check Point U10s with 2.2 i386 six days ago so I could use the third NIC to implement the gateway correctly. Since then I have had to restart the Unbound DNS Resolver every couple of days or so. As you said, "it doesn't show up as stopped, it just doesn't work".
                                          I have now selected Harden Glue and Harden DNSSEC data!!!  We will see if this fixes my issue.

                                          • Barnabas

                                            I had this problem a few days ago too, and I am also on the current 2.2.1 release.
                                            What fixed the problem was checking the following in the DNS Resolver advanced settings:

                                            • Hide Identity
                                            • Hide Version

                                            Any ideas as to why this would work?
                                            Also, I noticed that after I did this I was no longer getting strange messages in my DHCP log complaining about incorrect length of DHCP headers (UDP, I think).  I no longer have these messages because the log has cycled.

                                            Could BandwidthD be doing something that Unbound does not like?
                                            I get messages on my dynamic IPs in BandwidthD that say "xxx.xxx.xxx.103 - Configure DNS to reverse this IP".  My static IPs are reversed with no problems.
                                            I have Register DHCP leases in the DNS Resolver checked as well as Register DHCP static mappings in the DNS Resolver checked.  If I understand correctly those should let BandwidthD resolve all the IPs on my network.

                                            Thanks.

                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.