Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    KEA service stopping through the day

    DHCP and DNS
    16
    43
    7.1k
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • jimpJ
      jimp Rebel Alliance Developer Netgate
      last edited by

      Curious. I wonder what the contents of /var/run/kea/kea-dhcp4.kea-dhcp4.pid were during that time.

      I could maybe see something like that happening if the PID file in /var/run/kea/kea-dhcp4.kea-dhcp4.pid contained a PID that didn't match the daemon that was running.

      Remember: Upvote with the ๐Ÿ‘ button for any user/post you find to be helpful, informative, or deserving of recognition!

      Need help fast? Netgate Global Support!

      Do not Chat/PM for help!

      GertjanG chudakC 2 Replies Last reply Reply Quote 0
      • GertjanG
        Gertjan @jimp
        last edited by

        @jimp

        That's the one I'll be looking at - the content - the next time.

        No "help me" PM's please. Use the forum, the community will thank you.
        Edit : and where are the logs ??

        1 Reply Last reply Reply Quote 1
        • GertjanG Gertjan referenced this topic on
        • C
          Cobrax2
          last edited by

          I switched to kea too, and it can't restart if it somehow gets hung or killed by unbound because of that lock file. Is there a way to add the rm command in the startup file of kea? What is the startup command for it, where is it located? I'd hate to have to run a cron job every minute just to remove it...
          Thanks

          1 Reply Last reply Reply Quote 0
          • M
            Markito
            last edited by

            My KEA service (in an HA setup with two pfSense instances, both 2.7.2) kept behaving oddly, so I reverted to the ISC DHCP server which so far has been 100% stable. I don't think KEA is ready for prime time yet. Here's what I've observed:

            • Once, I entered 10-15 static IP mappings, and the next day I noticed they were all gone.
            • KEA server sometimes ran OK on both HA nodes, sometimes it stopped on one node but not on the other
            • While adding static IP mappings today, I went back to edit a handful to enable Static ARP on the MASTER HA NODE. Reproduction steps:
              ** Click EDIT on a static DHCP rule.
              ** Click on static Arp checkbox to enable
              ** Hit Enter
              ** Go to Status -> DHCP Leases
              ** It's blank. The KEA Service on MASTER NODE had stopped itself with the error: "failed to initialize Kea server: configuration error using file '/usr/local/etc/kea/kea-dhcp4.conf': cannot lock socket lockfile, /tmp/kea4-ctrl-socket.lock, : Resource temporarily unavailable"
              ** This was very reproducible
            • Also, when doing CARP failover, sometimes KEA would stop on one node and not run on both after I disabled the forced-failover

            Again, wayyyyy too many issues.

            S 1 Reply Last reply Reply Quote 0
            • S
              SteveITS Galactic Empire @Markito
              last edited by

              @Markito said in KEA service stopping through the day:

              sometimes ran OK on both HA nodes

              https://docs.netgate.com/pfsense/en/latest/releases/23-09.html#kea-dhcp-server-feature-preview-now-available

              "Currently the Kea implementation lacks the following DHCP server features:
              ... High Availability Failover"

              Pre-2.7.2/23.09: Only install packages for your version, or risk breaking it. Select your branch in System/Update/Update Settings.
              When upgrading, allow 10-15 minutes to restart, or more depending on packages and device speed.
              Upvote ๐Ÿ‘ helpful posts!

              M 1 Reply Last reply Reply Quote 1
              • M
                Markito @SteveITS
                last edited by

                @SteveITS Thanks :) I had not realized that.

                1 Reply Last reply Reply Quote 0
                • T
                  ThomasDr @w0w
                  last edited by

                  @w0w
                  This can happen if you have switched from dhcpd to kea but have not changed the service watchdog.

                  1 Reply Last reply Reply Quote 1
                  • chudakC
                    chudak @jimp
                    last edited by

                    @jimp Today see TS state unexpected state: NoState and removing /tmp/kea4-ctrl-socket.lock does no help

                    something new?

                    1 Reply Last reply Reply Quote 0
                    • M
                      marcosm Netgate
                      last edited by

                      This issue should be handled with the 24.11-RC. Feedback on it would be helpful if you were hitting this previously.

                      D 1 Reply Last reply Reply Quote 0
                      • D
                        darkrolder @marcosm
                        last edited by

                        @marcosm This issue is happening to me
                        a few nights prior i woke up to some "IOT" things flashing as they couldnt connect to their wifi.
                        and found i didnt have internet, however when i got up at 6 it was working again without user intervention so i am not sure..

                        this morning i woke up to no "internet"
                        (some statically set things over ethernet were working, obv) but everything wifi was offline.

                        on the router, kea ipv4 was offline i had to click the start button, for now i have installed the watch dog server to auto restart. id send logs if i knew where and which ones you wanted to help diag this? or if this is even related? (running 24.11)

                        best regards
                        -Rolder

                        D 1 Reply Last reply Reply Quote 0
                        • D
                          DavidIr @darkrolder
                          last edited by

                          @darkrolder said in KEA service stopping through the day:

                          on the router, kea ipv4 was offline i had to click the start button, for now i have installed the watch dog server to auto restart. id send logs if i knew where and which ones you wanted to help diag this? or if this is even related? (running 24.11)

                          I have also had this experience on my Netgate 3100. Only details I could find in the logs was:

                          Dec 5 20:58:54	kernel		pid 67465 (kea-dhcp4), jid 0, uid 0: exited on signal 6 (core dumped)
                          

                          For some reason the kia-dhcp4 process does not seem to be generating any log entries on my device so really hard to work out if it's connected.
                          I have after reading a few posts increased the size of my DHCP pool in case some IoT devices are doing something odd (I saw this as a possibility in another thread.

                          I am now in the monitoring phase but I would not have expected a DHCP service to fail so spectacularly if it ran out of addresses in it's pool to give out.

                          D M 2 Replies Last reply Reply Quote 1
                          • D
                            darkrolder @DavidIr
                            last edited by

                            @DavidIr I don't think i have that many IOT devices, maybe 10-15? mostly light switches and my
                            cell phones, tv, etc. i have them on their own vlan with the DHCP pool size at about 150 IPs, my brother (IT admin) has suggested changing the DHCP lease from the default 2 hours to 24 hours. i have done this, so far so good. but its only been 1 day. s: only time will tell if this helps

                            -Rolder

                            1 Reply Last reply Reply Quote 0
                            • M
                              marcosm Netgate @DavidIr
                              last edited by

                              @DavidIr Did this happen on 24.11? The core dump file should be in /root - sharing that would help determine what happened.

                              D 1 Reply Last reply Reply Quote 0
                              • D
                                DavidIr @marcosm
                                last edited by

                                @marcosm

                                @marcosm said in KEA service stopping through the day:

                                @DavidIr Did this happen on 24.11? The core dump file should be in /root - sharing that would help determine what happened.

                                Yes it did.
                                cefcd600-2ba6-4aa8-97fe-f820e1c96dc1-image.png
                                I assume the kea-dhcp4.core file?
                                I'm not familiar with these files - is it safe to just attach to the forum, or should I send in some other way?

                                M 1 Reply Last reply Reply Quote 0
                                • M
                                  marcosm Netgate @DavidIr
                                  last edited by

                                  @DavidIr Yes. Check the timestamp of the core files, e.g. with ls -lha /root/*.core and if they are smilar (i.e. potentially related), upload them here.

                                  D 1 Reply Last reply Reply Quote 0
                                  • D
                                    DavidIr @marcosm
                                    last edited by DavidIr

                                    @marcosm I have uploaded the file I downloaded the other day, but will also upload another from today which definitely aligns to when the service unexpectedly stopped this morning - I resolved by simply restarting the service. I was surprised that the watchdog did not restart the service for me.

                                    9a05935d-f121-4067-b75c-b70a99c6ecc5-image.png

                                    D 1 Reply Last reply Reply Quote 0
                                    • D
                                      DavidIr @DavidIr
                                      last edited by

                                      @DavidIr Hi @marcosm any joy looking at the dump files?

                                      I have implemented the changes in https://forum.netgate.com/post/1199521 for now in the hope this will restart the DHCP service when it fails, but would love to understand what's going on and help solve the potentially wider issue.

                                      Thank you

                                      M 1 Reply Last reply Reply Quote 0
                                      • M
                                        marcosm Netgate @DavidIr
                                        last edited by

                                        @DavidIr It would help to have some additional info about the system. You can get that by going to /status.php.

                                        D 1 Reply Last reply Reply Quote 0
                                        • D
                                          DavidIr @marcosm
                                          last edited by DavidIr

                                          @marcosm status_output.tgz uploaded to the same link provided above.

                                          Since the previous messages I have installed and configured the Service Watchdog plugin

                                          D 1 Reply Last reply Reply Quote 0
                                          • D
                                            DavidIr @DavidIr
                                            last edited by

                                            In case you need any additional info I am now on holiday until Jan 5th so will not see or be able to respond to any posts or requests for info until I return.

                                            1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post
                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.