KEA DHCP continuously rebooting with error message after 24.11 upgrade and switch from ISC
-
Error messages as below with it occurring every second it seems and causing it to be restarted by the watchdog every hour or so.
Thoughts as this error message isn't leading me down to a specific problem I can identify.
Thanks,
PabloDec 5 19:19:24 kea-dhcp4 22525 ERROR [kea-dhcp4.commands.0x2f054a812000] COMMAND_SOCKET_ACCEPT_FAIL Failed to accept incoming connection on command socket -1: Bad file descriptor
Dec 5 19:19:24 kea-dhcp4 22525 ERROR [kea-dhcp4.commands.0x2f054a812000] COMMAND_SOCKET_ACCEPT_FAIL Failed to accept incoming connection on command socket -1: Bad file descriptor
Dec 5 19:19:24 kea-dhcp4 22525 ERROR [kea-dhcp4.commands.0x2f054a812000] COMMAND_SOCKET_ACCEPT_FAIL Failed to accept incoming connection on command socket -1: Bad file descriptor
Dec 5 19:19:24 kea-dhcp4 22525 ERROR [kea-dhcp4.commands.0x2f054a812000] COMMAND_SOCKET_ACCEPT_FAIL Failed to accept incoming connection on command socket -1: Bad file descriptor
Dec 5 19:19:24 kea-dhcp4 22525 ERROR [kea-dhcp4.commands.0x2f054a812000] COMMAND_SOCKET_ACCEPT_FAIL Failed to accept incoming connection on command socket -1: Bad file descriptor
Dec 5 19:19:24 kea-dhcp4 22525 ERROR [kea-dhcp4.commands.0x2f054a812000] COMMAND_SOCKET_ACCEPT_FAIL Failed to accept incoming connection on command socket -1: Bad file descriptor
Dec 5 19:19:24 kea-dhcp4 22525 ERROR [kea-dhcp4.commands.0x2f054a812000] COMMAND_SOCKET_ACCEPT_FAIL Failed to accept incoming connection on command socket -1: Bad file descriptor
Dec 5 19:19:24 kea-dhcp4 22525 ERROR [kea-dhcp4.commands.0x2f054a812000] COMMAND_SOCKET_ACCEPT_FAIL Failed to accept incoming connection on command socket -1: Bad file descriptor -
In looking at this in more detail, this error message also caused my CPU to be stuck around 50% and was also associated intermittent network loss. Switching to ISC resolved both issues.
-
@PabloAbonia I am seeing the same issue on my Netgate 4200 running 24.11
-
@PabloAbonia said in KEA DHCP continuously rebooting with error message after 24.11 upgrade and switch from ISC:
COMMAND_SOCKET_ACCEPT_FAIL Failed to accept incoming connection on command socket
Can't be that bad.
See here : https://kea.readthedocs.io/en/kea-2.0.1/kea-messages.html
I propose : stop kill zap all kea process.
Console, or better : SSH : option 8 and thenps ax | grep 'kea'
and kill them all.
Then, check that you still have this file /var/run/kea4-ctrl-socket (socket actually).
rm it.Now start all the kea stuff with the GUI.
edit :
When kea starts, and runs fine, you can actually use this socket to talk to the process.Run this, on the command lie, and you'll see it answers you with loads of information :
echo '{"command":"lease4-get-all"}' | nc -U /var/run/kea4-ctrl-socket | jq
-
I just had this happen on my Netgate 1541. The effect was that it generated a dhcp.log file with 14923504 rows in under 7 minutes, rolled the log file due to size and the automatic bzip compression ended up taking 100% of all CPUs trying to compress all the dhcp log files.
24.11-RELEASE (amd64)
built on Wed Nov 27 12:22:00 CST 2024
FreeBSD 15.0-CURRENTEveryone on the LAN experienced loss of internet access, as it was sometimes allowing packets to flow, but not reliably.
Interestingly, IPv6 DHCP worked, so I could access the firewall GUI from the LAN. It was slow due to the bzip processes (based on running top from the shell)
The repeated error message I saw is nearly identical to the OP:
Feb 2 13:38:20 pfsense kea-dhcp4[91496]: ERROR [kea-dhcp4.commands.0xc6b64e12000] COMMAND_SOCKET_ACCEPT_FAIL Failed to accept incoming connection on command socket -1: Bad file descriptorChecking the log file to see how quickly it generated - 36,217 errors/second!
$ grep 'Feb 2 13:38:20' dhcpd.log.6 | wc -l
36217I tried rebooting the Netgate 1541, but it immediately happened again.
I tried disabling DHCP, restarting DHCP from within the GUI, but neither option resolved the error messages.I switched back to ISC DHCP for now, but that isn't a long term solution since it is deprecated.
Because it recurred after a reboot, I am not sure what else to do to get kea DHCP to work reliably. Are there any other settings that could be causing this?
-
@PVuchetich2 said in KEA DHCP continuously rebooting with error message after 24.11 upgrade and switch from ISC:
COMMAND_SOCKET_ACCEPT_FAIL Failed to accept incoming connection on command socket -1: Bad file descriptor
Here : https://kea.readthedocs.io/en/kea-2.1.7/kea-messages.html
A socket is just some kind of special file. Its created at kea starts, no big deal. Every process, like ngins, (web GUI) unbound (resolver) create these.
COMMAND_SOCKET_ACCEPT_FAIL
Failed to accept incoming connection on command socket %1: %2
This error indicates that the server detected incoming connection and executed accept system call on said socket, but this call returned an error. Additional information may be provided by the system as second parameter.
Try this :
In the GUI, stop all kea server services.
Then use the console or (better) SSH, menu option 8.cd /var/run
then
ls -al kea*
Normally, there should be no files anymore that starts with "kea".
If there are, remove them all.Now, start the kea server(s) again.
Check again the content of the directory, there should be a new kea-ctrl-socket file again (and a lock file).Other checks :
Systems processes like unbound, the web GUI and kea gets restarted when there is an up down interface. This happens when an interfaces goes down for a moment. You don't have these ?