Slow DNS after 22.05
-
Netgate SG-1100, after updating to 22.05, log file is full of lines like
Jun 30 17:54:10 unbound 35746 [35746:0] error: recvfrom 23 failed: Protocol not available
Then it will stop the service and restart it.
-
@domnado said in Slow DNS after 22.05:
Netgate SG-1100, after updating to 22.05, log file is full of lines like
Jun 30 17:54:10 unbound 35746 [35746:0] error: recvfrom 23 failed: Protocol not available
Then it will stop the service and restart it.
Do you have DNSSEC enabled by chance? If so, try disabling it for a test and restartingunbound
. I'm basing this hunch on the results of a quick Google search for error messages similar to yours withunbound
.If that clears up the problem, then I suspect your SG-1100 needs to be powered all the way down gracefully, and then restarted. So that means doing a shutdown from the DIAGNOSTICS menu, waiting for the shutdown to complete and box to halt, then remove the power for several seconds. Restore power and let it boot up again. The SG-1100, if I recall correctly, can sometimes have its crypto chip get into a state where it fails and the ONLY way to fix it is a power-off reboot. A typical restart will not reset the hardware.Scratch my first idea, misread the Google result against your error message.
Other Google hits suggest perhaps something going on with the NIC. For example, one user reported similar error messages when his NIC was dropping checksum error packets due to a driver problem.
-
Hope I'm not speaking too soon, I tried the halt and pull power cord for 30 seconds, system has been up for 10 minutes now with zero error messages.
-
@domnado said in Slow DNS after 22.05:
Hope I'm not speaking too soon, I tried the halt and pull power cord for 30 seconds, system has been up for 10 minutes now with zero error messages.
Resetting the crypto chip hardware certainly won't hurt anything, and it may be the solution. I did a some quick scans through the
unbound
source code but was unable to locate that specific error message template text. I was hoping if I found the error message in the source code that it would help identify a possible cause. -
Unbound started acting up again, same error messages. I did make a change to the DNS Resolver settings, but only to the Network Interfaces section. I had to halt the system and unplug power for it to operate normally again.
-
@domnado said in Slow DNS after 22.05:
Unbound started acting up again, same error messages. I did make a change to the DNS Resolver settings, but only to the Network Interfaces section. I had to halt the system and unplug power for it to operate normally again.
I think you tickled a clue there -- "make a change to the DNS Resolver settings, but only to the Network Interfaces section."
What specifically did you change there? What setting was working versus what setting you changed it to that resulted in the error message?
-
At first "Network Interfaces" was set to ALL, first I changed it to everything but ALL (LAN, WAN IPv6 Link-Local, LAN IPv6 Link-Local, and Localhost), then I just changed it to LAN and Localhost. I also turned off both Prefetch options in Advanced Settings when I selected LAN and Localhost interfaces. Both changes were fine after a halt and power cord pull. The errors only started after clicking the Apply Changes button.
-
I'm having the same problem with slow DNS after 22.05.
I've had my setup (Netgate 2100) for over a year, everything has been fine.
Suddenly DNS queries are timing out.
No, I didn't change anything, other than to install the upgrade when prompted to do so.
Any suggestions? -
@jax I moved to a virtualized OPNsense instance since the start of the thread. For now, having a better experience. No problems resolving DNS.
-
@mihaifpopa said in Slow DNS after 22.05:
virtualized OPNsense instance
That's good. I'm on a Netgate device and I'd like it to go back to working correctly!
-
@jax What are you seeing when you go to the Diagnostics->DNS Lookup page?
-
First try: about a 9 second wait followed by the correct answer.
Second try: about a 22 second wait followed by the correct answer.The pfSense display shows that 127.0.0.1 is timing out.
I have no idea why the Netgate device is querying itself.
As soon as it queries the next device upstream it gets an answer.Name server Query time 127.0.0.1 938 msec 192.168.xx.xx 48 msec
-
Ha! In General Setup -> DNS Resolution Behavior I chose "Use remote DNS servers, ignore local DNS" and things look better now. We'll see if that fixes it.
-
@jax Sounds like DNS Resolver is stopped.
Go to the Service->DNS Resolver page and click the "start" icon in the header, of Status->Services and click it there.FWIW reliance on the ISP DNS servers may result in being handed misleading DNS records. Remember when ISPs would resolve unresolving IPs and pass you to a search page? This helps you avoid that, among other things.
-
@rcoleman-netgate Okay, I restarted the DNS Resolver and have set the DNS Resolution Behavior back to use local DNS with fallback to remote. We'll see how this goes.
-
Do you run Suricata by any chance??
-
@cool_corona No, I don't.
-
@rcoleman-netgate Goes back to lousy performance. I've set it back to using remote DNS.
-
@jax Do you have any DNS specified in general settings??
-
@cool_corona No, no specified DNS servers. It's just using the default, the upstream WAN DHCP-assigned server.