pfSense nic freeze
-
checked, but there is no crash dump in /var/crash. So it really seems the nic's just froze, as there was no way to communicate with pfSense anymore in any way.
-
if there was a problem with the NIC, you would also see it from the shell
the NIC is not a separate animal in your pfSense box that lives a separate lifeso observation remains when the collapse occurs.... (via shell)
-
Disable all those package's and try it for a period of time.
Probably using all the memory when Snort or pfBlocker update's.
-
Memory never comes above 50% for the last 2 weeks.
-
believe me it's just a guess from now on
one thing you can do is throw up a Linux on it and pressure it a hard hardware stress test-or you wait until the next collapse
-
+++++
by the way @Impatient is trying to tell you that updating the described packages has high memory usage ...
well, you don't even see these when they happen and the system crashes (randomly) -
Just ran a stress test for 10 minutes.
stress-ng --metrics --cpu 4 --vm 4 --vm-bytes 2G --io 2CPU 100%, mem 99%. Rock stable, not a glitch. Still being able to download at 250Mbps.
-
yes these mistakes are the worst
like looking for a needle in a haystackI would otherwise sharpen the test to network transmission....
since the CPU and MEM test is good in the short term, but think about it if it’s a longer term thing
for example, a thermal heat run error and only occurs if the temperature on one of the active elements on the MOBO is higher for a long time -
@DaddyGo Good point. But the load on the device is usually very low. It's just an home router/firewall. CPU is normally <5% and memory around 40. With my usage, I don't expect high temperatures very soon :)
-
what you are writing about is a relative state of rest
routing does not require power machinesI understand all this and and I know the pfSense resource needs..
I have seen huge pfBlockerNG lists loading with 98% CPU and 4GB RAM usage (on APU 4d4 board)
it was of course an overloaded pfSense box and an unreasonably large listthis can happen when you are not supervising the box and not seeing it
but eventually kills the OP system
it randomly occurs, as your problem -
I understand. However, pfBlockerNG is updating every hour, as are more things. The monitor log does show a little increase on memory usage at that moment, nothing else. Will keep an eye on it.
-
hard, hard...
it really remains to seize the moment and and you can do any investigation when the incident happens.
for now - because nowhere to find anything (crash report, suspicious logs) you can only wait.BTW: I don't think that pfSense problem is this...
-
one more idea:
what about the latest BIOS?
since the ACPI FW code can cause such a stupid situation -
The device is on the lastest bios from 2018. No newer bios was released since then.
-
this can be a problem (looks like old BIOS)
what do you see after that(?):dmesg | grep 'error'
dmesg | grep 'Firmware*' -
dmesg|grep 'error'
module_register_init: MOD_LOAD (vesa, 0xffffffff812d9960, 0) error 19
WARNING: /: mount pending error: blocks 48 files 2dmesg | grep 'Firmware*'
<no output> -
áááhhhááá,
Are you on UFS?
why not ZFS?this can be due to a lot of violent downtime (crash)
the VESA error is fine (it's strong, hihihi), who also has a VGA controller on the MOBO
but it is "WARNING: /: mount pending error: blocks 48 files 2"https://forums.freebsd.org/threads/mount-pending-error.67573/
read it and then:
https://docs.netgate.com/pfsense/en/latest/hardware/troubleshooting-disk-check-errors-fsck.html
https://www.youtube.com/watch?v=4DKr1Dvan5I -
@DaddyGo said in pfSense nic freeze:
WARNING: /: mount pending error: blocks 48 files 2"
AFAIK, his is expected, as I had to pull the power to reboot the device. So the disk was not clean shutdown.
-
yes that's okay, I also wrote this too, but
fix the file system before you scan further the boxeverything must be ruled out when searching for such an error..
- poor disk fragmentation, a typical cause of random crashes
I know you think of the NIC because the LEDs don't flash but like I said it could be part of a process