Wan periodic reset causes system reboot.
-
We think we know what might cause it but without being able to replicate it locally it's difficult to prove.
23.09 dev snapshots have been enabled for anyone able to test against that. It would be good to know if there is any difference there.
-
@stephenw10
I only have one Netgate device and that is in production, but hopefully someone has a unit they can test with. Unless you have a unit you can send for testing, Steve?
-
I mean I have all the units but.....
-
@RobbieTT can you send the link to the Redmine issue?
-
@stephenw10 I installed the 23.09 dev version.
Resetting WAN still resets the device, but I will check again.
-
@stephenw10 I found another way to trigger the issue.
Status interface disconnect wan
System reboots
-
@AlexanderK said in Wan periodic reset causes system reboot.:
Status interface disconnect wan
System reboots
That applies to me too, but only around 50% of the time. My WAN is delivered by PPPoE, so that difference may affect the rate of crashes / reboots.
It is unfortunate that the issue impacts v23.09d.
-
I will try to create a lab and reproduce the error. I will post progress.
-
Anything new?
I tried to replicate the issue in a lab but nothing happened.
I am using my production network without IPv6.
-
There have been some backend changes to our build system preventing new snaps for a few days. Let me check.....
-
We are still digging into this. It looks like there may be several related issues here. The NDP issue being one of them.
-
@stephenw10 said in Wan periodic reset causes system reboot.:
We are still digging into this. It looks like there may be several related issues here. The NDP issue being one of them.
If the coding is not too complicated, understanding this will wipe at least 3 bugs away. A 4th could be the unexplained DNS Resolver cache wipe following a pfBlocker cron job. Seems you have more in mind though!
-
Well I hope there's not more!
-
@stephenw10 said in Wan periodic reset causes system reboot.:
Well I hope there's not more!
Surely one fix that fixes many is better than chasing down all these individual bugs? Well, unless you are the one unpicking the code...
Will this work fold into v23.09 or is it too late for that?
-
I hope that it will be in 23.09. The NDP fix certainly will be.
-
Ok, sounds hopeful but I appreciate this discovery came very late in the .09 workflow.
-
Anything new on this issue?
-
Not yet. At least not as far as I know since we still have yet to replicate it locally. There are fixes for other things that could be interacting to cause this on some systems. If you're able to test a 23.09 snapshot and can repeatedly trigger this issue please do so.
-
Still covered by this on redmine:
No improvement yet on 23.09 dev and the issue is (probably) being pushed to 24.03, so another 6 months+ away.
It's not ideal, I know. I'm looking for a non-pfSense option in the interim to cover the periods when I may not be around to resolve these crashes & reboots.
In the meantime I've been pushing data at the Netgate team and running stuff whenever needed and trying every development load.