2.0-RC2 on alix6f2 crashing
-
Running the newest snapshot 2.0-RC2 (i386) built on Fri May 13 23:17:13 EDT 2011 on an ALIX board alix6f2 with the newest BIOS.
This system is crashing every few hours and then reboots. I have tried going back to older snap shoots and it still likes to crash on me. This router was rock solid under the older release (1.2.3-RELEASE).
I have another model of Alix board running at another site with the same snap shoot and it's been running fine (Man difference is this other site currently has no load or clients using it at the moment).
Setup for the unit that is crashing:
-PPPoE WAN
-LAN
-Two wireless interfaces
-Two OpenVPN site-to-site links
-PPTP Server
-No PackagesAny idea where to start looking for the problem? Any options I should turn on or off? I'm going to setup hyper terminal and watch it so I know what's going on when it crashes.
-
Fatal trap 12: page fault while in kernel mode cpuid = 0; apic id = 00 fault virtual address = 0xc fault code = supervisor read, page not present instruction pointer = 0x20:0xc0970497 stack pointer = 0x28:0xe2ca2498 frame pointer = 0x28:0xe2ca24c4 code segment = base 0x0, limit 0xfffff, type 0x1b = DPL 0, pres 1, def32 1, gran 1 processor eflags = interrupt enabled, resume, IOPL = 0 current process = 11 (irq10: vr0) trap number = 12 panic: page fault cpuid = 0 Uptime: 2h35m4s Cannot dump. Device not defined or unavailable. Automatic reboot in 15 seconds - press a key on the console to abort Rebooting...
-
Doing some reading on other people have similar problems with older snap shots. I'm going to disable my OpenVPN client connections and see what happens over the next day.
-
Nope still crashing.
-
I've tried numerous snapshots of 2.0 since January and they all crashed on both my 2C0 and 2C2. I try to come back here every month or so to see if it's worth trying again. Not yet, obviously.
-
I'm starting to think this is due to failing hardware. My UPS started acting up a week ago and it might have sent a surge to my Alix board. At some point I'm going to try going back to 1.2 and see if I still have the issue. If I do then I guess I'll be buying a new board. I might need to contact APC about this as they do have those warranties to protect your hardware.
-
I've had RC1&2 builds running on 2 different Alix setups without issues at all. I think you might have bad hardware somewhere.
-
Yea I'm coming to that conclusion. Sucks when a device that is meant to protect a device is the cause of the problem.
-
Found the problem: UDP LAN net * ! Router 53 (DNS) WAN none DNS Reject
If the gateway is set to WAN instead of * on this rule every time it rejects a DNS request the router crashes and reloads. All I have to do to crash the router is "nslookup google.com. 8.8.8.8" and the router reject the connection and crashes.
-
Confirmed that really does panic a box. I'm gathering some more info, I'll add it onto the ticket.
http://redmine.pfsense.org/issues/1552 -
To bad I found this after buying a new Alix board and taking the time to setup my router again from scratch. But hey now I have an extra router, a clean configuration file, and I found a bug.