Firebox Marvel ports locking up (CORE-E SERIES)
-
sorry to hear deanot! :(
My board (igb driver) ran for 4 days before crashing twice this morning, once under almost no load at 4:30am in the morning. I've managed to compile the latest Intel driver 2.4.3 for mine and will try it later. Damn, took hours of reading and trial and error to get it going (had to compile it in a FreeBSD 10.1 image installed in a VM). Hope this solves the issue.
Anyway, good luck in the future.
Edit: Running on new driver but looks like it doesn't support traffic shaping, which is beyond my ability to fix. Oh well, I guess I will try it out still to see how well it works (or doesn't).
Edit #2: Looks like the driver source would need to be edited to allow 'altq' additions. Too much to worry about right now…will just test as is to see if it locks up or not.
Edit #3: Still locked up during the night. I'm going to move the LAN over to another, unused port to see if this is a hardware issue.
-
I do hope you get it working, I feel lost without using the red brick, kinda unsecured in a way. I am going away for a couple of weeks, being out of the country and this being so unstable is not something I can leave running.
My phone server is protected on the other side of this brick, plus all the other devices that require it for internet access.
I thought you was going to load 2.1 back on your machine? did you do that and are you still seeing the same problems? maybe once 2.2.5 is released on nano the problem will go away?
I am going to look for a couple of cards, just to get something up and running, at least I can dump my config and put it on a new build, might be something to try.
-
Since this was new, I never had 2.1.5 on it in the first place. I spent the better part of a day trying to load it with the full CD install only to be greeted by the ROOT MOUNT ERROR no matter what I did. I finally gave up on that one (for now).
Not sure what to do now other than to switch ports (or replace the switch that igb1 is attached to - Zyxel GS1900-16 smart semi managed switch - to see if the LAN port is having issues with the switch).
Edit: Probably a long shot but I did have it connected to another switch (dumb switch) when I did the week burn-in test (with no WAN though). So either the switch makes a difference or something about routing traffic from the WAN to the LAN is locking up the port needing a reset.
Edit #2: Moved the connection on the LAN to another switch and it just went down for the count again. Will move it back to the original switch and then move the LAN from igb1 to igb2 to see if there is a hardware issue with port 1 (igb1) of the board.
Edit #3: The igb2 port did a watchdog and reset also. Unless I'm mistaken and the ports run off the same chip (don't think so), this has got to be a software or configuration error. :(
-
Engineer, have you not tried 2.1.5 on the Firebox yet? I've been using that for over a year without any problems 24/7. BSD 10.1/Pfsense 2.2.4 has a bug somewhere for sure using this old hardware. I'm building a new system now so can just retire my Firebox on 2.1.5.
-
Engineer, have you not tried 2.1.5 on the Firebox yet? I've been using that for over a year without any problems 24/7. BSD 10.1/Pfsense 2.2.4 has a bug somewhere for sure using this old hardware. I'm building a new system now so can just retire my Firebox on 2.1.5.
corvey, I'm not using a firebox…just jumped in this thread because it was very similar to my issue. I'm running new hardware (Supermicro X11SBA-LN4F with Intel N3700 board with 4 Intel A210-AT ports). I tried to install 2.1.5 but couldn't get around ROOT MOUNT ERROR no matter what I did. I spent half a day trying to get it to install from both a USB stick and burned CD....no luck.
I might try again later when people aren't in the house using the Internet (they yell within 10 seconds of the watchdog triggering...sigh)
I know, I should have started my own topic (or stayed in the X11SBA-LN4F thread right below this one....just trying to solve both my and Deanot's issue).
-
I have an HP 4 port nic coming, Dual intel chipset. I will build a machine outside of the Firebox, see how it works out, I do hope this is not a driver issue with the build. If it is, I will use an older version and avoid the newer builds until certain the issues are resolved.
A few things I do know.
(1) it is not device dependent.
(2) It started to happen on the 2.2.4 Nano BSD build for myself and the regular 2.2.4 build for Engineer.
(3) Pulling the network cable from the port resets the port and the issue goes away. For a while at least.
(4) Time is not a factor, I have seen my box run anything up to 14 days without issue, then BAM. Or it could happen within minutes, hours or days of being up.
(5) Heavy traffic or minimal traffic has not much affect, for me, I could just log into the GUI and it would trigger it.
(6) Slowing the Nic Speed down, changing settings related to Nic cards had little to no effect. -
I turned off VLAN_HWTCO (ifconfig igb2 -vlanhwtco) and also apinger (WAN PORT MONITORING). Log is down to less than 5% of what it was (things were restarting and checking VERY often. One+ days up with little in the log. Time will tell.
-
Keep us posted, you might drill down into what the actual issue is.
-
Keep us posted, you might drill down into what the actual issue is.
Very sad. LAN port watchdog timeout again. :(
Not sure where I'm going from here…..
-
Hello,
I am affected too! :(
Just a question: I have the LCDProcDev version running on my Firebox. When the lock up happens on your machines is the LCDProc still running? On my machine there is no change on the display.
So I thougt it has completely chrashed and I switched it off andf on again.Matthias
-
I run it too, it continues to function like nothing is wrong.
-
I have a X750e with deactivated MSK intefaces.
I will give it a try to use them instead of the sk interface I currently use…. -
2.2.5 is out, maybe they fixed this bug.
-
I don't think 2.2.5 is a nano build is it?
-
No you're right, it is a nano build. I might try it out later, just to see if it is better or not.
-
2.2.5 Nano installed, I had to reset the ports about 18 times before I could get it on. But it is on and running, and I am testing and abusing the ports to see if I can get it to lock up again. At this time, I do believe it will do the same…..
-
That was short lived, locked up within minutes of posting this. Oh well, time to ditch this brick and find something more reliable or go back to 2.1x
-
That sucks, thanks for trying 2.2.5 for us, deanot. I will be retiring my Firebox on 2.1.5 and use it as a solid backup router.
I'm still waiting on my 1265Lv2 Xeon CPU to arrive from Fleabay. I will be going ESXI this time around to install Pfsense on. Hopefully it works.
-
I fired the box up again this morning, this time I uninstalled Snort, so far it has been up and running solid for the past few hours. I am still expecting it to lock up on me, just have to wait for when it happens.
The only extension I am running right now, is LCDproc-Dev. Nothing more, nothing less.
-
I did receive my HP 4 port intel nic card, I should be building up a new PFSense system sometime today, I am going to load a full install of the latest build on it and see what happens. I am unsure if I want to restore my settings or start out completely fresh and hope for the best.
Being unsure of what is causing this, I feel the best approach would be to do everything from fresh. For all I know, it could be my config that is doing it, it could be from a previous install doing it. All I know, when I was on 2.1.x, it worked just fine.
Without much information from devs, I have no idea if drivers got updated/changed or what could be causing this. I do know it is a known issue for some network cards, but nothing was mentioned about the Marvel nics, and this does seem a little more wide spread than just my system doing this as of right now. It is apparent, this is not just firebox related anymore.