Freezing
-
I've had this machine for about three years. It is a very, very cheap emachines W3502 that I picked up at Walmart for about $300 bucks.
I have running pfsense on it since day one (started with pfsense WELL before 1.0) and since that time it has experienced very frequent random freezing lockups. I'm using 1.2.2 now. By lockup I mean the entire machine is completely frozen - it doesn't even respond to a ctrl-alt-delete from the keyboard. The only way to recover is to push the power button on the front of the case for a reboot. There is also nothing in the logs since the logs are wiped after a reboot. The machine ran Windows XP without incident for a few weeks when I first bought the machine.
I'm thinking there is some hardware issue since the lockups happened during the installtion of pfsense as well, but I can't figure out what the problem is as there is no rhyme or reason to when it freezes. Sometimes it happens within a few hours, other times it happens only after several weeks. The machine is under a very, very LOW network and CPU load. It is mostly idle. I experienced the issue in my office with a T1 line and now at home with a consumer 50/10 comcast cable line.
I have pulled out all non-essential hardware, replaced the HD, all network cards and will replace the RAM on Monday. This afternoon I disabled as many things as I could in the BIOS. Attached is the dmesg after everything has been disabled in BIOS. Can a pfsense expert review the output and give me some tips on where to dig deeper?
Copyright 1992-2008 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 7.0-RELEASE-p8 #0: Thu Jan 8 22:07:30 EST 2009
sullrich@freebsd7-releng_1_2_1.pfsense.org:/usr/obj.pfSense/usr/src/sys/pfSense.7
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel(R) Celeron(R) D CPU 3.20GHz (3200.14-MHz 686-class CPU)
Origin = "GenuineIntel" Id = 0xf64 Stepping = 4
Features=0xbfebfbff <fpu,vme,de,pse,tsc,msr,pae,mce,cx8,apic,sep,mtrr,pge,mca,cmov,pat,pse36,clflush,dts,acpi,mmx,fxsr,sse,sse2,ss,htt,tm,pbe>Features2=0xe41d <sse3,rsvd2,mon,ds_cpl,cnxt-id,cx16,xtpr,pdcm>AMD Features=0x20000000 <lm>AMD Features2=0x1 <lahf>real memory = 385249280 (367 MB)
avail memory = 362618880 (345 MB)
ACPI APIC Table: <gatewa 06dt047="">ioapic0: Changing APIC ID to 4
ioapic0 <version 2.1="">irqs 0-23 on motherboard
wlan: mac acl policy registered
kbd1 at kbdmux0
ath_hal: 0.9.20.3 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413)
hptrr: HPT RocketRAID controller driver v1.1 (Jan 8 2009 22:07:11)
cryptosoft0: <software crypto="">on motherboard
acpi0: <gatewa 06dt047="">on motherboard
acpi0: Overriding SCI Interrupt from IRQ 9 to IRQ 21
acpi0: [ITHREAD]
acpi0: Power Button (fixed)
unknown: I/O range not supported
acpi0: reservation of 0, a0000 (3) failed
acpi0: reservation of 100000, 17f00000 (3) failed
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <32-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0
cpu0: <acpi cpu="">on acpi0
p4tcc0: <cpu frequency="" thermal="" control="">on cpu0
acpi_button0: <power button="">on acpi0
pcib0: <acpi host-pci="" bridge="">port 0xcf8-0xcff iomem 0xfff00000-0xffffffff on acpi0
pci0: <acpi pci="" bus="">on pcib0
pcib1: <acpi pci-pci="" bridge="">at device 1.0 on pci0
pci1: <acpi pci="" bus="">on pcib1
vgapci0: <vga-compatible display="">port 0x2000-0x20ff mem 0x20000000-0x2fffffff,0x40100000-0x4010ffff irq 17 at device 5.0 on pci1
atapci0: <ati ixp400="" sata150="" controller="">port 0x3068-0x306f,0x3084-0x3087,0x3060-0x3067,0x3080-0x3083,0x3030-0x303f mem 0x40200600-0x402007ff irq 23 at device 17.0 on pci0
atapci0: [ITHREAD]
ata2: <ata 0="" channel="">on atapci0
ata2: [ITHREAD]
ata3: <ata 1="" channel="">on atapci0
ata3: [ITHREAD]
atapci1: <ati ixp400="" sata150="" controller="">port 0x3058-0x305f,0x307c-0x307f,0x3050-0x3057,0x3078-0x307b,0x3020-0x302f mem 0x40200400-0x402005ff irq 22 at device 18.0 on pci0
atapci1: [ITHREAD]
ata4: <ata 0="" channel="">on atapci1
ata4: [ITHREAD]
ata5: <ata 1="" channel="">on atapci1
ata5: [ITHREAD]
pci0: <serial bus,="" smbus="">at device 20.0 (no driver attached)
atapci2: <ati ixp400="" udma133="" controller="">port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x3000-0x300f at device 20.1 on pci0
ata0: <ata 0="" channel="">on atapci2
ata0: [ITHREAD]
ata1: <ata 1="" channel="">on atapci2
ata1: [ITHREAD]
isab0: <pci-isa bridge="">at device 20.3 on pci0
isa0: <isa bus="">on isab0
pcib2: <acpi pci-pci="" bridge="">at device 20.4 on pci0
pci2: <acpi pci="" bus="">on pcib2
re0: <us 997902="" robotics="" (rtl8169s)="" gigabit="" ethernet="">port 0x1100-0x11ff mem 0x40000100-0x400001ff irq 16 at device 3.0 on pci2
re0: Chip rev. 0x04000000
re0: MAC rev. 0x00000000
miibus0: <mii bus="">on re0
rgephy0: <rtl8169s 8110s="" 8211b="" media="" interface="">PHY 1 on miibus0
rgephy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
re0: Ethernet address: 00:14:c1:32:83:a7
re0: [FILTER]
re1: <us 997902="" robotics="" (rtl8169s)="" gigabit="" ethernet="">port 0x1000-0x10ff mem 0x40000000-0x400000ff irq 17 at device 4.0 on pci2
re1: Chip rev. 0x04000000
re1: MAC rev. 0x00000000
miibus1: <mii bus="">on re1
rgephy1: <rtl8169s 8110s="" 8211b="" media="" interface="">PHY 1 on miibus1
rgephy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
re1: Ethernet address: 00:14:c1:32:63:eb
re1: [FILTER]
speaker0: <pc speaker="">port 0x61 on acpi0
atkbdc0: <keyboard controller="" (i8042)="">port 0x60,0x64 irq 1 on acpi0
atkbd0: <at keyboard="">irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
atkbd0: [ITHREAD]
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A
sio0: [FILTER]
pmtimer0 on isa0
orm0: <isa option="" roms="">at iomem 0xc0000-0xcefff,0xcf000-0xd8fff pnpid ORM0000 on isa0
ppc0: parallel port not found.
sc0: <system console="">at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio1: configured irq 3 not in bitmap of probed irqs 0
sio1: port may not be enabled
vga0: <generic isa="" vga="">at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounter "TSC" frequency 3200144208 Hz quality 800
Timecounters tick every 1.000 msec
Fast IPsec: Initialized Security Association Processing.
hptrr: no controller detected.
ad0: 381554MB <seagate st3400832a="" 3.03="">at ata0-master UDMA100
acd0: CDRW <lite-on combo="" sohc-4836v="" sg$4="">at ata1-master UDMA33
Trying to mount root from ufs:/dev/ad0s1a___
/ f \
/ p \/ Sense
\/ \
\/Welcome to pfSense 1.2.2 on the 'pfSense' platform…
Mounting filesystems...
done.
Creating symlinks...
.
.
.
done.
Launching PHP init system...
done.
Initializing...
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
done.
Starting device manager (devd)...
done.
Loading configuration...
.
.
.
done.
Updating configuration...
done.
Cleaning backup cache...
done.
Setting up extended sysctls...
done.
Syncing user passwords...
done.
Starting Secure Shell Services...
done.
Setting timezone...
done.
Starting syslog...
done.
Configuring LAN interface...
re1: link state changed to UP
re1: link state changed to DOWN
done.
Configuring WAN interface...
re0: link state changed to DOWN
re1: link state changed to UP
re0: link state changed to UP
re0: link state changed to DOWN
re0: link state changed to UP
re0: link state changed to DOWN
done.
Configuring OPT interfaces...
done.
Configuring CARP interfaces...
done.
Starting OpenNTP time client...
done.
Configuring firewall...
.
pflog0: promiscuous mode enabled
.
.
done.
Starting webConfigurator...
done.
Starting DHCP service...
done.
Starting DNS forwarder...
done.
Setting up microcode and tx/rx offloading...
re0: link state changed to UP
done.
Starting FTP helpers...
done.
Final filter pass...
.
.
.
done.
Starting Secure Shell Services...
done.
Setting timezone...
done.
Starting syslog...
done.
Configuring LAN interface...
re1: link state changed to UP
re1: link state changed to DOWN
done.
Configuring WAN interface...
re0: link state changed to DOWN
re1: link state changed to UP
re0: link state changed to UP
re0: link state changed to DOWN
re0: link state changed to UP
re0: link state changed to DOWN
done.
Configuring OPT interfaces...
done.
Configuring CARP interfaces...
done.
Starting OpenNTP time client...
done.
Configuring firewall...
.
pflog0: promiscuous mode enabled
.
.
done.
Starting webConfigurator...
done.
Starting DHCP service...
done.
Starting DNS forwarder...
done.
Setting up microcode and tx/rx offloading...
re0: link state changed to UP
done.
Starting FTP helpers...
done.
Final filter pass...
.
.
.
done.
Generating RRD graphs...
done.
Starting DHCP service...
done.
Starting CRON...
done.
Syncing packages:
squid
squidcache
squidnac
squid
squidtraffic
squidupstream
squidauth
squidusers
squidcache
.
Executing rc.d items...
Starting /usr/local/etc/rc.d/proxy_monitor.sh...
done.
Starting /usr/local/etc/rc.d/squid.sh...
done.
Bootup complete
appending output to nohup.out</lite-on></seagate></generic></system></isa></at></keyboard></pc></rtl8169s></mii></us></rtl8169s></mii></us></acpi></acpi></isa></pci-isa></ata></ata></ati></serial></ata></ata></ati></ata></ata></ati></vga-compatible></acpi></acpi></acpi></acpi></power></cpu></acpi></gatewa></software></version></gatewa></lahf></lm></sse3,rsvd2,mon,ds_cpl,cnxt-id,cx16,xtpr,pdcm></fpu,vme,de,pse,tsc,msr,pae,mce,cx8,apic,sep,mtrr,pge,mca,cmov,pat,pse36,clflush,dts,acpi,mmx,fxsr,sse,sse2,ss,htt,tm,pbe> -
I just noticed that the VGA and my LAN NIC have the same IRQ 17. Am I reading this right? I guess I will try disabling the VGA output.
-
IRQ sharing is normal. I'd expect that you're looking at hardware problems. If you haven't already I'd suggest you try some basic diagnostics, at the very least a run through with Memtest86. I'd also check the logs to see if anything appears that can help.
-
This could also be a thermal problem.
You could try to clean all the fans and passive heat sinks.