Freezing



  • I've had this machine for about three years.  It is a very, very cheap eMachines W3502 that I picked up at Walmart for about $300.

    I have been running pfSense on it since day one (started with pfSense WELL before 1.0), and since that time it has experienced very frequent random lockups.  I'm using 1.2.2 now.  By lockup I mean the entire machine is completely frozen - it doesn't even respond to a ctrl-alt-delete from the keyboard.  The only way to recover is to push the power button on the front of the case for a reboot.  There is also nothing in the logs, since the logs are wiped after a reboot.  The machine ran Windows XP without incident for a few weeks when I first bought it.

    I'm thinking there is some hardware issue, since the lockups happened during the installation of pfSense as well, but I can't figure out what the problem is, as there is no rhyme or reason to when it freezes.  Sometimes it happens within a few hours, other times only after several weeks.  The machine is under a very, very LOW network and CPU load.  It is mostly idle.  I experienced the issue in my office with a T1 line and now at home with a consumer 50/10 Comcast cable line.

    I have pulled out all non-essential hardware, replaced the HD and all network cards, and will replace the RAM on Monday.  This afternoon I disabled as many things as I could in the BIOS.  Below is the dmesg after everything has been disabled in the BIOS.  Can a pfSense expert review the output and give me some tips on where to dig deeper?

    Copyright (c) 1992-2008 The FreeBSD Project.
    Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
            The Regents of the University of California. All rights reserved.
    FreeBSD is a registered trademark of The FreeBSD Foundation.
    FreeBSD 7.0-RELEASE-p8 #0: Thu Jan  8 22:07:30 EST 2009
        sullrich@freebsd7-releng_1_2_1.pfsense.org:/usr/obj.pfSense/usr/src/sys/pfSense.7
    Timecounter "i8254" frequency 1193182 Hz quality 0
    CPU: Intel(R) Celeron(R) D CPU 3.20GHz (3200.14-MHz 686-class CPU)
      Origin = "GenuineIntel"  Id = 0xf64  Stepping = 4
      Features=0xbfebfbff <fpu,vme,de,pse,tsc,msr,pae,mce,cx8,apic,sep,mtrr,pge,mca,cmov,pat,pse36,clflush,dts,acpi,mmx,fxsr,sse,sse2,ss,htt,tm,pbe>
      Features2=0xe41d <sse3,rsvd2,mon,ds_cpl,cnxt-id,cx16,xtpr,pdcm>
      AMD Features=0x20000000 <lm>
      AMD Features2=0x1 <lahf>
    real memory  = 385249280 (367 MB)
    avail memory = 362618880 (345 MB)
    ACPI APIC Table: <gatewa 06dt047>
    ioapic0: Changing APIC ID to 4
    ioapic0 <version 2.1> irqs 0-23 on motherboard
    wlan: mac acl policy registered
    kbd1 at kbdmux0
    ath_hal: 0.9.20.3 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413)
    hptrr: HPT RocketRAID controller driver v1.1 (Jan  8 2009 22:07:11)
    cryptosoft0: <software crypto> on motherboard
    acpi0: <gatewa 06dt047> on motherboard
    acpi0: Overriding SCI Interrupt from IRQ 9 to IRQ 21
    acpi0: [ITHREAD]
    acpi0: Power Button (fixed)
    unknown: I/O range not supported
    acpi0: reservation of 0, a0000 (3) failed
    acpi0: reservation of 100000, 17f00000 (3) failed
    Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
    acpi_timer0: <32-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0
    cpu0: <acpi cpu> on acpi0
    p4tcc0: <cpu frequency thermal control> on cpu0
    acpi_button0: <power button> on acpi0
    pcib0: <acpi host-pci bridge> port 0xcf8-0xcff iomem 0xfff00000-0xffffffff on acpi0
    pci0: <acpi pci bus> on pcib0
    pcib1: <acpi pci-pci bridge> at device 1.0 on pci0
    pci1: <acpi pci bus> on pcib1
    vgapci0: <vga-compatible display> port 0x2000-0x20ff mem 0x20000000-0x2fffffff,0x40100000-0x4010ffff irq 17 at device 5.0 on pci1
    atapci0: <ati ixp400 sata150 controller> port 0x3068-0x306f,0x3084-0x3087,0x3060-0x3067,0x3080-0x3083,0x3030-0x303f mem 0x40200600-0x402007ff irq 23 at device 17.0 on pci0
    atapci0: [ITHREAD]
    ata2: <ata 0 channel> on atapci0
    ata2: [ITHREAD]
    ata3: <ata 1 channel> on atapci0
    ata3: [ITHREAD]
    atapci1: <ati ixp400 sata150 controller> port 0x3058-0x305f,0x307c-0x307f,0x3050-0x3057,0x3078-0x307b,0x3020-0x302f mem 0x40200400-0x402005ff irq 22 at device 18.0 on pci0
    atapci1: [ITHREAD]
    ata4: <ata 0 channel> on atapci1
    ata4: [ITHREAD]
    ata5: <ata 1 channel> on atapci1
    ata5: [ITHREAD]
    pci0: <serial bus, smbus> at device 20.0 (no driver attached)
    atapci2: <ati ixp400 udma133 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x3000-0x300f at device 20.1 on pci0
    ata0: <ata 0 channel> on atapci2
    ata0: [ITHREAD]
    ata1: <ata 1 channel> on atapci2
    ata1: [ITHREAD]
    isab0: <pci-isa bridge> at device 20.3 on pci0
    isa0: <isa bus> on isab0
    pcib2: <acpi pci-pci bridge> at device 20.4 on pci0
    pci2: <acpi pci bus> on pcib2
    re0: <us 997902 robotics (rtl8169s) gigabit ethernet> port 0x1100-0x11ff mem 0x40000100-0x400001ff irq 16 at device 3.0 on pci2
    re0: Chip rev. 0x04000000
    re0: MAC rev. 0x00000000
    miibus0: <mii bus> on re0
    rgephy0: <rtl8169s 8110s 8211b media interface> PHY 1 on miibus0
    rgephy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
    re0: Ethernet address: 00:14:c1:32:83:a7
    re0: [FILTER]
    re1: <us 997902 robotics (rtl8169s) gigabit ethernet> port 0x1000-0x10ff mem 0x40000000-0x400000ff irq 17 at device 4.0 on pci2
    re1: Chip rev. 0x04000000
    re1: MAC rev. 0x00000000
    miibus1: <mii bus> on re1
    rgephy1: <rtl8169s 8110s 8211b media interface> PHY 1 on miibus1
    rgephy1:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
    re1: Ethernet address: 00:14:c1:32:63:eb
    re1: [FILTER]
    speaker0: <pc speaker> port 0x61 on acpi0
    atkbdc0: <keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0
    atkbd0: <at keyboard> irq 1 on atkbdc0
    kbd0 at atkbd0
    atkbd0: [GIANT-LOCKED]
    atkbd0: [ITHREAD]
    sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
    sio0: type 16550A
    sio0: [FILTER]
    pmtimer0 on isa0
    orm0: <isa option roms> at iomem 0xc0000-0xcefff,0xcf000-0xd8fff pnpid ORM0000 on isa0
    ppc0: parallel port not found.
    sc0: <system console> at flags 0x100 on isa0
    sc0: VGA <16 virtual consoles, flags=0x300>
    sio1: configured irq 3 not in bitmap of probed irqs 0
    sio1: port may not be enabled
    vga0: <generic isa vga> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
    Timecounter "TSC" frequency 3200144208 Hz quality 800
    Timecounters tick every 1.000 msec
    Fast IPsec: Initialized Security Association Processing.
    hptrr: no controller detected.
    ad0: 381554MB <seagate st3400832a 3.03> at ata0-master UDMA100
    acd0: CDRW <lite-on combo sohc-4836v sg$4> at ata1-master UDMA33
    Trying to mount root from ufs:/dev/ad0s1a

    ___
    / f \
    / p \
    / Sense
    \/   \
        \
    /

    Welcome to pfSense 1.2.2 on the 'pfSense' platform…

    Mounting filesystems...
    done.
    Creating symlinks...
    .
    .
    .
    done.
    Launching PHP init system...
    done.
    Initializing...
    .
    .
    .
    .
    .
    .
    .
    .
    .
    .
    .
    .
    .
    .
    .
    done.
    Starting device manager (devd)...
    done.
    Loading configuration...
    .
    .
    .
    done.
    Updating configuration...
    done.
    Cleaning backup cache...
    done.
    Setting up extended sysctls...
    done.
    Syncing user passwords...
    done.
    Starting Secure Shell Services...
    done.
    Setting timezone...
    done.
    Starting syslog...
    done.
    Configuring LAN interface...
    re1: link state changed to UP
    re1: link state changed to DOWN
    done.
    Configuring WAN interface...
    re0: link state changed to DOWN
    re1: link state changed to UP
    re0: link state changed to UP
    re0: link state changed to DOWN
    re0: link state changed to UP
    re0: link state changed to DOWN
    done.
    Configuring OPT interfaces...
    done.
    Configuring CARP interfaces...
    done.
    Starting OpenNTP time client...
    done.
    Configuring firewall...
    .
    pflog0: promiscuous mode enabled
    .
    .
    done.
    Starting webConfigurator...
    done.
    Starting DHCP service...
    done.
    Starting DNS forwarder...
    done.
    Setting up microcode and tx/rx offloading...
    re0: link state changed to UP
    done.
    Starting FTP helpers...
    done.
    Final filter pass...
    .
    .
    .
    done.
    Starting Secure Shell Services...
    done.
    Setting timezone...
    done.
    Starting syslog...
    done.
    Configuring LAN interface...
    re1: link state changed to UP
    re1: link state changed to DOWN
    done.
    Configuring WAN interface...
    re0: link state changed to DOWN
    re1: link state changed to UP
    re0: link state changed to UP
    re0: link state changed to DOWN
    re0: link state changed to UP
    re0: link state changed to DOWN
    done.
    Configuring OPT interfaces...
    done.
    Configuring CARP interfaces...
    done.
    Starting OpenNTP time client...
    done.
    Configuring firewall...
    .
    pflog0: promiscuous mode enabled
    .
    .
    done.
    Starting webConfigurator...
    done.
    Starting DHCP service...
    done.
    Starting DNS forwarder...
    done.
    Setting up microcode and tx/rx offloading...
    re0: link state changed to UP
    done.
    Starting FTP helpers...
    done.
    Final filter pass...
    .
    .
    .
    done.
    Generating RRD graphs...
    done.
    Starting DHCP service...
    done.
    Starting CRON...
    done.
    Syncing packages:
    squid
    squidcache
    squidnac
    squid
    squidtraffic
    squidupstream
    squidauth
    squidusers
    squidcache
    .
    Executing rc.d items...
    Starting /usr/local/etc/rc.d/proxy_monitor.sh...
    done.
    Starting /usr/local/etc/rc.d/squid.sh...
    done.
    Bootup complete
    appending output to nohup.out



  • I just noticed that the VGA and my LAN NIC have the same IRQ 17.  Am I reading this right?  I guess I will try disabling the VGA output.



  • IRQ sharing is normal.  I'd expect that you're looking at a hardware problem.  If you haven't already, I'd suggest you run some basic diagnostics, at the very least a pass with Memtest86.  I'd also check the logs to see if anything appears that can help.
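
    For anyone who wants to double-check which devices share an interrupt without reading the whole dmesg by eye, here is a minimal sketch in Python. It is only an illustration: the function names and the `SAMPLE` text are hypothetical, the sample lines are abbreviated from the output posted above, and the regular expression assumes the FreeBSD probe-line layout seen there (`name: <description> ... irq N at|on|flags ...`).

```python
import re
from collections import defaultdict

# Assumed layout of a probe line: "re1: <...> port ... irq 17 at device 4.0 on pci2".
# Requiring "at", "on" or "flags" after the number skips warnings such as
# "sio1: configured irq 3 not in bitmap of probed irqs 0".
IRQ_LINE = re.compile(r"\s*(\w+):.*\birq (\d+) (?:at|on|flags)\b")


def devices_by_irq(dmesg_text):
    """Map each IRQ number to the device names that dmesg reports on it."""
    table = defaultdict(list)
    for line in dmesg_text.splitlines():
        match = IRQ_LINE.match(line)
        if match:
            device, irq = match.group(1), int(match.group(2))
            if device not in table[irq]:
                table[irq].append(device)
    return dict(table)


def shared_irqs(dmesg_text):
    """Keep only the IRQs that more than one device is attached to."""
    return {irq: devs for irq, devs in devices_by_irq(dmesg_text).items()
            if len(devs) > 1}


# Hypothetical sample, shortened from the dmesg in this thread.
SAMPLE = """\
vgapci0: <vga-compatible display> port 0x2000-0x20ff irq 17 at device 5.0 on pci1
re0: <gigabit ethernet> port 0x1100-0x11ff irq 16 at device 3.0 on pci2
re1: <gigabit ethernet> port 0x1000-0x10ff irq 17 at device 4.0 on pci2
sio1: configured irq 3 not in bitmap of probed irqs 0
"""

if __name__ == "__main__":
    for irq, devices in sorted(shared_irqs(SAMPLE).items()):
        print(f"irq {irq}: {', '.join(devices)}")
```

    On the sample this reports vgapci0 and re1 together on IRQ 17, which matches the reading above. A shared line in the list is not a fault by itself; it only tells you which devices to look at together.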



  • This could also be a thermal problem.
    Try cleaning all the fans and passive heat sinks.

