Is this the type of issue fixed by bug #4445?
-
Hi,
I just read through the release notes for 2.2.1 and noticed a resolution for bug #4445.
I posted twice back in December when running 2.2 on Hyper-V:
Am I correct in deducing that the problems I reported are pretty similar to the one reported in bug #4445?
It's strange: 2.2 has been working fine on one Hyper-V server (a 1U SuperMicro with an Intel Atom C2758), but on another (a Xeon E5-2630L v2) we had the above problem.
Thanks for any comments/insight.
- Richard
-
The symptoms you described there are exactly how bug #4445 manifests itself. I'm not sure why it's so hit and miss between hosts; most Hyper-V hosts don't seem to hit it, but some repeatedly do. It's definitely fixed in 2.2.1 though.
-
Well, I upgraded to 2.2.1 on the 1U SuperMicro this evening and it seems to have been bitten now as well.
I can ssh into the firewall, and from there I can access the internet or the LAN, but there is no connectivity from the LAN to the outside.
I restored my pre-upgrade config, but still cannot connect.
Headed in to do a physical restore of the VM now.
-
Here's a bit more info:
-
Once I got onsite I could restore a backup of the VM from the night before. Tested and all was working fine.
-
Since I was up and onsite at 2am I decided to go on and try the 2.2.1 update again. The exact same thing happened.
-
Before I restored again I used sftp to grab as many of the log, config, and other files as I could think of, so we might have a better chance of understanding why this is happening.
I'm including the dmesg.boot file below. The end of it shows the disk I/O error, and it appears to be unable to mount; no real surprises here.
What other info might be of use? I've also got the rules.debug file, which was mentioned in bug #4445 as the file that gets corrupted by this problem. I don't think I want to publicly upload the whole file to the forum, but I could post excerpts if that might help.
It would also be great to know if there's a good way to patch the rules.debug file back to what it should be when this happens. That is not ideal, but much better than restoring the VM and losing a day's log files and history.
Thank you much - Richard
/var/log/dmesg.boot:
Copyright (c) 1992-2014 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 10.1-RELEASE-p6 #0 b69ba8f(releng/10.1)-dirty: Fri Mar 13 08:37:46 CDT 2015 root@pfs22-amd64-builder:/usr/obj.amd64/usr/pfSensesrc/src/sys/pfSense_SMP.10 amd64
FreeBSD clang version 3.4.1 (tags/RELEASE_34/dot1-final 208032) 20140512
CPU: Intel(R) Atom(TM) CPU C2758 @ 2.40GHz (2400.85-MHz K8-class CPU)
Origin = "GenuineIntel" Id = 0x406d8 Family = 0x6 Model = 0x4d Stepping = 8
Features=0x1f83fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE,SSE2,SS,HTT>
Features2=0xc2d82203<SSE3,PCLMULQDQ,SSSE3,CX16,SSE4.1,SSE4.2,MOVBE,POPCNT,AESNI,RDRAND,HV>
AMD Features=0x20100800<SYSCALL,NX,LM>
AMD Features2=0x101<LAHF,Prefetch>
Structured Extended Features=0x2200<ERMS>
real memory = 2147483648 (2048 MB)
avail memory = 2041348096 (1946 MB)
Event timer "LAPIC" quality 400
ACPI APIC Table: <VRTUAL MICROSFT>
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
FreeBSD/SMP: 1 package(s) x 2 core(s)
cpu0 (BSP): APIC ID: 0
cpu1 (AP): APIC ID: 1
ioapic0: Changing APIC ID to 0
ioapic0 <Version 1.1> irqs 0-23 on motherboard
wlan: mac acl policy registered
random: <Software, Yarrow> initialized
kbd1 at kbdmux0
vmbus0: <Vmbus Devices> on motherboard
cryptosoft0: <software crypto> on motherboard
padlock0: No ACE support.
acpi0: <VRTUAL MICROSFT> on motherboard
acpi0: Power Button (fixed)
acpi0: reservation of 0, a0000 (3) failed
acpi0: reservation of 100000, f7f00000 (3) failed
cpu0: <ACPI CPU> on acpi0
cpu1: <ACPI CPU> on acpi0
attimer0: <AT timer> port 0x40-0x43 irq 0 on acpi0
Timecounter "i8254" frequency 1193182 Hz quality 0
Event timer "i8254" frequency 1193182 Hz quality 100
atrtc0: <AT realtime clock> port 0x70-0x71 irq 8 on acpi0
Event timer "RTC" frequency 32768 Hz quality 0
Timecounter "ACPI-fast" frequency 3579545 Hz quality 900
acpi_timer0: <32-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
isab0: <PCI-ISA bridge> at device 7.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <Intel PIIX4 UDMA33 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xffa0-0xffaf at device 7.1 on pci0
ata0: <ATA channel> at channel 0 on atapci0
ata1: <ATA channel> at channel 1 on atapci0
pci0: <bridge> at device 7.3 (no driver attached)
vgapci0: <VGA-compatible display> mem 0xf8000000-0xfbffffff irq 11 at device 8.0 on pci0
vgapci0: Boot video device
atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: [GIANT-LOCKED]
psm0: model IntelliMouse Explorer, device ID 4
uart0: <16550 or compatible> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
uart0: console (115200,n,8,1)
uart1: <16550 or compatible> port 0x2f8-0x2ff irq 3 on acpi0
fdc0: <floppy drive controller (FDE)> port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
orm0: <ISA Option ROM> at iomem 0xc0000-0xcbfff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
ppc0: cannot reserve I/O port range
Timecounter "Hyper-V" frequency 10000000 Hz quality 10000000
Timecounters tick every 10.000 msec
IPsec: Initialized Security Association Processing.
storvsc0 on vmbus0
storvsc1 on vmbus0
hyperv-utils0 on vmbus0
hyperv-utils0: Hyper-V Service attaching: Hyper-V Heartbeat Service
hyperv-utils1 on vmbus0
hyperv-utils1: Hyper-V Service attaching: Hyper-V KVP Service
hyperv-utils2 on vmbus0
hyperv-utils2: Hyper-V Service attaching: Hyper-V Shutdown Service
hyperv-utils3 on vmbus0
hyperv-utils3: Hyper-V Service attaching: Hyper-V Time Synch Service
hn0: <Synthetic Network Interface> on vmbus0
random: unblocking device.
hn1: <Synthetic Network Interface> on vmbus0
ada0 at ata0 bus 0 scbus0 target 0 lun 0
ada0: <Virtual HD 1.1.0> ATA-8 device
ada0: 16.700MB/s transfers (WDMA2, PIO 65536bytes)
ada0: 8192MB (16777216 512 byte sectors: 16H 63S/T 16383C)
ada0: Previously was known as ad0
(ada0:ata0:0:0:0): SETFEATURES ENABLE WCACHE. ACB: ef 02 00 00 00 40 00 00 00 00 00 00
(ada0:ata0:0:0:0): CAM status: ATA Status Error
(ada0:ata0:0:0:0): ATA status: 51 (DRDY SERV ERR), error: 00 ()
(ada0:ata0:0:0:0): RES: 51 00 00 00 00 00 00 00 00 00 00
(ada0:ata0:0:0:0): Retrying command
cd0 at ata0 bus 0 scbus0 target 1 lun 0
cd0: <Msft Virtual CD/ROM 1.0> Removable CD-ROM SCSI-5 device
SMP: AP CPU #1 Launched!
cd0: 16.700MB/s transfers (WDMA2, ATAPI 12bytes, PIO 65534bytes)
cd0: Attempt to query device size failed: NOT READY, Medium not present
(ada0:ata0:0:0:0): SETFEATURES ENABLE WCACHE. ACB: ef 02 00 00 00 40 00 00 00 00 00 00
(ada0:ata0:0:0:0): CAM status: ATA Status Error
(ada0:ata0:0:0:0): ATA status: 51 (DRDY SERV ERR), error: 00 ()
(ada0:ata0:0:0:0): RES: 51 00 00 00 00 00 00 00 00 00 00
(ada0:ata0:0:0:0): Error 5, Retries exhausted
da0 at blkvsc0 bus 0 scbus2 target 0 lun 0
da0: <Msft Virtual Disk 1.0> Fixed Direct Access SCSI-4 device
da0: 300.000MB/s transfers
da0: Command Queueing enabled
da0: 8192MB (16777216 512 byte sectors: 255H 63S/T 1044C)
Trying to mount root from ufs:/dev/ada0s1a [rw]...
-
-
Am I seeing this problem because #4445 was not completely fixed in 2.2.1? Or is it because the bug manifests while I am still on 2.2, but in the process of upgrading to 2.2.1?
Is there anything I can do to better figure this out, or to recover from it at all?
I read the bug description that says "results in a rules.debug omitting all config-dependent nat and rdr lines from rules.debug", but I'm not savvy enough to try and fix that on the fly (nor am I even sure I should try).
Thank you - Richard
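For anyone wanting to check a saved copy of rules.debug against the symptom quoted from the bug description (no nat or rdr lines), here is a minimal Python sketch. It only counts translation rules in pf ruleset text; it does not repair anything. The helper name `count_translation_rules` and the sample ruleset (interface names, addresses) are made up for illustration and are not taken from pfSense or from the affected firewall.

```python
import re

# Hypothetical helper, not part of pfSense. Matches pf translation rules
# (nat, rdr, binat, optionally prefixed with "no") at the start of a line,
# but deliberately skips "nat-anchor" / "rdr-anchor" lines, which are
# present whether or not the config-dependent rules were written.
TRANSLATION_RE = re.compile(r"^(?:no\s+)?(nat|rdr|binat)\b(?!-anchor)")


def count_translation_rules(ruleset_text):
    """Return a dict with the number of nat, rdr and binat rules found."""
    counts = {"nat": 0, "rdr": 0, "binat": 0}
    for line in ruleset_text.splitlines():
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue  # skip blank lines and comments
        match = TRANSLATION_RE.match(stripped)
        if match:
            counts[match.group(1)] += 1
    return counts


# Illustrative ruleset only; every rule below is invented for the example.
sample_ruleset = """
set limit states 100000
nat-anchor "natearly/*"
rdr-anchor "relayd/*"
nat on hn0 from 192.168.1.0/24 to any -> 203.0.113.5/32 port 1024:65535
rdr on hn0 proto tcp from any to 203.0.113.5 port 443 -> 192.168.1.10
# nat on hn0 commented out
pass in quick on hn1 all
"""

counts = count_translation_rules(sample_ruleset)
print(counts)
if counts["nat"] == 0 and counts["rdr"] == 0:
    print("No nat/rdr rules found; this matches the symptom described in the bug.")
```

To run it against a real file, read the file into a string first, for example `open(path).read()`, where `path` is wherever you saved your copy. A zero count is only a hint: a firewall with no outbound NAT or port forwards configured would also show none.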
-
If you're still seeing "Dec 5 08:39:49 pfSense kernel: (ada0:ata1:0:1:0): Error 5, Retries exhausted", you're having a different issue of some sort. Any issue you're having on 2.2.1 at this point is unrelated to #4445, which is definitely and completely fixed in 2.2.1. Those disk errors could potentially cause similar symptoms if they're still occurring.
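A quick way to tell whether those disk errors are still occurring is to scan a saved dmesg.boot or system log for the "Retries exhausted" lines. The Python sketch below is one way to do that, assuming the log lines look like the two examples in this thread; the function name `find_exhausted_retries` is made up for illustration. Error 5 on FreeBSD is EIO, a generic I/O error.

```python
import re
from collections import Counter

# Matches CAM lines such as
#   (ada0:ata0:0:0:0): Error 5, Retries exhausted
# and captures the device name and the errno value.
CAM_ERROR_RE = re.compile(
    r"\((?P<dev>[a-z]+\d+):[^)]*\):\s+Error (?P<errno>\d+), Retries exhausted"
)


def find_exhausted_retries(log_text):
    """Count 'Retries exhausted' errors per (device, errno) pair."""
    hits = Counter()
    for match in CAM_ERROR_RE.finditer(log_text):
        hits[(match.group("dev"), int(match.group("errno")))] += 1
    return hits


# The two error lines below are copied from this thread; the "Retrying
# command" line is included to show that retries alone are not counted.
sample_log = """
(ada0:ata0:0:0:0): Retrying command
(ada0:ata0:0:0:0): Error 5, Retries exhausted
Dec 5 08:39:49 pfSense kernel: (ada0:ata1:0:1:0): Error 5, Retries exhausted
"""

for (device, errno), count in find_exhausted_retries(sample_log).items():
    print(f"{device}: errno {errno} seen {count} time(s)")
```

An empty result after the upgrade would suggest the disk errors have stopped, in which case the remaining connectivity problem needs a different explanation.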