Netgate Discussion Forum

    Kernel panic 4-5 Nov (i386)

    2.0-RC Snapshot Feedback and Problems - RETIRED
    89 Posts 17 Posters 36.8k Views
    • Michael Sh.

      Hi,

      pfSense-Full-Update-2.0-BETA4-20101104-1041.tgz
      pfSense-Full-Update-2.0-BETA4-20101104-2246.tgz
      Kernel panic in an interrupt routine after some level of network load (bfe0, ath0).
      Rolled back to the 30 Oct snapshot and all is OK. I can't say anything about the 1-3 Nov snapshots.

      • jimp Rebel Alliance Developer Netgate

        Need a lot more info than that. Any way you can get a capture of the kernel panic message at least? Or possibly switch to a developer kernel and get a "bt" output when the panic happens too?

        Or some idea of the traffic level involved?

        There were some performance patches added on Nov 3 that may be involved, but a lot more detail would be needed to track down how/why.

        Remember: Upvote with the 👍 button for any user/post you find to be helpful, informative, or deserving of recognition!

        Need help fast? Netgate Global Support!

        Do not Chat/PM for help!

        • pakjebakmeel

          Just to confirm, I have today's nanoBSD 4GB snapshot running on my ALIX board and it seems to reboot when navigating the WebGUI. It rebooted out of the blue when loading the page to add a new firewall rule but also when loading the interface statistics page it rebooted a couple of times. Haven't seen that before but it has now done it about 10 to 15 times in a couple of days.

          I can reproduce the error by invoking the auto updater: when the download reaches approx. 3%, the interface freezes and the box reboots. Upgrading from a URL through SSH is fine.

          I would be happy to provide log files when it happens again, but which ones would you need? I checked the system log but couldn't find anything interesting. ???

          When I leave the box alone it doesn't seem to reboot, regardless of the throughput.

          • jimp Rebel Alliance Developer Netgate

            Keep the serial console connected and get the output from around the time it resets. Before the reboot messages show up. It may be a panic message, or something else.

            You should be able to copy/paste it out of the serial terminal.

            • pakjebakmeel

              @jimp:

              Keep the serial console connected and get the output from around the time it resets. Before the reboot messages show up. It may be a panic message, or something else.

              You should be able to copy/paste it out of the serial terminal.

              Ok cool, I will do that now. I just updated my previous post that I can reproduce the issue.

              • m4rcu5

                Unfortunately I can only confirm the kernel panic (page fault). It happens when booting, just after the interface configuration.
                I am reinstalling at this moment to see if I can get the system working again.

                -m4rcu5

                • jimp Rebel Alliance Developer Netgate

                  @m4rcu5:

                  Unfortunately I can only confirm the kernel panic (page fault). It happens when booting, just after the interface configuration.
                  I am reinstalling at this moment to see if I can get the system working again.

                  -m4rcu5

                  That sounds more like the amd64 problem, this is on i386.

                  • pakjebakmeel

                    OK, got it. Couldn't invoke the auto updater as there is no newer snapshot, but after randomly clicking around in the web interface for a minute it went down halfway through loading "Diagnostics -> ARP Table".

                    
                    Fatal trap 12: page fault while in kernel mode
                    cpuid = 0; apic id = 00
                    
                    fault virtual address	= 0x10317
                    fault code			= supervisor read, page not present
                    instruction pointer	= 0x20:0xc095f46b
                    stack pointer		= 0x28:0xe2e21bc4
                    frame pointer		= 0x28:0xe2e21bc8
                    code segment		= base 0x0, limit 0xfffff, type 0x1b
                    				= DPL 0. pres 1, def32 1, gran 1
                    processor eflags	= interrupt enables, resume, IOPL = 0
                    current process	= 0 (ath0 taskq)
                    trap number		= 12
                    
                    panic: page fault
                    cpuid = 0
                    
                    Cannot dump. Device not defined or unavailable.
                    
                    Automatic reboot in 15 seconds - press a key on the console to abort
                    Rebooting...
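
Since reports in this thread are being compared by fault code and process name, a small script can pull those fields out of a pasted trap message. This is only a sketch, not a pfSense or FreeBSD tool: the names `parse_trap` and `faulting_thread` are made up for illustration, and it assumes the `label = value` layout seen in the trace above.

```python
import re

# Sample lines copied from the trace above (tabs replaced by spaces).
TRACE = """\
Fatal trap 12: page fault while in kernel mode
cpuid = 0; apic id = 00
fault virtual address = 0x10317
fault code = supervisor read, page not present
instruction pointer = 0x20:0xc095f46b
current process = 0 (ath0 taskq)
trap number = 12
panic: page fault
"""


def parse_trap(text):
    """Collect the 'label = value' lines of a trap message into a dict."""
    fields = {}
    for line in text.splitlines():
        # A label is lowercase words; the first '=' ends it.
        m = re.match(r"\s*([a-z][a-z ]*?)\s*=\s*(.+)", line)
        if m:
            fields[m.group(1)] = m.group(2).strip()
    return fields


def faulting_thread(fields):
    """Return the name in parentheses from 'current process', if any."""
    m = re.search(r"\((.+)\)", fields.get("current process", ""))
    return m.group(1) if m else None


if __name__ == "__main__":
    fields = parse_trap(TRACE)
    print(fields["fault code"], "|", faulting_thread(fields))
```

Running the same script over two pasted traces makes it easy to see whether the fault code matches and only the interface taskq differs.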
                    
                    
                    • m4rcu5

                      @jimp:

                      @m4rcu5:

                      Unfortunately I can only confirm the kernel panic (page fault). It happens when booting, just after the interface configuration.
                      I am reinstalling at this moment to see if I can get the system working again.

                      -m4rcu5

                      That sounds more like the amd64 problem, this is on i386.

                      jimp, I use the i386 image on an Intel Core 2 Duo.

                      I have the exact same fault code on the same process (only on interface em0) as pakjebakmeel. This time it happened when booted from the live CD.

                      -m4rcu5

                      • pakjebakmeel

                        For me it only happens when navigating the webGUI.. If I leave the GUI alone and generate massive traffic the box is rock solid.

                        • Michael Sh.

                          Hi jimp,
                          Sorry for the screenshots, but I use real hardware (an ASUS Pundit) and have no serial console. Before my first post, the panic came within 5 minutes at most. On the latest snapshot it takes longer, although as far as I know there are no differences between the snapshots.
                          This panic appeared immediately after running a fetch command on the pfSense host.

                          pf_panic.jpg
                          systat_vm.PNG
                          wan.svg.txt
                          lan.svg.txt

                          • pakjebakmeel

                            Hi all,

                            Is this bug still present or can it be fixed by upgrading to a newer snapshot? 8)

                            Thanks!

                            • jimp Rebel Alliance Developer Netgate

                              It's still there. I just restarted the builder again to see if the fixes checked in yesterday made a difference. It should be done in a while but I'd still wait for an all-clear.

                              • jimp Rebel Alliance Developer Netgate

                                Current snap is OK:

                                2.0-BETA4 (amd64)
                                built on Tue Nov 9 17:26:01 UTC 2010

                                • Michael Sh.

                                  pfSense-Full-Update-2.0-BETA4-20101109-1641.tgz

                                  panic.jpg

                                  • jimp Rebel Alliance Developer Netgate

                                    Ah, well I was hoping it may be a similar issue to amd64. Looks like it may be different.

                                    • Michael Sh.

                                      pfSense-Full-Update-2.0-BETA4-20101110-0504.tgz
                                      Same thing, just from a different angle. It seems to me that traffic generated by the router itself is the reason for the crash. Without this traffic the router keeps working for longer, but any attempt to fetch something from the router always causes a panic.

                                      • jimp Rebel Alliance Developer Netgate

                                        I've been sitting here furiously loading GUI pages on my poor little ALIX running a snapshot from today and though it's gotten slow at times, I have yet to see a panic.

                                        Is there anything else people in this thread might have in common? What kind of setups do you all have? Can you give a general idea of things that are in use? (Multi-wan, IPsec, OpenVPN, PPPoE, 3G, wireless, etc)

                                        • FisherKing

                                          Not to cause you more frustration Jimp - but I'm seeing this GUI / Kernel Panic also.

                                          Running 2010-11-09 (i386) on a PIII.
                                          Reset to factory defaults yesterday & did a basic install
                                          I've got 2 dual port intel nics.
                                          fxp0 = WAN PPPoE
                                          fxp0 = Opt3 DHCP private (10.x.x.x)
                                          fxp1 = Opt1 DMZ static public
                                          fxp2 = LAN static private (172.x.x.x)
                                          fxp3 = Opt2 static private (172.y.y.y)

                                          I'm running DHCP server on LAN and Opt2.
                                          I'm logging in under a 2nd administrator account.
                                          I'm running Captive Portal on Opt2 w/ local auth.
                                          Firewall allows web, mail, DNS traffic through to the public IPs on the DMZ.
                                          I have freeswitch installed on the box, but it isn't the pfSense package. pfSense doesn't know it's there.

                                          How does this match up with what the rest of you have?

                                          • FisherKing

                                            I've noticed the following on the kernel panic screen.

                                            Cannot dump. Device not defined or unavailable.

                                            Would the dump be helpful in diagnosing this? Is there a place where we can find directions on how to give it a dump device? Is it possible to dump to the local disk, or to a USB memory stick?

                                            I do have a null modem cable & could dump to the serial port / windows terminal app if that's how it's done, but I'd need directions for that also.

                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.