Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    Kernel Panic

    Scheduled Pinned Locked Moved 2.0-RC Snapshot Feedback and Problems - RETIRED
    325 Posts 35 Posters 248.1k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • E
      eri--
      last edited by

      Confirmed fix since i found a way to reproduce it.
      Snapshots of tomorrow should be ok.

      1 Reply Last reply Reply Quote 0
      • jnorellJ
        jnorell
        last edited by

        A quick note, I updated my carp slave to Feb 2 04:04:51 EST 2011 with some of the recent kernel panic fixes, but I'm still able to panic it by adding a new VIP addr.  (I deleted a VIP addr, no problem; then re-added that addr, no problem; then added a new VIP addr, and it panic'd in devd again.)

        1 Reply Last reply Reply Quote 0
        • F
          fasteddy
          last edited by

          I looked through a similar topic (http://forum.pfsense.org/index.php/topic,31721.0.html) but wasn't entirely convinced I'm having the same issue, so a new topic seemed to be in order…

          I have had an IPSec  site-to-site VPN connection running trouble-free for months now using two pfSense PCs running 2.0 BETA i386 snapshots from May of 2010.  After updating to a January 27th snapshot (and snapshots from every day since, including today) I'm having problems with one of the PCs locking up.  Without fail, if I attempt to log on to the web configuration page of the remote pfSense box through the VPN connection, it will serve up the 'login' page, but crashes as soon as I hit the 'login' button.  The error message on the pfSense box's monitor says:

          
          Fatal trap 12: page fault while in kernel mode
          cpuid = 0; apic id = 00
          fault virtual address   = 0xd79a2720
          fault code              = supervisor read, page not present
          instruction pointer     = 0x20:0xd79a2720
          stack pointer           = 0x28:0xc5b26bbc
          frame pointer           = 0x28:0xc5b26bc8
          code segment            = base 0x0, limit 0xfffff, type 0x1b
                                  = DPL 0, pres 1, def32 1, gran 1
          processor eflags        = interrupt enabled, resume, IOPL = 0
          current process         = 0 (em3 taskq)
          
          

          I've attached a picture of the backtrace information as well.

          If I do a remote desktop connection to a PC on the remote side of the VPN, I can access the web configuration through that PC's browser with no issues.  The problem only occurs if I try to access the web configuration of the remote pfSense box (or ssh console, for that matter) directly from a machine on the local side of the VPN.  I've been doing this with no problems until I updated on the 27th.

          For what it's worth, this only happens on one of my two machines - if I'm on the 'sick' pfSense box's network I can access the 'healthy' pfSense box's web configuration directly through the VPN with no issues.  The two PCs are substantially different, but are using the same network cards - Intel Pro/1000MT gigabit NICs.  The 'sick' pfSense box also has an on-board Marvell Yukon Gigabit interface (skc0).

          The output of 'pciconf -lvp' is:

          
          skc0@pci0:0:10:0:       class=0x020000 card=0x811a1043 chip=0x432011ab rev=0x13 hdr=0x00
              class      = network
              subclass   = ethernet
              bar   [10] = type Memory, range 32, base 0xfa000000, size 16384, enabled
              bar   [14] = type I/O Port, range 32, base 0xa000, size 256, enabled
          em0@pci0:0:11:0:        class=0x020000 card=0x13768086 chip=0x107c8086 rev=0x05 hdr=0x00
              class      = network
              subclass   = ethernet
              bar   [10] = type Memory, range 32, base 0xfa300000, size 131072, enabled
              bar   [14] = type Memory, range 32, base 0xfa200000, size 131072, enabled
              bar   [18] = type I/O Port, range 32, base 0xa400, size 64, enabled
          em1@pci0:0:12:0:        class=0x020000 card=0x13768086 chip=0x107c8086 rev=0x05 hdr=0x00
              class      = network
              subclass   = ethernet
              bar   [10] = type Memory, range 32, base 0xfa600000, size 131072, enabled
              bar   [14] = type Memory, range 32, base 0xfa500000, size 131072, enabled
              bar   [18] = type I/O Port, range 32, base 0xa800, size 64, enabled
          em2@pci0:0:13:0:        class=0x020000 card=0x13768086 chip=0x107c8086 rev=0x05 hdr=0x00
              class      = network
              subclass   = ethernet
              bar   [10] = type Memory, range 32, base 0xfa900000, size 131072, enabled
              bar   [14] = type Memory, range 32, base 0xfa800000, size 131072, enabled
              bar   [18] = type I/O Port, range 32, base 0xb000, size 64, enabled
          em3@pci0:0:14:0:        class=0x020000 card=0x13768086 chip=0x107c8086 rev=0x05 hdr=0x00
              class      = network
              subclass   = ethernet
              bar   [10] = type Memory, range 32, base 0xfac00000, size 131072, enabled
              bar   [14] = type Memory, range 32, base 0xfab00000, size 131072, enabled
              bar   [18] = type I/O Port, range 32, base 0xb400, size 64, enabled
          
          

          Any chance someone could shed some light on this issue?  I can reliably reproduce the issue if need be, and would be happy to provide any additional information that may be required.

          Thank you, and best regards!

          fasteddy_backtrace.jpg
          fasteddy_backtrace.jpg_thumb

          1 Reply Last reply Reply Quote 0
          • jimpJ
            jimp Rebel Alliance Developer Netgate
            last edited by

            The snap from 2/3 would be the first to have the carp panic fixes.

            There is still an issue (when deleting a VIP) but the panic you're seeing has likely been fixed in the most recent snap that is up now (Thu Feb 3 00:55:19 EST 2011)

            Remember: Upvote with the 👍 button for any user/post you find to be helpful, informative, or deserving of recognition!

            Need help fast? Netgate Global Support!

            Do not Chat/PM for help!

            1 Reply Last reply Reply Quote 0
            • F
              FisherKing
              last edited by

              Hi Jimp - My read of the build logs suggested that Ermal's patch didn't get included in last nights build even though the build was started after the commit was made.

              Thu Feb  3 04:21:58 EST 2011 -|- >>> Sleeping for 86400 in between snapshot builder runs.  Last known commit 847e5e8257b58906a0d12ce48275cae7162aab47

              That commit listed there shows up in redmine as being a couple before ermals commit.  Am I reading that wrong?

              1 Reply Last reply Reply Quote 0
              • jimpJ
                jimp Rebel Alliance Developer Netgate
                last edited by

                I started the builder by hand after his commit.
                The patches are committed in the tools repo - the tools repo isn't tracked by the builder in the function you pasted.

                Remember: Upvote with the 👍 button for any user/post you find to be helpful, informative, or deserving of recognition!

                Need help fast? Netgate Global Support!

                Do not Chat/PM for help!

                1 Reply Last reply Reply Quote 0
                • F
                  FisherKing
                  last edited by

                  Hmmm - Just panicked, running 2.0-BETA5 (i386) built on Thu Feb 3 00:55:19 EST 2011 on both master & slave.

                  Updated a rule on the master, slave panicked.

                  I've attached images of the panic & bt.

                  ![](http://Carp panic.png)
                  ![](http://Carp bt.png)

                  ![Carp panic.png](/public/imported_attachments/1/Carp panic.png)
                  ![Carp panic.png_thumb](/public/imported_attachments/1/Carp panic.png_thumb)
                  ![Carp bt.png](/public/imported_attachments/1/Carp bt.png)
                  ![Carp bt.png_thumb](/public/imported_attachments/1/Carp bt.png_thumb)

                  1 Reply Last reply Reply Quote 0
                  • E
                    eri--
                    last edited by

                    The issue is fixed. Just the patch i committed had some typo from copy/pasto.

                    1 Reply Last reply Reply Quote 0
                    • E
                      eri--
                      last edited by

                      You should grab the next snapshot that will come out.
                      AFAIK it has no more such issues.

                      1 Reply Last reply Reply Quote 0
                      • jimpJ
                        jimp Rebel Alliance Developer Netgate
                        last edited by

                        I just started a new build after ermal's commit - the next new snapshot should include this fix.

                        Remember: Upvote with the 👍 button for any user/post you find to be helpful, informative, or deserving of recognition!

                        Need help fast? Netgate Global Support!

                        Do not Chat/PM for help!

                        1 Reply Last reply Reply Quote 0
                        • F
                          fasteddy
                          last edited by

                          Thanks Ermal!  I'll give the next snapshot a try when it is available and report back.

                          1 Reply Last reply Reply Quote 0
                          • F
                            FisherKing
                            last edited by

                            Running 2.0-BETA5 (i386) built on Thu Feb 3 18:55:08 EST 2011 on both master & slave.

                            Slave panicked as soon as I updated a rule on the master.

                            Attached images of both panic & bt.

                            ![](http://Carp Panic.png)

                            ![](http://carp bt.png)

                            ![Carp Panic.png](/public/imported_attachments/1/Carp Panic.png)
                            ![Carp Panic.png_thumb](/public/imported_attachments/1/Carp Panic.png_thumb)
                            ![carp bt.png](/public/imported_attachments/1/carp bt.png)
                            ![carp bt.png_thumb](/public/imported_attachments/1/carp bt.png_thumb)

                            1 Reply Last reply Reply Quote 0
                            • E
                              eri--
                              last edited by

                              Wrong snapshot sorry.
                              This is the commit that fixes the error https://rcs.pfsense.org/projects/pfsense-tools/repos/mainline/commits/08f1322c7d5d9fae8ef52dc356c75a59d2483263

                              1 Reply Last reply Reply Quote 0
                              • F
                                FisherKing
                                last edited by

                                Running 2.0-BETA5 (i386) built on Fri Feb 4 02:36:03 EST 2011 on both Master & Slave.

                                It ran longer this time before the panic.  I changed my configuration a little before it happened this time.

                                Steps:
                                1 - Pulled the plug on the master WAN
                                  (Slave became the new CARP master for all carp VIPs)
                                1a - Verified LAN clients were able to surf the web.
                                2 - Cleared XMLRPC Sync settings from the old master
                                3 - Set XMLRPC Sync settings on the new master (old slave)
                                4 - Changed DHCP settings on new master (old slave)
                                5 - Old master had a panic.

                                ![](http://Carp Panic1.png)

                                ![](http://carp bt1.png)

                                ![Carp Panic1.png](/public/imported_attachments/1/Carp Panic1.png)
                                ![Carp Panic1.png_thumb](/public/imported_attachments/1/Carp Panic1.png_thumb)
                                ![carp bt1.png](/public/imported_attachments/1/carp bt1.png)
                                ![carp bt1.png_thumb](/public/imported_attachments/1/carp bt1.png_thumb)

                                1 Reply Last reply Reply Quote 0
                                • jimpJ
                                  jimp Rebel Alliance Developer Netgate
                                  last edited by

                                  @PJ2:

                                  2 - Cleared XMLRPC Sync settings from the old master
                                  3 - Set XMLRPC Sync settings on the new master (old slave)

                                  Why on earth would you do that? That isn't necessary at all, and could lead to other issues.

                                  Remember: Upvote with the 👍 button for any user/post you find to be helpful, informative, or deserving of recognition!

                                  Need help fast? Netgate Global Support!

                                  Do not Chat/PM for help!

                                  1 Reply Last reply Reply Quote 0
                                  • V
                                    vito
                                    last edited by

                                    Jimp/Ermal
                                    just an update. I installed an updated firmware on another system about 5 days ago.
                                    All seem good!
                                    Thanks for all your help!!!!

                                    vito

                                    1 Reply Last reply Reply Quote 0
                                    • F
                                      FisherKing
                                      last edited by

                                      As I was typing I wondered if that might be the response I would get. It's always amazing what crazy and unexpected things an end user can come up with. :)

                                      This is a test environment, so I test things.
                                      Comp1 = initial CARP master, XMLRPC master
                                      Comp2 = initial CARP slave, XMLRPC slave

                                      I wanted to know what would happen if Comp1 had an issue and Comp2 took over.  Could I also make Comp2 become the XMLRPC master since it was acting as the CARP master?  I did clear the XMLRPC settings on Comp1 first.  I wouldn't have thought that reversing the roles would cause a panic.

                                      1 Reply Last reply Reply Quote 0
                                      • jimpJ
                                        jimp Rebel Alliance Developer Netgate
                                        last edited by

                                        When a master syncs the config to a slave it makes several alterations in the process.

                                        Just flipping a slave to a master in the XMLRPC settings it going to cause you problems because of this.

                                        If you really want to swap their roles, restore the master's config to the slave box and vice versa.

                                        When a CARP slave takes over automatically - if you have configured everything right - it can function that way indefinitely until you bring the master back up, you just can't make any changes to the slave's config you want to keep. But that is all a topic for another thread. It's not the source of the panic - it's just a Bad Thing™.

                                        Remember: Upvote with the 👍 button for any user/post you find to be helpful, informative, or deserving of recognition!

                                        Need help fast? Netgate Global Support!

                                        Do not Chat/PM for help!

                                        1 Reply Last reply Reply Quote 0
                                        • F
                                          FisherKing
                                          last edited by

                                          Good to know - I'll avoid doing that in the future.

                                          1 Reply Last reply Reply Quote 0
                                          • P
                                            pwnell
                                            last edited by

                                            Ok so I waited until I thought this snapshot would be okaish… Within 10 minutes of installing this my machine crashed:

                                            2.0-BETA5 (i386)
                                            built on Fri Feb 4 15:47:28 EST 2011

                                            Pentium(R) Dual-Core CPU E5400 @ 2.70GHz

                                            [2.0-BETA5][root@fw.home]/root(1): uname -a
                                            FreeBSD fw.home 8.1-RELEASE-p2 FreeBSD 8.1-RELEASE-p2 #1: Fri Feb  4 15:45:20 EST 2011    sullrich@FreeBSD_8.0_pfSense_2.0-snaps.pfsense.org:/usr/obj.pfSense/usr/pfSensesrc/src/sys/pfSense_SMP.8  i386

                                            –---------

                                            Did not do anything special on the firewall when it died.

                                            IMG_0233.JPG
                                            IMG_0233.JPG_thumb

                                            1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post
                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.