Netgate Discussion Forum

    Kernel Panic

    2.0-RC Snapshot Feedback and Problems - RETIRED
    • E
      eri--

      Can you please describe your setup?

      EDIT: I added even more safety checks, so your CARP panic should not happen on new snapshots.

      • D
        disa

        Hi, I'll open a ticket tomorrow with all the details on my setup. By the way, I have had the 2 nodes running 2.0-RC1 (amd64) built on Sun Feb 13 23:53:14 EST 2011 for a few hours without problems. I didn't work on any new CARP IPs, only on OpenVPN: the master was synced to the slave many times without problems. I'll try to play more with CARP tomorrow and let you know :-)
        thanks

        • D
          disa

          (Ticket #KZZ-134399 opened)
          The cluster has now been in production for 1 day without any problems.
          Here is one strange thing I saw in the logs (on the slave) the other day. What happened?

          Feb 13 22:14:50 kernel: vip7: link state changed to DOWN
          Feb 13 22:14:50 kernel: vip7: MASTER -> BACKUP (more frequent advertisement received)
          Feb 13 22:14:50 kernel: vip8: link state changed to DOWN
          Feb 13 22:14:50 kernel: vip9: link state changed to DOWN
          Feb 13 22:14:50 kernel: nt received)
          Feb 13 22:14:50 kernel:
          Feb 13 22:14:50 kernel: e<5m>Ne
          Feb 13 22:14:50 kernel: <d6o>Wis
          Feb 13 22:14:50 kernel: e<5r>to
          Feb 13 22:14:50 kernel: d< 6>tv
          Feb 13 22:14:50 kernel: <5a>dge
          Feb 13 22:14:50 kernel: h<6a>nt
          Feb 13 22:14:50 kernel: q<u5>en c
          Feb 13 22:14:50 kernel: t<a6t>ere
          Feb 13 22:14:50 kernel: e< 5f>s
          Feb 13 22:14:50 kernel: <k6>m or
          Feb 13 22:14:50 kernel: P<5 >(n
          Feb 13 22:14:50 kernel: :< 6l>iU
          Feb 13 22:14:50 kernel: B<a5c>K1
          Feb 13 22:14:50 kernel: i<6p>
          Feb 13 22:14:50 kernel: -<5>>v
          Feb 13 22:14:50 kernel: vip8: MASTER
          Feb 13 22:14:50 kernel: vip10: link state changed to DOWN
          Feb 13 22:14:50 kernel: vip11: link state changed to DOWN
          Feb 13 22:14:50 kernel: vip9: MASTER -> BACKUP (more frequent advertisement received)
          Feb 13 22:14:50 kernel: vip1: MASTER -> BACKUP (more frequent advertisement received)
          Feb 13 22:14:50 kernel: : MASTER -> BACKUP (more frequent advertisement received)
          Feb 13 22:14:50 kernel:
          Feb 13 22:14:50 kernel: p<150>N
          Feb 13 22:14:50 kernel: o <d6o>Wvi
          Feb 13 22:14:50 kernel: vip12: link state changed t
          Feb 13 22:14:50 kernel: vip11: MASTER -> BACKUP (more frequent advertisement received)
          Feb 13 22:14:50 kernel: vip12: MASTER -> BACKUP (more frequent advertisement received)
          Feb 13 22:14:49 dhcpd: For info, please visit https://www.isc.org/software/dhcp/
          Feb 13 22:14:49 dhcpd: All rights reserved.
          Feb 13 22:14:49 dhcpd: Copyright 2004-2010 Internet Systems Consortium.
          Feb 13 22:14:49 dhcpd: Internet Systems Consortium DHCP Server 4.1.1-P1

          • jimpJ
            jimp Rebel Alliance Developer Netgate

            The next snap (building now) should have more carp panic fixes.

            Not sure about the crazy kernel message there. It looks like it may have been two messages overlapping, though it is a bit more corrupted than that usually is.

            Remember: Upvote with the 👍 button for any user/post you find to be helpful, informative, or deserving of recognition!

            Need help fast? Netgate Global Support!

            Do not Chat/PM for help!

            • L
              LostInIgnorance

              I would like to report that since it was updated on Sunday (when the snapshot server was available again), the Dell P4 with the integrated em gigE has not had any panics and is running well.  Thank you all for all the hard work fixing the issue!  :D

              • S
                spacelui

                Hi,

                Like cyber7, I've been getting kernel panics (pf_state_tree_id_RB_REMOVE_COLOR, with pfpurge always the current process) for the past two weeks (approx. 2 to 3 times a week) on a VMware setup. Here's a screenshot:

                • jimpJ
                  jimp Rebel Alliance Developer Netgate

                  spacelui,

                  amd64 or i386? snapshot date? What type of setup, CARP? Multi-WAN? anything special going on with FTP/PPTP? Any more detail might help. Posting the same panic that someone else had doesn't help much (unless you also have the bt output to go with it) but if we can track down what about your setup might be related to the panics, that would be more helpful. What we need is to find some commonality between the people still getting them.

                  • cyber7C
                    cyber7

                    Hi Guys.
                    I am still getting the same panic, just this time with a cmpl error.  I am busy updating to the new snapshot, but why is it that after months of running fine, I am starting to see this panic?

                    Kind regards
                    Aubrey Kloppers
                    ps - I am updating to the following snapshot:

                    Auto Update Download Status
                    ---------------------------------------------------
                      Current Version : 2.0-RC1
                      Latest Version  : Tue Feb 15 16:36:07 EST 2011

                    When you pause to think, do you start again?

                    2.2.4-RELEASE (amd64)
                    built on Sat Jul 25 19:57:37 CDT 2015
                    FreeBSD 10.1-RELEASE-p15
                    and
                    pfSense 2.3.2-RELEASE-p1 (amd64 full-install) on pfSense

                    • jimpJ
                      jimp Rebel Alliance Developer Netgate

                      If it's a different error, it's not the same panic. We need the full text of the panic and the backtrace (bt at the db> prompt) in order to say anything.

                      But don't report anything until you've replicated it on the new snapshot you're updating to.

                      • cyber7C
                        cyber7

                        Thank you Jim.
                        ps - It was nice to chat with you yesterday.  Do you never sleep? :)

                        Kind regards
                        Aubrey Kloppers

                        • S
                          spacelui

                          Sorry, I cut the details…
                          It's yesterday's 2.0-RC1 snapshot running on i386.

                          No multi-WAN, no CARP, and I do have some incoming PPTP traffic going to a NATed server on the LAN. VMware ESX setup. I upgrade every week, and the panics began on the 28th of January. I tried to look in the repo at the commits between the 21st and the 28th to see if something relevant had changed, but a lot of things happened... plus, I'm not sure it couldn't have happened before.

                          Next time it happens, I'll post the backtrace.

                          Edit: I reverted to the last backup I had before upgrading on the 28th, which is a Fri Jan 7 15:25:33 EST 2011 snapshot…

                          • cyber7C
                            cyber7

                            Hi Guys

                            Look at my panic: http://forum.pfsense.org/index.php/topic,33403.0.html
                            and see if this solves your problems.

                            Kind regards
                            Aubrey

                            • A
                              acherman

                              Hey Aubrey, changing my MTUs didn't fix anything.  In fact, it caused a panic on both boxes when I tried to apply the changes.  I don't think I will try that again.

                              • jimpJ
                                jimp Rebel Alliance Developer Netgate

                                @acherman:

                                Hey Aubrey, changing my MTUs didn't fix anything.  In fact, it caused a panic on both boxes when I tried to apply the changes.  I don't think I will try that again.

                                Did you get the panic/crash info when it did that? Also, are you on i386 or amd64? What snapshot date?

                                • A
                                  acherman

                                  No, I didn't.  Sorry.  But I'm sure it will happen again soon and I have the camera ready.  I am running i386 and build Mon Jan 31 07:16:37 EST 2011 right now (saw the same issues with Mon Feb 14 02:12:45 EST 2011 and I can't remember which one from the 15th).

                                  • jimpJ
                                    jimp Rebel Alliance Developer Netgate

                                    Update to the most current snap ASAP and then test again. Testing on old snaps isn't likely to provide any useful feedback for this. There were patches after those dates to help prevent panics in other situations.

                                    • A
                                      acherman

                                      I was able to make my CARP slave fail again (seems reproducible). I was changing the MTUs back to default. When I made one change and applied it, things seemed fine, but when I changed a few of them (maybe 5) before applying the changes, it hung.

                                      I will update to the latest snap and see if I can make it fail again…

                                      • A
                                        acherman

                                        I updated my CARP backup to the latest snap - Thu Feb 17 02:14:25 EST 2011 - and ran through similar motions that caused the panics before with no issues.  I have updated my master as well and will update if things fail again.

                                        • E
                                          eri--

                                          Can any of you with the pf_state_tree_id_RB_REMOVE_COLOR panic
                                          try setting debug.pfpptpproxy=1 and see if you still get the panic?

                                          • A
                                            acherman

                                            Sure, let me know how to do it (sorry), and if I get the panic running today's snapshot, that can be my next step.
