Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    Crashes while upgrading to 24.03 from the last stable

    Problems Installing or Upgrading pfSense Software
    4
    15
    952
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • rayrayrayraydogR
      rayrayrayraydog
      last edited by rayrayrayraydog

      I am a home user still clinging to pfSense PLUS. I just upgraded to 24.03 and my firewall apparently crashed several times while applying the update. When it didn't come back up I started watching the physical console and saw a stack trace fly by before the last reboot when it finally started normally. It logged a crash report as well as saying it had to restore the last known good config backup, which thankfully was from today.

      It DID eventually complete the upgrade after those reboots and appears to be working fine on the new version. Is there anything I need to worry about with this?

      System is running on an ASRock J3355B-ITX on BIOS v1.20. It also has a quad Intel 1GB NIC installed and I'm not using the built-in ethernet, just the ports on the card.

      Crash report is attached.
      crash-24.txt

      Dobby_D 1 Reply Last reply Reply Quote 0
      • Dobby_D
        Dobby_ @rayrayrayraydog
        last edited by

        @rayrayrayraydog

        I would install 2.7.2 CE and upgrade to 23.09.01 and then to the latest
        24.03 Release from today morning (I was doing)

        And then installing the packets you need and then playback your backup
        again.

        Would be in my eyes more fast then all the other "work around" stuff!

        #~. @Dobby

        Turris Omnia - 4 Ports - 2 GB RAM / TurrisOS 7 Release (Btrfs)
        PC Engines APU4D4 - 4 Ports - 4 GB RAM / pfSense CE 2.7.2 Release (ZFS)
        PC Engines APU6B4 - 4 Ports - 4 GB RAM / pfSense+ (Plus) 24.03_1 Release (ZFS)

        1 Reply Last reply Reply Quote 0
        • stephenw10S
          stephenw10 Netgate Administrator
          last edited by

          Hmm, the backtrace there is not very helpful unfortunately:

          db:1:pfs> bt
          Tracing pid 12 tid 100013 td 0xfffff800016e4740
          kdb_enter() at kdb_enter+0x33/frame 0xfffffe0010784da0
          kbdmux_intr() at kbdmux_intr+0x3d/frame 0xfffffe0010784dc0
          taskqueue_run_locked() at taskqueue_run_locked+0x182/frame 0xfffffe0010784e40
          taskqueue_run() at taskqueue_run+0x68/frame 0xfffffe0010784e60
          ithread_loop() at ithread_loop+0x257/frame 0xfffffe0010784ef0
          fork_exit() at fork_exit+0x7f/frame 0xfffffe0010784f30
          fork_trampoline() at fork_trampoline+0xe/frame 0xfffffe0010784f30
          --- trap 0xa5a5a5a5, rip = 0, rsp = 0, rbp = 0xa5a5a5a5a5a5a5a5 ---
          

          But also there is no panic shown. It almost looks like it was manually interrupted:

          <118>Checking Snort Subscriber rules md5 file... done.
          <118>There is a new set of Snort Subscriber rules posted.
          <118>Downloading snortrules-snapshot-29200.tar.gz...
          KDB: enter: manual escape to debugger
          

          Is that possible something was connected to it interrupting it at that point?

          rayrayrayraydogR 1 Reply Last reply Reply Quote 0
          • rayrayrayraydogR
            rayrayrayraydog @stephenw10
            last edited by rayrayrayraydog

            Is that possible something was connected to it interrupting it at that point?

            I think that's possible. It was taking longer than usual to update and restart, as near I could tell. When I started watching the console I was seeing a static screen that was not changing, and I pressed Enter on the keyboard after a couple minutes. Could this have interrupted the upgrade?

            1 Reply Last reply Reply Quote 0
            • stephenw10S
              stephenw10 Netgate Administrator
              last edited by

              It could depending on where it was. But I still wouldn't expect it to reach the debugger like that. 🤔

              rayrayrayraydogR 1 Reply Last reply Reply Quote 0
              • rayrayrayraydogR
                rayrayrayraydog @stephenw10
                last edited by

                @stephenw10 Would a hardware issue be able to cause a problem like this one? I built this server years ago and it is using an ancient OCZ SSD that could be about to fail at any time. I did move all logging to a remote syslog server to preserve it as much as possible.

                1 Reply Last reply Reply Quote 0
                • stephenw10S
                  stephenw10 Netgate Administrator
                  last edited by

                  I would expect a drive failure to be far more obvious then this. It looks more like a hardware interrupt has been triggered somehow.

                  Have you seen any further issues since the upgrade completed?

                  rayrayrayraydogR 1 Reply Last reply Reply Quote 0
                  • rayrayrayraydogR
                    rayrayrayraydog @stephenw10
                    last edited by

                    @stephenw10 I have not. It's been running just fine since yesterday.

                    1 Reply Last reply Reply Quote 0
                    • stephenw10S
                      stephenw10 Netgate Administrator
                      last edited by

                      Hmm, well it's odd but I wouldn't be too concerned since it did complete the upgrade and there was seemingly no panic.
                      If you see anything further we look at any crash reports.

                      M 1 Reply Last reply Reply Quote 0
                      • stephenw10S stephenw10 forked this topic on
                      • M
                        mikebenna @stephenw10
                        last edited by

                        Hi @stephenw10, the problem can't be completely ignored... any time my router reboots, it has a decent chance of freezing on reboot (my guess is it freezes about 2/3rds of the time). Fortunately it's a VM so I don't need physical access, but still, a reliable reboot is pretty core to a reliable system, especially for remote access situations.

                        Mike

                        1 Reply Last reply Reply Quote 0
                        • stephenw10S
                          stephenw10 Netgate Administrator
                          last edited by

                          I agree but what you're seeing is a completely different problem. Which is why I forked it to a new thread.

                          1 Reply Last reply Reply Quote 0
                          • stephenw10S
                            stephenw10 Netgate Administrator
                            last edited by

                            After consulting with our devs here it seems this could in fact be caused by a bad or failing SSD. It's hitting whilst trying to run a checksum on the downloaded Snort ruleset.

                            So I would suggest that SSD has reached the end of it;s useful life!

                            rayrayrayraydogR 1 Reply Last reply Reply Quote 0
                            • rayrayrayraydogR
                              rayrayrayraydog @stephenw10
                              last edited by

                              @stephenw10 I'll have to bring the thing down and swap drives then, thanks!

                              rayrayrayraydogR 1 Reply Last reply Reply Quote 0
                              • rayrayrayraydogR
                                rayrayrayraydog @rayrayrayraydog
                                last edited by

                                @stephenw10 Just swapped the ancient OCZ drive with a much better one today. I just used dd to 1:1 copy data across and it's booting fine, but do I need to care about the "The backup GPT table is not on the end of the device" message? Obviously it's because the new drive is larger than the old one, but will this have any impact on pfSense?

                                1 Reply Last reply Reply Quote 0
                                • stephenw10S
                                  stephenw10 Netgate Administrator
                                  last edited by

                                  No, not unless you have some boot issue that requires the use of the secondary table.

                                  However you can try to use growfs to fill the disk if you wish. Run: touch /root/force_growfs then reboot and it should fill it during the next boot.

                                  1 Reply Last reply Reply Quote 0
                                  • First post
                                    Last post
                                  Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.