Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    2x pfsense 24.11 hard crashes in under a week - Netgate 1537

    Scheduled Pinned Locked Moved General pfSense Questions
    11 Posts 5 Posters 228 Views 5 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • GertjanG Offline
      Gertjan @joekislo
      last edited by

      @joekislo said in 2x pfsense 24.11 hard crashes in under a week - Netgate 1537:

      Both units probably at some point OOMED and ran out of swap before the kernel killed bsnmpd to survive

      As long as disk (partition) space isn't an issue, these kind of events can be found in the system log.
      For memory (RAM), as soon as the system is running again, look at Status > Monitoring ans select System => Memory.

      No "help me" PM's please. Use the forum, the community will thank you.
      Edit : and where are the logs ??

      J 2 Replies Last reply Reply Quote 0
      • J Offline
        joekislo @Gertjan
        last edited by

        This post is deleted!
        1 Reply Last reply Reply Quote 0
        • J Offline
          joekislo @Gertjan
          last edited by

          @Gertjan Gertjan No evidence of OOMs before the event, disk space was plenty good as well. FWIW the OOMs were months ago, and I sent all the details to Netgate. Memory has been stable since we started the weekly bnsmpd restarts.

          The dip is the unit hard reset.

          9fe0628d-aaa2-46c0-9ce2-3b6dcf498be8-image.png

          1 Reply Last reply Reply Quote 0
          • stephenw10S Online
            stephenw10 Netgate Administrator
            last edited by

            Did you try entering ctl+t at the console when it wasn't responding? That can sometimes show output when nothing else will.

            Has this only happened once? On each node?

            1 Reply Last reply Reply Quote 0
            • S Offline
              SteveITS Rebel Alliance @joekislo
              last edited by

              @joekislo said in 2x pfsense 24.11 hard crashes in under a week - Netgate 1537:

              Jul 24 00:19:20 fw1 syslogd: exiting on signal 15

              Just to confirm you think this entry is when you hit the Reset button to reboot?

              We have a 4200 that put itself in standby (according to its LED) on its own, and logged that at the time, like it shut itself down. Haven't seen that anywhere/anytime else, though.

              Only install packages for your version, or risk breaking it. Select your branch in System/Update/Update Settings.
              When upgrading, allow 10-15 minutes to reboot, or more depending on packages, and device or disk speed.
              Upvote ๐Ÿ‘ helpful posts!

              J 1 Reply Last reply Reply Quote 0
              • J Offline
                jcleaves @SteveITS
                last edited by

                @SteveITS Was there anything in the logs regarding going to Standby? I didn't see any power event logged.

                S 1 Reply Last reply Reply Quote 0
                • S Offline
                  SteveITS Rebel Alliance @jcleaves
                  last edited by SteveITS

                  @jcleaves Nothing mentioned standby, shutdown, etc., in fact nothing for the previous hour before the "exiting." However we were running a RAM disk on it which we normally do. But the 4200 has a standby LED pattern. Our occurrence was in early June. TAC had us reinstall it.

                  Edit: I understand our situation may not be relevant here.

                  Only install packages for your version, or risk breaking it. Select your branch in System/Update/Update Settings.
                  When upgrading, allow 10-15 minutes to reboot, or more depending on packages, and device or disk speed.
                  Upvote ๐Ÿ‘ helpful posts!

                  1 Reply Last reply Reply Quote 0
                  • stephenw10S Online
                    stephenw10 Netgate Administrator
                    last edited by stephenw10

                    If you press the ATX power button that's what you would see logged:

                    Jul 28 21:57:35 	php-fpm 	8456 	/index.php: Successful login for user 'admin' from: 172.21.16.8 (Local Database)
                    Jul 29 00:21:11 	syslogd 		exiting on signal 15
                    Jul 29 00:23:07 	syslogd 		kernel boot file is /boot/kernel/kernel
                    Jul 29 00:23:07 	kernel 		---<<BOOT>>---
                    Jul 29 00:23:07 	kernel 		Copyright (c) 1992-2024 The FreeBSD Project.
                    Jul 29 00:23:07 	kernel 		Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 
                    

                    You can disable that by setting the sysctl hw.acpi.power_button_state=none

                    S J 2 Replies Last reply Reply Quote 0
                    • S Offline
                      SteveITS Rebel Alliance @stephenw10
                      last edited by

                      @stephenw10 In our case a button was not pressed, per the one person in the office. It would be nice if it logged a "button push" event.

                      Only install packages for your version, or risk breaking it. Select your branch in System/Update/Update Settings.
                      When upgrading, allow 10-15 minutes to reboot, or more depending on packages, and device or disk speed.
                      Upvote ๐Ÿ‘ helpful posts!

                      1 Reply Last reply Reply Quote 0
                      • J Offline
                        jcleaves @stephenw10
                        last edited by

                        @stephenw10 This was definitely not a button push on ours either. Both units are in locked cabinets in a colo. Any access to the facility is logged.

                        @SteveITS As for it going to standby or hibernating, the person who went on site the LEDs were normal. Nothing indicating a state change or issue.

                        1 Reply Last reply Reply Quote 0
                        • First post
                          Last post
                        Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.