• Categories
  • Recent
  • Tags
  • Popular
  • Users
  • Search
  • Register
  • Login
Netgate Discussion Forum
  • Categories
  • Recent
  • Tags
  • Popular
  • Users
  • Search
  • Register
  • Login

Kernel Error

Scheduled Pinned Locked Moved General pfSense Questions
7 Posts 4 Posters 923 Views
Loading More Posts
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • V
    vcr58
    last edited by vcr58 May 20, 2022, 11:03 AM May 20, 2022, 2:44 AM

    Noticed on the monitor this error message which was also in the system general log...

    May 19 09:00:00	php	12611	[pfBlockerNG] Starting cron process.
    May 19 09:00:26	php	12611	[pfBlockerNG] No changes to Firewall rules, skipping Filter Reload
    May 19 09:06:18	kernel		MCA: Bank 2, Status 0x8400004040080151
    May 19 09:06:18	kernel		MCA: Global Cap 0x0000000000000c07, Status 0x0000000000000000
    May 19 09:06:18	kernel		MCA: Vendor "GenuineIntel", ID 0x706a1, APIC ID 2
    May 19 09:06:18	kernel		MCA: CPU 1 COR (1) ICACHE L1 IRD error
    May 19 09:06:18	kernel		MCA: Address 0x80db1530
    May 19 09:21:00	sshguard	74510	Exiting on signal.
    

    There were no performance issues as far as I know, everything seems normal.

    Is this something I should be concerned about?

    Thanks

    S 1 Reply Last reply May 20, 2022, 12:59 PM Reply Quote 0
    • S
      stephenw10 Netgate Administrator @vcr58
      last edited by May 20, 2022, 12:59 PM

      @vcr58 said in Kernel Error:

      Is this something I should be concerned about?

      Yes. MCA errors like that are almost always a hardware issue.
      Usually I would have expected it to panic and reboot after seeing that but I assume that did not happen?

      Is this the first time you've seen an error like that?

      Have you recently updated?

      Steve

      V 1 Reply Last reply May 20, 2022, 7:36 PM Reply Quote 0
      • V
        vcr58 @stephenw10
        last edited by May 20, 2022, 7:36 PM

        @stephenw10 Correct, there was no reboot after this error. I only noticed it because I had a monitor attached, then I checked the system/general logs.

        This is the first time I saw this error since replacing the SSD that was giving me trouble (see post). Nothing else has changed except I started using pfBlocker a couple days ago.

        I have been running 22.01 released version for a couple months.

        1 Reply Last reply Reply Quote 0
        • R
          revengineer
          last edited by revengineer May 20, 2022, 8:47 PM May 20, 2022, 8:42 PM

          I have seen this error a couple of times over the years on my FreeNAS/TrueNAS server. In each case, it was bad RAM. Run a few cycles of memtest86 to nail this down, then replace the bad module.

          EDIT: It could also be CPU, but I would try memory test first before blaming the CPU. Running CPU stress test with Prime 95 or AIDA64 would be the next step if memory checks out.

          1 Reply Last reply Reply Quote 0
          • S
            stephenw10 Netgate Administrator
            last edited by May 20, 2022, 11:57 PM

            Yeah, it's pretty much always hardware. Sometimes you might start seeing it after an upgrade for example which looks like a software issue but that's usually because some new driver is now hitting the hardware issue.

            Steve

            1 Reply Last reply Reply Quote 0
            • J
              jimp Rebel Alliance Developer Netgate
              last edited by May 23, 2022, 3:12 PM

              $ mcelog --no-dmi --ascii --file mce.log
              mcelog: Family 6 Model 122 CPU: only decoding architectural errors
              mcelog: Family 6 Model 122 CPU: only decoding architectural errors
              Hardware event. This is not a software error.
              CPU 1 BANK 2
              ADDR 80db1530
              MCG status:
              STATUS 8400004040080151 MCGSTATUS 0
              MCGCAP c07 APICID 2 SOCKETID 0
              CPUID Vendor Intel Family 6 Model 122 Step 1
              

              Given that it was an L1 cache error and that's the cache on the CPU, then it's almost certainly a CPU problem and not RAM. Might be overheating as well but that seems less likely.

              If it is a board with a removable CPU you can try re-seating the CPU in the socket. If it has a removable heat sink you could also try removing that, cleaning and redoing the thermal paste/grease/tape/whatever.

              Remember: Upvote with the 👍 button for any user/post you find to be helpful, informative, or deserving of recognition!

              Need help fast? Netgate Global Support!

              Do not Chat/PM for help!

              V 1 Reply Last reply May 23, 2022, 5:44 PM Reply Quote 2
              • V
                vcr58 @jimp
                last edited by May 23, 2022, 5:44 PM

                @jimp My CPU is soldered on a mini ITX MB but the heat sink may be removable. However I have never seen CPU temp above 40 deg C so I don't think its an issue.

                I read somewhere in these forums that there was a BIOS setting that fixed a users errors. I found a "Turbo Mode" in BIOS that I disabled so maybe that will help. I haven't seen any more errors since my first post.

                1 Reply Last reply Reply Quote 0
                7 out of 7
                • First post
                  7/7
                  Last post
                Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.
                  This community forum collects and processes your personal information.
                  consent.not_received