Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    Suricata process dying due to hyperscan problem

    Scheduled Pinned Locked Moved IDS/IPS
    295 Posts 25 Posters 86.6k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • NogBadTheBadN
      NogBadTheBad @bmeeks
      last edited by NogBadTheBad

      @bmeeks said in Suricata process dying due to hyperscan problem:

      For other users experiencing the Hyperscan crash in Suricata --

      1. Do you have one or more VLANs configured on the interface that crashes?

      Yes on my LAN interface
      PPPOE on my WAN interface

      1. Does disabling blocking mode on the crashing interface result in a difference in behavior?

      Yes for no blocking, ok

      Set to AC and blocking ok

      Set to Auto and Core dumps

      If you have fiddled with the MPM Algorithm setting on the INTERFACE SETTINGS tab, be sure to return it to Auto and save the change before testing.

      Currently set not to block so I can rebaseline my suppression list.

      Andy

      1 x Netgate SG-4860 - 3 x Linksys LGS308P - 1 x Aruba InstantOn AP22

      1 Reply Last reply Reply Quote 0
      • M
        masons
        last edited by

        @bmeeks , I think I may have found a reliable way to reproduce the issue.

        Environment

        Two separate VMs.

        VM1

        • pfSense CE 2.7.0
        • 4 vCPUs on KVM
        • AES-NI CPU Crypto: Yes
        • Suricata 7.0.2_1
        • LAN interface has multiple VLANs, but Suricata is only running on one of the VLAN interfaces (PC)
        • WAN interface is running Suricata

        VM2

        • pfSense CE 2.7.1
        • 4 vCPUs on KVM
        • AES-NI CPU Crypto: Yes
        • Suricata 7.0.2_1
        • LAN interface has multiple VLANs, but Suricata is only running on one of the VLAN interfaces (PC)
        • WAN interface is running Suricata

        How to reproduce the issue

        • Start the Suricata service
        • Check the Suricata interfaces
          • WAN will be running
          • PC will not be running
        • suricata.log for the PC Suricata instance does not show the Hyperscan log error
        • System log shows
        pid 1810 (suricata), jid 0, uid 0: exited on signal 11 (core dumped)
        

        How to get the PC instance running

        • Stop the Suricata service
        • Go to Diagnostic --> Command Prompt and enter
        elfctl -e +noaslr /usr/local/bin/suricata
        
        • Start the Suricata service
        • Check the Suricata interfaces, both WAN and PC will be running
        • suricata.log for the PC Suricata instance does not show the Hyperscan log error
        • System log shows no errors

        I can cycle back and forth between +noaslr and -noaslr and the behaviour is completely repeatable. I've had one VM running with +noaslr for one day and both Suricata instances have remained up the whole time.

        I know that I previously reported that the fix didn't appear to solve the problem, but it's worth noting that in my previous report I was seeing the Hyperscan log entry and it was the WAN interface that failed, not the PC interface. In this post I'm not seeing the Hyperscan log entry and Suricata instances have remained running for much longer.

        1 Reply Last reply Reply Quote 0
        • bmeeksB
          bmeeks
          last edited by

          @masons, @tylerevers, @NogBadTheBad. @jowe78:
          Thank you all for the extra information. I will continue to dig into this.

          1 Reply Last reply Reply Quote 1
          • A
            asdjklfjkdslfdsaklj @bmeeks
            last edited by

            @bmeeks said in Suricata process dying due to hyperscan problem:

            @asdjklfjkdslfdsaklj said in Suricata process dying due to hyperscan problem:

            @bmeeks anecdotally, yes. ~8hrs after disabling blocking mode and both LAN PHY Suricata instances are still up.

            1. I need to know if you have any VLAN configured on either LAN interface.

            2. Try enabling Blocking Mode on just one of the LAN interfaces and see what happens then.

            To help me troubleshoot this, I desparately need you folks having the issue to give me some explicit details when responding. For example, answer question #1 above and also try troubleshooting suggestion #2 above. Then follow up back here with detailed results for each.

            I will repeat again for clarity: I am trying to determine if VLANs configured on the crashing interface are related or not. So, tell me if you have VLANs on the interface, and if you do, how many. Then tell me if you can relate the crash to blocking enabled or not.

            1. No VLANs.

            2. Enabled blocking mode on LAN 1, disabled bon LAN 2.

            Both ran for a few hours, and eventually LAN 1 died (same hyperscan error), while LAN 2 remains up.

            bmeeksB 1 Reply Last reply Reply Quote 0
            • tylereversT
              tylerevers @bmeeks
              last edited by

              @bmeeks

              Reconfirm Hyperscan Still Crashes

              Block Offenders = On
              Signature Group Header MPM Context = Auto
              Pattern Matcher Algorithm = Auto

              Interface failed with error:

              [101378 - W#07] 2023-11-29 12:54:32 Error: spm-hs: Hyperscan returned fatal error -1.
              

              Test with Block Offenders Off
              Block Offenders = Off
              Signature Group Header MPM Context = Auto
              Pattern Matcher Algorithm = Auto

              It has been three hours without a crash.

              bmeeksB 1 Reply Last reply Reply Quote 0
              • bmeeksB
                bmeeks @asdjklfjkdslfdsaklj
                last edited by

                @asdjklfjkdslfdsaklj said in Suricata process dying due to hyperscan problem:

                Both ran for a few hours, and eventually LAN 1 died (same hyperscan error), while LAN 2 remains up.

                Okay, now swap the blocking mode around. Disable blocking on LAN 1 and Enable blocking on LAN 2. Let's see if the hyperscan error moves over to LAN 2 and it now crashes while LAN 1 remains stable.

                If the problem does not move to LAN 1, then that would tend to take blocking mode out of the picture unless it takes that in combination with something else to trigger the hyperscan crash.

                A 1 Reply Last reply Reply Quote 0
                • bmeeksB
                  bmeeks @tylerevers
                  last edited by

                  @tylerevers said in Suricata process dying due to hyperscan problem:

                  @bmeeks

                  Reconfirm Hyperscan Still Crashes

                  Block Offenders = On
                  Signature Group Header MPM Context = Auto
                  Pattern Matcher Algorithm = Auto

                  Interface failed with error:

                  [101378 - W#07] 2023-11-29 12:54:32 Error: spm-hs: Hyperscan returned fatal error -1.
                  

                  Test with Block Offenders Off
                  Block Offenders = Off
                  Signature Group Header MPM Context = Auto
                  Pattern Matcher Algorithm = Auto

                  It has been three hours without a crash.

                  How long does it typically take to crash? Is three hours of runtime quite a bit longer than you were getting with blocking enabled?

                  tylereversT 1 Reply Last reply Reply Quote 0
                  • tylereversT
                    tylerevers @bmeeks
                    last edited by

                    @bmeeks said in Suricata process dying due to hyperscan problem:

                    @tylerevers said in Suricata process dying due to hyperscan problem:

                    @bmeeks

                    Reconfirm Hyperscan Still Crashes

                    Block Offenders = On
                    Signature Group Header MPM Context = Auto
                    Pattern Matcher Algorithm = Auto

                    Interface failed with error:

                    [101378 - W#07] 2023-11-29 12:54:32 Error: spm-hs: Hyperscan returned fatal error -1.
                    

                    Test with Block Offenders Off
                    Block Offenders = Off
                    Signature Group Header MPM Context = Auto
                    Pattern Matcher Algorithm = Auto

                    It has been three hours without a crash.

                    How long does it typically take to crash? Is three hours of runtime quite a bit longer than you were getting with blocking enabled?

                    Yes, three hours is in the realm of 3-8x longer (and it still hasn't crashed yet ~9 hours total).

                    bmeeksB 1 Reply Last reply Reply Quote 0
                    • bmeeksB
                      bmeeks @tylerevers
                      last edited by

                      @tylerevers said in Suricata process dying due to hyperscan problem:

                      @bmeeks said in Suricata process dying due to hyperscan problem:

                      @tylerevers said in Suricata process dying due to hyperscan problem:

                      @bmeeks

                      Reconfirm Hyperscan Still Crashes

                      Block Offenders = On
                      Signature Group Header MPM Context = Auto
                      Pattern Matcher Algorithm = Auto

                      Interface failed with error:

                      [101378 - W#07] 2023-11-29 12:54:32 Error: spm-hs: Hyperscan returned fatal error -1.
                      

                      Test with Block Offenders Off
                      Block Offenders = Off
                      Signature Group Header MPM Context = Auto
                      Pattern Matcher Algorithm = Auto

                      It has been three hours without a crash.

                      How long does it typically take to crash? Is three hours of runtime quite a bit longer than you were getting with blocking enabled?

                      Yes, three hours is in the realm of 3-8x longer (and it still hasn't crashed yet ~9 hours total).

                      Well, now I need to figure out how in the world the custom blocking module code could possibly interact with the Hyperscan library 😕.

                      It makes no sense as they are not even remotely related.

                      C 1 Reply Last reply Reply Quote 1
                      • C
                        chrysmon @bmeeks
                        last edited by

                        @bmeeks Can confirm that in IDS mode (no blocking) suricata has no crashes. In IPS mode it crashes. Hyperscan, no VLANS.

                        1 Reply Last reply Reply Quote 1
                        • bmeeksB bmeeks referenced this topic on
                        • A
                          asdjklfjkdslfdsaklj @bmeeks
                          last edited by

                          @bmeeks swapped, same result. Instance on interface w/blocking disabled remains up, other died.

                          bmeeksB 1 Reply Last reply Reply Quote 0
                          • bmeeksB
                            bmeeks @asdjklfjkdslfdsaklj
                            last edited by

                            @asdjklfjkdslfdsaklj said in Suricata process dying due to hyperscan problem:

                            @bmeeks swapped, same result. Instance on interface w/blocking disabled remains up, other died.

                            Thank you. This is very helpful. It tells me that somehow the custom blocking module is part of the issue.

                            I will need to dig into the code and see if something pops out. It will be a few days, though, before I can generate debug versions of the package because the ESXi host that contained all my pfSense package builders and private testing repo crashed and burned last Sunday morning due to a power blip and my UPS failing at the same time. Something is weird with the UPS. It shows the battery as good, but if power blips it drops the load. I will need to get a new one. I've started the process of rebuilding my test environment on that host, but it's going to take a few days. Also have some other non-related obligations over the next 4 days that interfere with the effort.

                            tylereversT S 2 Replies Last reply Reply Quote 1
                            • tylereversT
                              tylerevers @bmeeks
                              last edited by

                              @bmeeks said in Suricata process dying due to hyperscan problem:

                              @asdjklfjkdslfdsaklj said in Suricata process dying due to hyperscan problem:

                              @bmeeks swapped, same result. Instance on interface w/blocking disabled remains up, other died.

                              Thank you. This is very helpful. It tells me that somehow the custom blocking module is part of the issue.

                              I will need to dig into the code and see if something pops out. It will be a few days, though, before I can generate debug versions of the package because the ESXi host that contained all my pfSense package builders and private testing repo crashed and burned last Sunday morning due to a power blip and my UPS failing at the same time. Something is weird with the UPS. It shows the battery as good, but if power blips it drops the load. I will need to get a new one. I've started the process of rebuilding my test environment on that host, but it's going to take a few days. Also have some other non-related obligations over the next 4 days that interfere with the effort.

                              Godspeed to you, sir. Best wishes in all things.

                              1 Reply Last reply Reply Quote 0
                              • S
                                SteveITS Galactic Empire @bmeeks
                                last edited by

                                @bmeeks said in Suricata process dying due to hyperscan problem:

                                battery as good, but if power blips it drops the load

                                FWIW we see that a lot on older batteries, or I suppose defective ones. In our experience the UPS "self test" works to proactively alert the majority of the time but a decent amount the self test will trigger a power failure because the battery can't handle the load for the 2 seconds. :( And by "older" I mean over 4-5 years.

                                Pre-2.7.2/23.09: Only install packages for your version, or risk breaking it. Select your branch in System/Update/Update Settings.
                                When upgrading, allow 10-15 minutes to restart, or more depending on packages and device speed.
                                Upvote 👍 helpful posts!

                                bmeeksB 1 Reply Last reply Reply Quote 0
                                • bmeeksB
                                  bmeeks @SteveITS
                                  last edited by

                                  @SteveITS said in Suricata process dying due to hyperscan problem:

                                  @bmeeks said in Suricata process dying due to hyperscan problem:

                                  battery as good, but if power blips it drops the load

                                  FWIW we see that a lot on older batteries, or I suppose defective ones. In our experience the UPS "self test" works to proactively alert the majority of the time but a decent amount the self test will trigger a power failure because the battery can't handle the load for the 2 seconds. :( And by "older" I mean over 4-5 years.

                                  I suspect a defective battery at some level. It is a Tripp-Lite. My favorite is APC, and I think that's what I will go back with.

                                  1 Reply Last reply Reply Quote 0
                                  • C
                                    chrysmon
                                    last edited by

                                    Again I want to mention that suricata works fine (on my system at least) in IPS mode with AC-BS Pattern Match instead the default (Hyperscan). This may help the developers to find the bug and the users to stay protected.

                                    A 1 Reply Last reply Reply Quote 0
                                    • A
                                      ajohnson353 @chrysmon
                                      last edited by

                                      @chrysmon I am seeing the same thing in AC mode. It has yet to die since making the switch.

                                      C 1 Reply Last reply Reply Quote 0
                                      • C
                                        chrysmon @ajohnson353
                                        last edited by

                                        @ajohnson353 said in Suricata process dying due to hyperscan problem:

                                        @chrysmon I am seeing the same thing in AC mode. It has yet to die since making the switch.

                                        If I remember well, mine was not working in AC mode. Let it run for longer time to be sure.

                                        1 Reply Last reply Reply Quote 0
                                        • bmeeksB
                                          bmeeks
                                          last edited by bmeeks

                                          Wonder if this might be the source of the mysterious Hyperscan bug we are seeing in Suricata?

                                          https://www.freebsd.org/security/advisories/FreeBSD-EN-23:15.sanitizer.asc

                                          If so, that would explain a lot of the weirdness. I will keep tabs on this. Thanks to @RobbieTT for the link in another thread unreleated to Suricata.

                                          M S 2 Replies Last reply Reply Quote 1
                                          • M
                                            masons @bmeeks
                                            last edited by

                                            @bmeeks said in Suricata process dying due to hyperscan problem:

                                            Wonder if this might be the source of the mysterious Hyperscan bug we are seeing in Suricata?

                                            https://www.freebsd.org/security/advisories/FreeBSD-EN-23:15.sanitizer.asc

                                            @bmeeks,

                                            The two machines I posted about earlier, are both running with the default hyperscan enabled and with legacy blocking mode enabled. Both machines have not experienced a Suricata core dump since I disabled ASLR for the Suricata binary. Thus it seems increasingly plausible that the root of the issue is linked to ASLR and the link above about the LLVM sanitizer could certainly explain why this has suddenly happened.

                                            kiokomanK 1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post
                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.