Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    AES-CGM and stalling IPSec

    IPsec
    2
    7
    854
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • keyserK
      keyser Rebel Alliance
      last edited by keyser

      HI all.

      I have discovered an IPSec issue today when deploying a bunch of SG-2100 boxes (ARM64 CPU) to do IPSec Site2Site to a XG-1537.
      If I use AES-CGM for encryption (both 128 and 256bit) as guided by Netgate, the SG-2100 boxes will stall/become unresponsive after a while if there is more than one Phase2 tunnels active in the Tunnel. Boxes with only one Phase2 tunnel does not seem to suffer the issue (so far at least).

      The XG-1537 does not seem to suffer issues - it has QAT enabled.

      Disabling SafeXcel (HW Acceleration) does not mitigate the Issue.
      But changing the cipher on the tunnel to AES256 (not CGM - I believe it is really AES256-CBC) resolves the issue.

      I have a lot of testing to do still, but it’s quite evident the change of cipher resolves the issue. The SG-2100’s are using 22.01, and the XG-1537 is using 21.05. I will try to upgrade them all to 22.05 and see it changes anything.

      Is this a known issue with multiple Phase two’s?

      Love the no fuss of using the official appliances :-)

      1 Reply Last reply Reply Quote 0
      • keyserK keyser referenced this topic on
      • N
        NOCling
        last edited by NOCling

        Have a look:

        Bug #13074

        But i try GCM with 22.05 and i can't reproduce it at the moment. Looks like GCM is now usable.

        Netgate 6100 & Netgate 2100

        keyserK 1 Reply Last reply Reply Quote 0
        • keyserK
          keyser Rebel Alliance @NOCling
          last edited by

          @nocling said in AES-CGM and stalling IPSec:

          Have a look:

          Bug #13074

          But i try GCM with 22.05 and i can't reproduce it at the moment. Looks like GCM is now usable.

          Yeah, I know that redmine report, but that is not the issue I’m suffering here. Disabling SafeXcel makes no difference to my setup. But then again, I need to test 22.05 in both ends and try with disabled asyncronous cryptography and so on.

          But right now it definitely manifests itself when using multiple phase 2 tunnels in a Site2Site. I seem unable to recreate the same issue when only using one Phase 2.

          Love the no fuss of using the official appliances :-)

          1 Reply Last reply Reply Quote 0
          • N
            NOCling
            last edited by

            I use 2 P2 and async crypto now with AESGCM256, no impact at the moment.

            Netgate 6100 & Netgate 2100

            keyserK 1 Reply Last reply Reply Quote 0
            • keyserK
              keyser Rebel Alliance @NOCling
              last edited by

              @nocling said in AES-CGM and stalling IPSec:

              I use 2 P2 and async crypto now with AESGCM256, no impact at the moment.

              But you had the issue before 22.05?

              Love the no fuss of using the official appliances :-)

              1 Reply Last reply Reply Quote 0
              • N
                NOCling
                last edited by

                Yes, with 22.01 my 2100 hangs up some times a day before i can find out what happened. I reproduce it and so we got the Bug Report after a other 2100 are affected to.

                Netgate 6100 & Netgate 2100

                keyserK 1 Reply Last reply Reply Quote 0
                • keyserK
                  keyser Rebel Alliance @NOCling
                  last edited by

                  @nocling said in AES-CGM and stalling IPSec:

                  Yes, with 22.01 my 2100 hangs up some times a day before i can find out what happened. I reproduce it and so we got the Bug Report after a other 2100 are affected to.

                  Yeah I saw your posts on the issue, but you could see the mbuf_clusters grow in diagnostics. The mbuf_clusters graph shows no changes when I’m suffering my issue.

                  Love the no fuss of using the official appliances :-)

                  1 Reply Last reply Reply Quote 0
                  • keyserK keyser referenced this topic on
                  • First post
                    Last post
                  Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.