Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    My IPSEC service hangs

    IPsec
    15
    76
    18.0k
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • G
      gassyantelope @auroramus
      last edited by

      @auroramus The behavior occurring is definitely not normal. I understand what that post is saying and completely agree that is normal IPsec behavior. The issue here is completely different though. The tunnels will never come back up once they all go down. I can ping, send data another way, etc., and they won't ever come back up until a restart is performed.

      I've had multiple cases where I had active connections over the tunnel (sending data the whole time) and then the issue occurs and all tunnels go down. This has occurred way before the default 8 hour life span (sometimes within an hour or two).

      1 Reply Last reply Reply Quote 0
      • A
        auroramus
        last edited by

        @gassyantelope Yes 100% the behaviour is wrong.

        as it seems to crash the service. and this shouldnt happen.

        1 Reply Last reply Reply Quote 0
        • M
          mr.ortizx
          last edited by

          I just paid for Enterprise support and I was told the following:

          "Hello,

          Unfortunately, this is a somewhat rare issue that has not been solved yet. It is much less prevalent in pfSense CE 2.5.2, 2.7, and pfSense Plus 22.05. There aren't any workarounds currently, so rolling back or upgrading are the only steps you can currently take to mitigate the issue. You may track the issue here:

          https://redmine.pfsense.org/issues/13014
          "
          I hope this helps you guys. event though redmine says all tunnels continue to operate normally, Netgate support mentioned that they also see instances where all tunnels will drop which is the case for all of us.

          A G 2 Replies Last reply Reply Quote 0
          • A
            auroramus @mr.ortizx
            last edited by auroramus

            @mr-ortizx really appreciate you letting us know.

            1 Reply Last reply Reply Quote 0
            • A
              auroramus
              last edited by

              I have updated to 2.7 i will keep you guys updated.

              1 Reply Last reply Reply Quote 1
              • G
                gassyantelope @mr.ortizx
                last edited by

                @mr-ortizx Thanks man! At least we finally got an official response from them. I'm gonna do what @auroramus did and update to 2.7 as well to see if it helps at all. It can't hurt at this point.

                M 1 Reply Last reply Reply Quote 1
                • M
                  mr.ortizx @gassyantelope
                  last edited by

                  @gassyantelope @auroramus Please let me know how it went after upgrading to the version 2.7

                  abliznoA 1 Reply Last reply Reply Quote 0
                  • A
                    auroramus
                    last edited by

                    Hi Guys

                    So far so good with 2.7 have not had a single drop in the tunnels for days now soo ye give it a go and let me know.

                    A 1 Reply Last reply Reply Quote 0
                    • A
                      auroramus @auroramus
                      last edited by auroramus

                      I have been running 2.7 since 30th June and i have not had a single blip.

                      Let me know how you guys get on.

                      G 1 Reply Last reply Reply Quote 0
                      • G
                        gassyantelope @auroramus
                        last edited by

                        @auroramus I updated to 2.7 yesterday. It's only been 24 hours, but I haven't had the issue yet. That's already an improvement for me, seeing as I had to reboot the firewall once or twice a day when on 2.6. I'll provide another update in a few days. I'm crossing my fingers.

                        G 1 Reply Last reply Reply Quote 0
                        • G
                          gassyantelope @gassyantelope
                          last edited by gassyantelope

                          @gassyantelope I spoke too soon. I just had the issue occur on 2.7.

                          Disclosure: Potentially justifiable rant below :)

                          Investigating and fixing this issue really needs to be a higher priority at this point. There are reports about the issue from 5+ years ago, yet it still exists. The latest redmine issue report (from 3 months ago) hasn't had much traction, as far as someone actually investigating the problem. It just keeps having its target version pushed back over and over.

                          I get that there are other issues that need to be fixed as well, but this is an issue that, essentially, makes pfSense a nonviable option to use as a firewall in a production environment. Netgate states it to be a "somewhat rare" issue, yet there are many threads and redmine reports, spanning years, that show that this issue is more common than they make it out to be.

                          My company has primarily used WatchGuard firewalls for years, which are decent enough, but their capabilities are lacking in various areas (I'd prefer to move away from them, personally). We started installing some Netgate/pfSense devices for some "smaller" networks, that only have 5-10 IPsec tunnels, and found pfSense to run stably and have far superior capabilities. We were ready to purchase ~30 Netgate firewalls to replace all of the WatchGuards, but wanted to test pfSense on a "larger" networks (50+ IPsec tunnels) to make sure there were no issues before we pulled the trigger. That large network test led us to where we are today, exposing this issue that completely breaks IPsec VPNs constantly.

                          As much as I like pfSense (which I'll continue to use for my home lab) and really want to move away from WatchGuard and transition to Netgate/pfSense firewalls, that can't be done for as long as this issue continues to exist. A firewall with lackluster capabilities, but fully working IPsec VPNs, is better than a very capable firewall that has to be rebooted 1-2 times per day to get IPsec VPNs, which I'd consider a core feature of all firewalls, to stay up and work properly.

                          I'll be putting the WatchGuards back in place for now. I'll continue to monitor this thread and the redmine issue page for updates. I'm still willing to swap the pfSense firewall back in to assist with the testing of possible solutions, as I'd like to see this problem fixed some day. I just can't have pfSense be our day to day, primary, firewall in its current state.

                          Rant over.

                          1 Reply Last reply Reply Quote 0
                          • abliznoA
                            ablizno @mr.ortizx
                            last edited by

                            @mr-ortizx Updated to latest 2.7 dev build, issue still occurs with the same frequency as before.

                            1 Reply Last reply Reply Quote 0
                            • M
                              mr.ortizx
                              last edited by

                              I was asked by Netgate technical support to upgrade to the version. Pfsense Plus 22.05
                              Issue persisted. I will continue working with support.

                              abliznoA 1 Reply Last reply Reply Quote 1
                              • abliznoA
                                ablizno @mr.ortizx
                                last edited by

                                @mr-ortizx wish there was some way to help point them towards the root of the issue. We know its due to the vici socket getting overwhelmed/locked up. When it happens if you run sockstat | grep -i vici you can see charon is overwhelmed. It started as like once a week for me and now its every ~12 hours it seems. Tunnels expire every 8 hours, so it doesn't appear to be directly related to the tunnels reconnecting. Opening Command Prompt and running pgrep -f charon to get the PIDs then kill -9 [pid] [pid] works as long as you restart the IPSEC service twice (not sure why it needs to be restarted twice) seems to fix it. We know what the problem is, and I'd be willing to provide any logs that help as I understand it is some sort of "rare" issue.

                                If anyone from netgate sees this, I'd be more than willing to assist in getting this resolved.

                                R 1 Reply Last reply Reply Quote 0
                                • R
                                  rcoleman-netgate Netgate @ablizno
                                  last edited by

                                  @ablizno said in My IPSEC service hangs:

                                  If anyone from netgate sees this, I'd be more than willing to assist in getting this resolved.

                                  This is the redmine associated with this issue: https://redmine.pfsense.org/issues/13014

                                  Any contribution you can provide would be located in there.

                                  Ryan
                                  Repeat, after me: MESH IS THE DEVIL! MESH IS THE DEVIL!
                                  Requesting firmware for your Netgate device? https://go.netgate.com
                                  Switching: Mikrotik, Netgear, Extreme
                                  Wireless: Aruba, Ubiquiti

                                  G 1 Reply Last reply Reply Quote 0
                                  • A
                                    auroramus
                                    last edited by

                                    Since i updated to 2.7 my ipsec service has not crashed once and i have had no drops on my running VPN's

                                    G 1 Reply Last reply Reply Quote 0
                                    • G
                                      gassyantelope @auroramus
                                      last edited by

                                      @auroramus It's so strange how the update can fix the problem for some, but not others. What type hardware are you running for pfSense and how many IPsec tunnels do you have? I think the issue has more to do with the latter, rather than the former, but the more information we have to work with, the better.

                                      There's a few reasons I think it's related to the number of tunnels. Other than the crash only occurring (in my tests) on firewalls with a large number of tunnels, I also noticed various IPsec pages in the web UI started to be slow and/or timeout when trying to add new IPsec gateways/tunnels and apply the changes (editing the settings of current ones and applying the changes works as it should). This started occurring right after adding my ~30th tunneI and still happens. It's probably a separate issue, but it's odd that its occurrence seems tied to the number of gateways/tunnels, just like the crashing issues (in my tests). It's like some services are overloaded when there's too many tunnels (which we know to be the case with charon).

                                      I wouldn't mind trying to perform a test on 2.7, with a similar number of tunnels as you have, to see if there are still issues.

                                      1 Reply Last reply Reply Quote 0
                                      • G
                                        gassyantelope @rcoleman-netgate
                                        last edited by

                                        @rcoleman-netgate We can definitely post information over there, but do you have a way to get someone to look at it? The last comment from someone who works on pfSense (Brad Davis, a developer?) was 2.5 months ago when he said that they thought it was fixed, but needed more testing.

                                        Since then, people have replied saying that it was not fixed and is still an issue. I'm sure that many of us wouldn't mind testing and providing whatever information is needed to get this fixed, but that can't happen until someone who works on pfSense is actively involved and tells us what they need from us.

                                        R 1 Reply Last reply Reply Quote 1
                                        • R
                                          rcoleman-netgate Netgate @gassyantelope
                                          last edited by

                                          @gassyantelope I've seen hints at a solution that is being tested but not a lot of specifics at this time. If there's anything that would be testable it will appear in the redmine notes.

                                          Ryan
                                          Repeat, after me: MESH IS THE DEVIL! MESH IS THE DEVIL!
                                          Requesting firmware for your Netgate device? https://go.netgate.com
                                          Switching: Mikrotik, Netgear, Extreme
                                          Wireless: Aruba, Ubiquiti

                                          T 1 Reply Last reply Reply Quote 0
                                          • T
                                            Topogigio @rcoleman-netgate
                                            last edited by

                                            I have same problem, maybe it can help:

                                            • pfsense 2.6 on VMWare, with only ONE IPSEC v2 VTI tunnel configured. After some weeks of (small) work it stops the tunnel. No way to resume it: it negotiates something but it does not go up. No help restarting IPSEC service: it is needed to restart the whole pfSense to restart.
                                            • pfSense 2.5 on VMWARE with A LOT OF v2/v1 VTI tunnels configured. It works well without any problem from long time.
                                            • pfSense 2.6 on HYPER-V with three tunnels configured. It seems working (23 days uptime currently)

                                            When the problem occurs the IPSEC logs report only the error "[CFG] trap not found, unable to acquire reqid 5002" but I'm not sure it is "normal" with VTI tunnels.

                                            1 Reply Last reply Reply Quote 0
                                            • abliznoA ablizno referenced this topic on
                                            • abliznoA ablizno referenced this topic on
                                            • abliznoA ablizno referenced this topic on
                                            • First post
                                              Last post
                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.