Netgate Discussion Forum

    Working on getting OpenVPN server bridging to fly.

      sullrich

      Do a killall cron just to make sure there's nothing in there stepping on it.

        Numbski

        Okay, done.  Did an addm/deletem of sis0 at 4:59:10 Central time per my nice little mobile phone here.  It's on the clock.  We'll see how long it lasts. :D

          Numbski

          Died at 5:04:20 pm central with no crons.  Hmm….

          addm/deletem sis0 of course revived it.

            Numbski

            I'm out of time to work on this for now.  I added a crontab entry to run the deletem/addm every 4 mins.  It's a terrible, awful, dirty hack, but I'm hoping that the robustness of TCP/IP and associated apps will be able to resend and life will go on until I can figure out what is actually causing the issue to begin with.  Any thoughts on debugging, please post up! ;)
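
For anyone who needs the same stopgap, here is a minimal sketch of the deletem/addm hack as a small shell function. The function name, the IFCONFIG override, and the script path in the crontab comment are all hypothetical; bridge0 and sis0 are just the names used in this thread. It only papers over the drop and does nothing about the root cause.

```shell
#!/bin/sh
# Hypothetical helper: drop a member from a bridge and add it straight back.
# Defaults are the interface names from this thread.
# Set IFCONFIG=echo to print the commands instead of running them (dry run).
rejoin_bridge_member() {
    bridge="${1:-bridge0}"
    member="${2:-sis0}"
    ifconfig_cmd="${IFCONFIG:-/sbin/ifconfig}"

    # Only re-add the member if removing it succeeded.
    $ifconfig_cmd "$bridge" deletem "$member" &&
        $ifconfig_cmd "$bridge" addm "$member"
}

# Example /etc/crontab line (the path is a placeholder) for a script that
# defines and then calls rejoin_bridge_member, run every 4 minutes:
#   */4 * * * * root /bin/sh /root/rejoin_bridge.sh
```

A dry run with `IFCONFIG=echo rejoin_bridge_member bridge0 sis0` prints the two ifconfig argument lists without touching the bridge.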

              sullrich

              Couple things.

              When it drops again, check ifconfig and look at the bridge status.  Does it show blocking?

                Numbski

                ifconfig bridge0 says - UP,BROADCAST,RUNNING,MULTICAST

                To be fair, I'm not sure what causes an interface to go into BLOCKING mode, because I never (intentionally) use it. :\

                I'm looking in the right place, right?  Did my deletem/addm, came back.  Shows the same thing.

                  sullrich

                  Look underneath that, there is a blocking / forwarding / listening entry for each interface in the bridge.
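
A rough sketch of pulling those per-member entries out of the ifconfig output, so they can be logged when the bridge drops. It assumes member lines of the form `member: sis0 flags=3<LEARNING,DISCOVER>`; the exact layout varies between FreeBSD releases, so treat the pattern as an assumption. The function name is made up for illustration, and it only extracts the flags list on each member line.

```shell
# Hypothetical helper: read ifconfig output on stdin and print
# "<member> <flags>" for every bridge member line.
# Assumed input line format:
#     member: sis0 flags=3<LEARNING,DISCOVER>
bridge_member_states() {
    awk '$1 == "member:" {
        flags = $3
        sub(/^flags=[0-9a-fx]*</, "", flags)   # strip the "flags=N<" prefix
        sub(/>$/, "", flags)                   # strip the closing ">"
        print $2, flags
    }'
}
```

Usage would be something like `ifconfig bridge0 | bridge_member_states`, run once while the bridge is working and again right after it dies, to compare the two.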

                    Numbski

                    When working, they read: learning, discover.  Waiting for the next failure….

                    Failure happened.  Same thing.  LEARNING, DISCOVER. For grins I've enabled STP on both, although I really don't think this is a packet storm problem anymore, since I'm not seeing broadcasts coming across the bridge0, sis0, or tap0 interfaces.  Probably a good measure anyway since at some point I need to duplicate this config on the other firewall.

                      sullrich

                      You may be interested in this commit:

                      http://pfsense.com/cgi-bin/cvsweb.cgi/pfSense/usr/local/www/status_interfaces.php?rev=1.29.2.7;only_with_tag=RELENG_1

                      Shows the bridge status now under Status -> Interfaces

                        Numbski

                        Hmm.  Is it safe for me to grab that one file and plug it in, or is there something more formal I should do? (ie, cvs?)

                          sullrich

                          Yeah, it's safe.  Simply replace /usr/local/www/status_interfaces.php with that new one.

                            Numbski

                            Cool.  They both show learning.  Of course, after 5 mins it still dies, but they both show learning. ;)

                            Seriously, have to put this to rest for now.  I'll come back to it later. :)

                              Numbski

                              Testing remotely.  Quick note - works great, except for a minor detail.

                              If you intend to use STP, DO NOT, I repeat, DO NOT, enable STP on the tap interface.  Your actual hardware interface is fine, but doing so on the tap interface creates a really odd situation where traffic hits the endpoint tap interface, and gets to your bridge, but nothing ever returns.  Disabling STP on the tap interface resolves that problem.

                              Otherwise all is well.  Just need to figure out why CARP chokes after 5 mins.

                                Numbski

                                Another update.  Looked like all was working just fine, until the firewall ground to a halt.  Same behavior as before too.  It responds to ctrl-alt-del by trying to shut down, but fails to actually do so.  Has to be hard rebooted.

                                When I get a chance to power cycle it, I'll see if I can set up watchdog to mitigate this side effect until I can find the root cause.  Again, if you have any speculations as to the cause, post up and I'll try it.  Also, anyone without CARP who wants to try this, see what happens and let me know.

                                  Numbski

                                  For the sake of discussion, I think I left off an option that might be causing an issue.  Dunno yet:

                                  dev-node tap-bridge

                                  Here are the official OpenVPN docs on the matter.  Surprised that I overlooked that directive.

                                  http://openvpn.net/bridge.html

                                  It claims that directive is only required under Windows though.  Another comment is this:

                                  A common mistake that people make when manually configuring an Ethernet bridge is that they add their primary ethernet adapter to the bridge before they have set the IP and netmask of the bridge interface. The result is that the primary ethernet interface "loses" its settings, but the equivalent bridge interface settings have not yet been defined, so the net effect is a loss of connectivity on the ethernet interface.

                                  So, despite what I was reading elsewhere, it appears that the openvpn folks would prefer we do this:

                                  ifconfig sis0 up
                                  ifconfig tap0 up
                                  ifconfig bridge0 create
                                  ifconfig bridge0 addm sis0 addm tap0
                                  ifconfig bridge0 172.16.10.2 netmask 255.255.255.0

                                  The problem here of course is the impact this would have on CARP.  I have sis0 in carp3, and I cannot do addm carp3.  I don't know (and can't easily test at this moment) whether I can ifconfig bridge0 instead of sis0, and still have it able to join a carp cluster.  If anyone wants to speak up on that point as well, please do.  It will be about a week before I can safely test that (I think?).  I might have an opportunity while in Montreal.

                                  If this is indeed correct, then from pfSense's point of view, we need to be able to change the lan interface (or in my case, opt interface) to be bridge0 and not sis0.  That way all rules are being applied to the bridge and not to the physical interface, unless someone wants to step up with more information to say otherwise.  I'm honestly just not finding much info in regards to FreeBSD, bridging, and rules re: pf, only that you should only create rules for one interface and not both, as it screws things up.  I haven't found any documentation on whether rules should be applied specifically to the bridge, or to the physical ints.

                                  Also, I'm puzzled by STP hosing things up on tap0.  Doesn't make sense to me.

                                    Numbski

                                    In Montreal now.  Noticed that I can't actually set up a watchdog timer, as it requires kernel support (and it isn't in GENERIC), so oops. :)

                                    Have to find another way for now.

                                    Might I suggest we officially enable watchdog in the kernel?  Seems like a very logical, sane thing to have in a firewall.  If the kernel stops responding for x seconds, reboot the system.

                                      sullrich

                                      We already support the GEOD watchdog but I do not plan on adding the SW_WATCHDOG as it may interfere with systems this late in the testing cycle.

                                      We may be able to add it to 1.1.

                                        Numbski

                                        Ah, cool.  Thanks.  Hopefully I'll have time later to rebuild with SW_WATCHDOG for my own purposes.  Doesn't really fix the problem at hand, but makes me feel better to know the system will kick itself. ;)

                                          Numbski

                                          New observations. :D

                                          I had the opportunity to do an openvpn bridge on a pfSense RC2i box without CARP.  Worked 99% flawlessly with the current code.

                                          • Added server-bridge directive.
                                          • Assigned tap0 as an opt.
                                          • Bridged that opt to lan using the webui.
                                          • Set an any-any rule on the opt.

                                          The only thing that didn't work?  STP was enabled on the tap interface by default!  ifconfig bridge0 -stp tap0, and all was well.

                                          Really, REALLY screwy stuff here.  Wonder if I should just re-load my firewalls when I get back and start clean?  ???

                                          Would help if someone could verify my findings.

                                            Numbski

                                            Been up for a couple of days, completely stable on bridging on everyone's pfSense boxes but my own.

                                            Go fig.  ;D

                                            So yeah.  Put in a statement to check if an interface is a tap interface, and if it is, don't enable STP.  Do that, OpenVPN bridging is good to go.  Works quite nicely with CARP too, despite my initial experiences.  Just do "local (CARP IP)" on both boxes, and presuming you've used the same server crt, ca cert, server key, and dh, it will fail over gracefully.
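
To make the "local (CARP IP)" part concrete, here is a hedged sketch that prints the relevant server directives for a given CARP address. The function name, the certificate and key file names, and the client pool range are placeholders I made up; 172.16.10.2/255.255.255.0 is just the bridge address from the ifconfig example earlier in the thread. The same output would go on both boxes, alongside identical copies of the CA cert, server cert, server key, and DH file.

```shell
# Hypothetical helper: print the OpenVPN server directives discussed above.
# $1 is the shared CARP IP that both firewalls should bind to.
emit_bridge_server_conf() {
    carp_ip="$1"
    cat <<EOF
# Bind to the CARP address so either box can answer after failover
local $carp_ip
dev tap0
# gateway, netmask, pool start, pool end (pool range is a placeholder)
server-bridge 172.16.10.2 255.255.255.0 172.16.10.200 172.16.10.220
# File names below are placeholders; use the same files on both boxes
ca ca.crt
cert server.crt
key server.key
dh dh1024.pem
EOF
}
```

For example, `emit_bridge_server_conf 192.0.2.10` prints a fragment whose first directive is `local 192.0.2.10`.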

                                            Good stuff guys.  Sorry I made a three page thread on it.  At least if someone else has issues that match mine, they'll have something to go on.  When I return I'll try doing a fresh load on my boxes and see what my results are.  I think you can safely say that OpenVPN bridging works though.

                                            NOTE: The change needs to be made ~ line 144 in /etc/inc/interfaces.inc.  Currently it looks like this:

                                            
                                            if(!is_interface_wireless($lancfg['if']) and
                                               !is_interface_wireless($config['interfaces'][$lancfg['bridge']]['if']))
                                                    mwexec("/sbin/ifconfig bridge{$bridges_total} stp {$config['interfaces'][$lancfg['bridge']]['if']} stp {$lancfg['if']}");
                                            
                                            

                                            I'm thinking on that first line we need to add something along the lines of a negated regex, maybe !/tap/?  Don't know precisely how that goes in PHP.  Then after the mwexec line, we add an elseif block that says the same thing, only don't negate the regex, and on the mwexec line, leave off the stp part.  Make sense?
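
The decision logic can be sketched in shell, to match the ifconfig commands used elsewhere in this thread. The real change would of course be PHP in /etc/inc/interfaces.inc; this only illustrates the rule "ask for STP on every bridge member except tap interfaces". Both function names are hypothetical.

```shell
# Hypothetical helper: succeed if the interface name looks like a tap device.
is_tap_interface() {
    case "$1" in
        tap[0-9]*) return 0 ;;
        *)         return 1 ;;
    esac
}

# Hypothetical helper: print the ifconfig arguments that enable STP on the
# given bridge ($1) for every listed member that is not a tap interface.
# Prints nothing if no member qualifies.
bridge_stp_args() {
    bridge="$1"
    shift
    args=""
    for member in "$@"; do
        if ! is_tap_interface "$member"; then
            args="$args stp $member"
        fi
    done
    if [ -n "$args" ]; then
        printf '%s%s\n' "$bridge" "$args"
    fi
}
```

With the interfaces from this thread, `bridge_stp_args bridge0 sis0 tap0` yields the arguments `bridge0 stp sis0`, so tap0 never gets STP turned on in the first place.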

                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.