Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    Weird issue

    Scheduled Pinned Locked Moved TNSR
    10 Posts 4 Posters 1.3k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • S
      schnitzel_itdept
      last edited by

      I have TNSR installed on a couple of HP servers with 4 10g NICs each, divided into 2x 20g LACP bonds per server (LAN and WAN).
      VRRP is configured, and NAT.
      When I set the VRRP LAN IP as a default gateway for a client device and ping something on the internet, I get 3 pings, a timeout, and repeat. When doing a speedtest I get a transient drop in speed every few seconds. Otherwise it works fine. What did I do wrong?...
      Thanks for any help! 😎

      kiokomanK 1 Reply Last reply Reply Quote 0
      • kiokomanK
        kiokoman LAYER 8 @schnitzel_itdept
        last edited by

        I would check the switch and the server maybe mismatched settings on lacp ?

        ̿' ̿'\̵͇̿̿\з=(◕_◕)=ε/̵͇̿̿/'̿'̿ ̿
        Please do not use chat/PM to ask for help
        we must focus on silencing this @guest character. we must make up lies and alter the copyrights !
        Don't forget to Upvote with the 👍 button for any post you find to be helpful.

        S 1 Reply Last reply Reply Quote 0
        • S
          schnitzel_itdept @kiokoman
          last edited by

          @kiokoman Good point but I think it's all OK. bond settings:

             <bond-table>
                <bond>
                   <instance>0</instance>
                   <mode>lacp</mode>
                   <load-balance>l34</load-balance>
          

          As far as I can tell this should be compatible with the Unifi switches. I am using budget Chinese SFP+ DAC cables but hopefully that has nothing to do with it...
          lacp 16xg.PNG

          kiokomanK audianA 2 Replies Last reply Reply Quote 0
          • kiokomanK
            kiokoman LAYER 8 @schnitzel_itdept
            last edited by kiokoman

            @schnitzel_itdept
            from the documentation:
            there is a default timeout of 3 seconds when monitoring bonding peers with LACP.
            could this be related to the problem? (3 sec = 3 ping -> timeout )
            https://docs.netgate.com/tnsr/en/latest/interfaces/types-bond.html#bond-interface-settings

            ̿' ̿'\̵͇̿̿\з=(◕_◕)=ε/̵͇̿̿/'̿'̿ ̿
            Please do not use chat/PM to ask for help
            we must focus on silencing this @guest character. we must make up lies and alter the copyrights !
            Don't forget to Upvote with the 👍 button for any post you find to be helpful.

            1 Reply Last reply Reply Quote 0
            • S
              schnitzel_itdept
              last edited by

              @kiokoman
              Works great if I turn off one of the servers and/or disable the bonds on one of the servers, so the other one takes over VRRP master. So I think the problem is to do with VRRP....

              S 1 Reply Last reply Reply Quote 0
              • S
                schnitzel_itdept @schnitzel_itdept
                last edited by

                I set it up as per https://docs.netgate.com/tnsr/en/latest/recipes/vrrp-nat/index.html and I can see the second node occasionally being elected master every few seconds.... Argh!

                kiokomanK 1 Reply Last reply Reply Quote 0
                • kiokomanK
                  kiokoman LAYER 8 @schnitzel_itdept
                  last edited by

                  @schnitzel_itdept
                  storm control / rate limiting on multicast?

                  master will transmit advertisements. If other nodes fail to see advertisements from a higher priority node in a timely manner defined by the settings, control of the virtual address is assumed by the backup node with the next highest priority

                  ̿' ̿'\̵͇̿̿\з=(◕_◕)=ε/̵͇̿̿/'̿'̿ ̿
                  Please do not use chat/PM to ask for help
                  we must focus on silencing this @guest character. we must make up lies and alter the copyrights !
                  Don't forget to Upvote with the 👍 button for any post you find to be helpful.

                  S 1 Reply Last reply Reply Quote 0
                  • audianA
                    audian @schnitzel_itdept
                    last edited by

                    @schnitzel_itdept

                    Can you try different cables just to rule that out as a cause?

                    1 Reply Last reply Reply Quote 0
                    • S
                      schnitzel_itdept @kiokoman
                      last edited by schnitzel_itdept

                      @kiokoman we have flow control enabled on a few Unifi switches in order to speed up wifi, but they are quite a ways downstream from where the TNSR machines are

                      @audian I tried different cables to different switches, different SFP+ cards (Intel X520), and a whole different server for node A. No luck...

                      Here are the VRRP settings... node A is internal IP .11 and external .181, B is 12 and 182. NAT is configured
                      vrrp.PNG

                      S 1 Reply Last reply Reply Quote 0
                      • S
                        stratagem-ben @schnitzel_itdept
                        last edited by

                        @schnitzel_itdept Can do you do a packet capture on server B to double check all advertisements are received correctly?

                        1 Reply Last reply Reply Quote 0
                        • First post
                          Last post
                        Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.