Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    LACP LAGG in Silicom NICs

    Scheduled Pinned Locked Moved General pfSense Questions
    14 Posts 2 Posters 1.6k Views 2 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • stephenw10S Offline
      stephenw10 Netgate Administrator
      last edited by

      That should work. I know of no reason why it wouldn't unless you have one of the NICs with LAN bypass that require a special driver.

      Do you have that NIC? Is it failing to connect?

      Steve

      DaddyGoD 1 Reply Last reply Reply Quote 0
      • DaddyGoD Offline
        DaddyGo @stephenw10
        last edited by

        @stephenw10 said in LACP LAGG in Silicom NICs:

        Do you have that NIC? Is it failing to connect?

        Sorry for my late reply, now I get access to this hardware again...
        -so this will be a pfSense which is designed for 10Gig and some direction 1Gig

        The base is a Cisco UCS C220 M3, with the following network options:

        • 2 ports LOM I350
        • PE2G4SFPI35L Server Adapter with I350AM4 (SFP 4x1Gig)
        • PE310G4I71L Server Adapter with Intel® XL710BM1 (SFP 4x10Gig)

        All interfaces are perfectly detected by the OS (that's a total of ten).
        the problem is with the 4x1Gig card (PE2G4SFPI35L) when setting up 2 ports as LAGG with LACP...

        the configuration is almost complete, but I faced the following under the interface tests:

        step1: a test client (Intel i211) connected to the LAGG interface, the DHCP server on interface (LAGG0) detects a DHCP request, but returns an error that it is not possible to send 300 bytes of data on this interface...

        something like this:

        0c306e73-b7fa-465b-9a14-d5747196711d-image.png

        Notes:

        1. configured for 10Gig NIC with the same setting LAGG / LACP - 2 ports), DHCP works without problems

        2. strange short crashes, when adding 4x1Gig card settings in "System Tunables", such as EEE disable, FC disable
          in this case the two LEDs on the current channel of the NIC flashes and the GUI is inaccessible for about 6 to 8 minutes
          (this happens per interface, igb2 - 5)

        there is no need to restart, after a while the GUI will recover and the LEDs will stop flashing

        if I disassemble the LAGG all the interfaces here work perfectly...

        Cats bury it so they can't see it!
        (You know what I mean if you have a cat)

        1 Reply Last reply Reply Quote 0
        • stephenw10S Offline
          stephenw10 Netgate Administrator
          last edited by

          Hmm, that is odd.

          Does the LACP continue to show as UP?

          Both links good in ifconfig -vvv lagg0 ?

          Does it still do that if you do not have any custom loader variables or sysctls set?

          Steve

          DaddyGoD 1 Reply Last reply Reply Quote 0
          • DaddyGoD Offline
            DaddyGo @stephenw10
            last edited by

            @stephenw10

            THX :)

            "Does the LACP continue to show as UP?" = yes (in the DHCP logs you can see that, the interface (lagg0) goes down, but it never goes down)

            • on the other hand there is no traffic on it, I tried now, there is not even PING towards MikroTik, but it seems completely "alive"

            66a33cea-1faa-463f-bc8a-707d9b5356c6-image.png

            "Both links good in ifconfig -vvv lagg0 ?"

            bce86872-6537-4f0e-9f57-916adc82edda-image.png

            Does it still do that if you do not have any custom loader variables or sysctls set?

            just the usual, these:

            45163c0a-803e-4bfd-aa4a-c879ecb8d7d6-image.png

            and
            strangely on the other side, MikroTik (CSS610-8G-2S+IN) does not display LAGG

            2020-12-22_15h29_43.jpg

            2020-12-22_15h30_59.jpg

            the LAGG as you can see, are on igb3-4
            maybe I have to remove everything from loader.conf.local and so examine this way...(?)

            what's going on in my head:😉

            -does the group is affected, if I configure the parent intefaces separately?
            (but this is not logical, unless I configure the lagg0 interface only and not the parents

            like:

            loader.conf.local igb3-4 (disable EEE / FC) (parents)
            and / or sys. tunables...?

            instead:
            dev.lagg0..........etc?

            The final question is why the 10G NIC is behaving properly...(?)

            Cats bury it so they can't see it!
            (You know what I mean if you have a cat)

            1 Reply Last reply Reply Quote 0
            • stephenw10S Offline
              stephenw10 Netgate Administrator
              last edited by

              Mmm, that lagg does not look good.
              I expect to see something more like:

              [2.4.5-RELEASE][admin@7100.stevew.lan]/root: ifconfig -vvv lagg0
              lagg0: flags=8943<UP,BROADCAST,RUNNING,PROMISC,SIMPLEX,MULTICAST> metric 0 mtu 1500
              	options=500b8<VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU,VLAN_HWCSUM,VLAN_HWFILTER,VLAN_HWTSO>
              	ether 00:e0:ed:86:a6:8c
              	inet6 fe80::2e0:edff:fe86:a68c%lagg0 prefixlen 64 scopeid 0x15
              	inet 172.21.16.206 netmask 0xffffff00 broadcast 172.21.16.255
              	nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
              	media: Ethernet autoselect
              	status: active
              	groups: lagg
              	laggproto lacp lagghash l2,l3,l4
              	lagg options:
              		flags=10<LACP_STRICT>
              		flowid_shift: 16
              	lagg statistics:
              		active ports: 2
              		flapping: 0
              	lag id: [(8000,00-E0-ED-86-A6-8C,02B2,0000,0000),
              		 (0001,60-9C-9F-54-14-F2,561F,0000,0000)]
              	laggport: ixl0 flags=1c<ACTIVE,COLLECTING,DISTRIBUTING> state=3d<ACTIVITY,AGGREGATION,SYNC,COLLECTING,DISTRIBUTING>
              		[(8000,00-E0-ED-86-A6-8C,02B2,8000,0001),
              		 (0001,60-9C-9F-54-14-F2,561F,0001,0041)]
              	laggport: ixl1 flags=1c<ACTIVE,COLLECTING,DISTRIBUTING> state=3d<ACTIVITY,AGGREGATION,SYNC,COLLECTING,DISTRIBUTING>
              		[(8000,00-E0-ED-86-A6-8C,02B2,8000,0002),
              		 (0001,60-9C-9F-54-14-F2,561F,0001,0043)]
              

              You have 0 active ports. Those that are shown are defaulted.

              Something is mis-matched there.

              Steve

              DaddyGoD 1 Reply Last reply Reply Quote 0
              • DaddyGoD Offline
                DaddyGo @stephenw10
                last edited by DaddyGo

                @stephenw10 said in LACP LAGG in Silicom NICs:

                Something is mis-matched there.

                since I saw somewhere that the Silicom, a Netgate partner or vica versa....

                I was hoping you had come across a similar Silicom issue...
                setting up a LAGG is not a big deal, so the weird behavior is likely to be sought deeper... (NIC FW or I dont know)

                so I do remove everything (loader.conf..... sys tunables) which is related to the NIC and try to set up the LAGG again

                BTW:
                have you ever seen the LEDs flashes when you save the "sys tunables" setting?
                this is the strangest fact of all

                +++edit:
                yes what still conspicuous:

                and you saw that weird thing is that DHCPOFFER is present in the DHCP log, ergo there must be some traffic on the interface

                Cats bury it so they can't see it!
                (You know what I mean if you have a cat)

                1 Reply Last reply Reply Quote 0
                • stephenw10S Offline
                  stephenw10 Netgate Administrator
                  last edited by stephenw10

                  Try enabling lacp debugging and see if anything is happening:
                  sysctl net.link.lagg.lacp.debug=1

                  It looks like there are probably no lacpdus arriving from the switch for some reason.

                  Silicom bought ADI who designed a number of our systems. I've never really used their stand alone NICs though.

                  Steve

                  DaddyGoD 2 Replies Last reply Reply Quote 1
                  • DaddyGoD Offline
                    DaddyGo @stephenw10
                    last edited by

                    @stephenw10 said in LACP LAGG in Silicom NICs:

                    Try enabling lacp debugging and see if anything is happening:

                    I don't do it today, a few days and I'll be back.
                    Somewhere here is a Cisco SG350X-24 in the lab, I will try it with this switch too...
                    Thank you for your guidance so far...

                    @stephenw10 "Silicom bought ADI who designed a number of our systems."

                    I love the Silicom stuffs and I have never had a problem with it before...
                    I didn't know about "marriage", - ADI

                    Have a nice Christmas

                    Cats bury it so they can't see it!
                    (You know what I mean if you have a cat)

                    1 Reply Last reply Reply Quote 0
                    • DaddyGoD Offline
                      DaddyGo @stephenw10
                      last edited by

                      @stephenw10 said in LACP LAGG in Silicom NICs:

                      Try enabling lacp debugging and see if anything is happening:
                      sysctl net.link.lagg.lacp.debug=1

                      What seems certain is that the Silicom i350-F4 is causing the LAGG problem.... (igb...)

                      because with an Intel X710-D4 (4 ports 10Gig SFP) everything works perfectly

                      sysctl net.link.lagg.lacp.debug=1

                      2021-02-08_18h44_48.jpg

                      ifconfig -vvv lagg0

                      2021-02-08_18h48_47.jpg

                      DHCP on the LAGG interface is also problem-free with this NIC
                      the problem now is that we don't need 10Gig in this pfSense box 😉

                      Cats bury it so they can't see it!
                      (You know what I mean if you have a cat)

                      1 Reply Last reply Reply Quote 0
                      • stephenw10S Offline
                        stephenw10 Netgate Administrator
                        last edited by

                        Do you see any incoming traffic in the lacp debug messages when using the igb NICs?

                        DaddyGoD 1 Reply Last reply Reply Quote 0
                        • DaddyGoD Offline
                          DaddyGo @stephenw10
                          last edited by

                          @stephenw10 said in LACP LAGG in Silicom NICs:

                          Do you see any incoming traffic

                          Yes the DHCP req. it arrives but the answer to it doesn't go out, due to....

                          1d39750e-d529-4001-bed4-04b9fbd573cf-image.png

                          In the meantime, I talked about this with Silicom support...
                          OP ROM is disabled at the factory on these cards, so Cisco CIMC does not recognize NIC MAC addresses...

                          SILICOM:
                          "Hi,
                          This product is shipped without the PXE boot room enabled and it is not programmed on it."

                          it is likely that the special behavior is caused by this, so this is not a pfSense problem (Cisco + Silicom w/o OP ROM)
                          if I don't use the LAGG setting on the ports (interfaces), everything behaves as expected

                          210b298a-1a77-4459-aad4-2edead7e377c-image.png

                          now it seems to me that it only affects the LAGG settings, don't ask why - I have not seen such a thing 😉

                          The solution will be a dual rate SFP module loaded with Intel code, so we will have 1 / 10G at our disposal
                          https://www.fs.com/de-en/products/36431.html

                          Cats bury it so they can't see it!
                          (You know what I mean if you have a cat)

                          1 Reply Last reply Reply Quote 0
                          • stephenw10S Offline
                            stephenw10 Netgate Administrator
                            last edited by

                            Hmm, the only time I've seen anything similar to this was on the very early 7100 ix ports. They worked in every way but not as part of an LACP LAGG. It's been a while since we saw it but if I recall they would see traffic coming in but seemingly never sent anything out.
                            Completely different NIC type though.

                            Steve

                            DaddyGoD 1 Reply Last reply Reply Quote 0
                            • DaddyGoD Offline
                              DaddyGo @stephenw10
                              last edited by

                              @stephenw10 said in LACP LAGG in Silicom NICs:

                              Completely different NIC type though.

                              Yep, I think I'll wait a bit and test again under 2.5.(?)
                              although, if I read it correctly (somewhere), only "ixl" got a brand new driver under FB12

                              Cats bury it so they can't see it!
                              (You know what I mean if you have a cat)

                              1 Reply Last reply Reply Quote 0
                              • First post
                                Last post
                              Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.