Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    High interrupts on WAN/LAN interfaces?

    Scheduled Pinned Locked Moved Hardware
    55 Posts 4 Posters 6.4k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • R
      rmeskill @stephenw10
      last edited by

      igc.0 is the WAN that is seeing the loss

      R 1 Reply Last reply Reply Quote 0
      • R
        rmeskill @rmeskill
        last edited by

        Just to add to the curiosity, all loss cleared up approximately 2 hours ago and appears to have remained stably loss-free since then. I spoke to my provider again and they seem quite insistent there is nothing wrong with my circuit and they see no reason why this should be happening. However, they also confirmed the modem they have given me is quite dumb and 1) cannot hard-code interface speeds/duplex and 2) does not support a DMZ host, so if I moved it back to router mode I'd need to manually open each and every port.

        1 Reply Last reply Reply Quote 0
        • stephenw10S
          stephenw10 Netgate Administrator
          last edited by

          @rmeskill said in High interrupts on WAN/LAN interfaces?:

          dev.igc.0.mac_stats.good_pkts_recvd: 147664355
          dev.igc.0.mac_stats.total_pkts_recvd: 147669598

          Hmm, the only thing I see there is that. Implying ~5000 bad packets received. But no errors so probably just rejected. Not really a huge number out of 147M.

          I see about the same percentages here:

          dev.igc.3.mac_stats.good_pkts_recvd: 1208385
          dev.igc.3.mac_stats.total_pkts_recvd: 1208642
          

          And have no issues.

          One difference I do see is no msix_vector values:

          dev.igc.3.iflib.allocated_msix_vectors: 2
          dev.igc.3.iflib.use_extra_msix_vectors: 0
          

          Do you see that in the boot logs?

          igc0: <Intel(R) Ethernet Controller I226-V> mem 0x67a00000-0x67afffff,0x67b00000-0x67b03fff at device 0.0 on pci2
          igc0: Using 1024 TX descriptors and 1024 RX descriptors
          igc0: Using 1 RX queues 1 TX queues
          igc0: Using MSI-X interrupts with 2 vectors
          igc0: Ethernet address: 00:08:a2:12:ec:d4
          igc0: netmap queues/slots: TX 1/1024, RX 1/1024
          
          1 Reply Last reply Reply Quote 0
          • stephenw10S
            stephenw10 Netgate Administrator
            last edited by

            Oh and note that the igc driver can only link using negotiated speed and duplex. It will fail to connect to a NIC that is actually using a fixed speed and does not negotiate.

            R 1 Reply Last reply Reply Quote 0
            • R
              rmeskill @stephenw10
              last edited by

              @stephenw10

              igc0: <Intel(R) Ethernet Controller I226-V> mem 0x80200000-0x802fffff,0x80300000-0x80303fff at device 0.0 on pci2
              igc0: Using 1024 TX descriptors and 1024 RX descriptors
              igc0: Using 4 RX queues 4 TX queues
              igc0: Using MSI-X interrupts with 5 vectors
              igc0: Ethernet address: 7c:2b:e1:13:d7:66
              igc0: netmap queues/slots: TX 4/1024, RX 4/1024

              1 Reply Last reply Reply Quote 0
              • stephenw10S
                stephenw10 Netgate Administrator
                last edited by

                Hmm, did you reboot after setting override_nrxqs to 1? You might need to set that as a loader value rather than a tunable

                R 1 Reply Last reply Reply Quote 0
                • R
                  rmeskill @stephenw10
                  last edited by rmeskill

                  @stephenw10 I did, but now I've added it to loader value, rebooted, and it's up:
                  99325ca4-b268-453a-af41-b5766bebf15f-image.png
                  loss is back, though-was completely clean for about 3h, now back to 40-60% loss...

                  R 1 Reply Last reply Reply Quote 0
                  • R
                    rmeskill @rmeskill
                    last edited by

                    If I'm honest, I'm starting to just think my provider might, on top of having some issues in their network, be blocking ICMP packets sporadically. Because despite seeing 20-60%+ loss, I'm seeing quite low RTTs that are staying quite consistent:
                    e175806f-28b5-49e9-8706-198f63516ebc-image.png
                    The issue for me has been my secondary gateways (via Wireguard tunnels) are dropping because I have them set to drop based on loss, but perhaps I should instead change the down setting to latency and see how that treats me... There are still definitely gremlins in there somewhere, as sometimes I do crap out and throughput drops to basically zero, but it seems somewhat counterintuitive that I can be allegedly running loss of 20-60% but also see fine real-world performance.

                    1 Reply Last reply Reply Quote 0
                    • stephenw10S
                      stephenw10 Netgate Administrator
                      last edited by

                      Are you seeing the NIC actually using 1 queue now?

                      Ah, also check: sysctl hw.igc.max_interrupt_rate

                      I think we bumped that default to 20000

                      R 1 Reply Last reply Reply Quote 0
                      • R
                        rmeskill @stephenw10
                        last edited by rmeskill

                        @stephenw10
                        Yes, actually seeing 1 queue now, but still loss
                        igc0: <Intel(R) Ethernet Controller I226-V> mem 0x80200000-0x802fffff,0x80300000-0x80303fff at device 0.0 on pci2
                        igc0: Using 1024 TX descriptors and 1024 RX descriptors
                        igc0: Using 1 RX queues 1 TX queues
                        igc0: Using MSI-X interrupts with 2 vectors
                        igc0: Ethernet address: 7c:2b:e1:13:d7:66
                        igc0: netmap queues/slots: TX 1/1024, RX 1/1024
                        Checked the interrupt rate and it was 20000, so no dice there

                        1 Reply Last reply Reply Quote 0
                        • stephenw10S
                          stephenw10 Netgate Administrator
                          last edited by

                          Hmm, well seems to be configured correctly. At least largely the same as the 4200 which works fine. 😕

                          I guess I'd try to test the WAN with something else.

                          R 1 Reply Last reply Reply Quote 0
                          • R
                            rmeskill @stephenw10
                            last edited by

                            @stephenw10 Yeah, I'm out of ideas as well. I can definitely 'feel' the crap performance when my packet loss is upwards of 50%, even if it feels 'better' down around 20%. I might even just try turning off my provider router for an hour or something just to see if that'll do anything-I'm absolutely beyond guessing at this point... Second option is just to go with a 5G router from another provider, but not sure what sort of throughput I can get from that.

                            Actually, unrelated, but as long as I've got your attention: if I have a USB wireless adapter would it be possible to use that to connect to my neighbor's wireless and set that as a secondary gateway in the event mine has packet loss? I plugged it in and it clearly comes up in boot, but I can't see it anywhere in the GUI...
                            Root mount waiting for: usbus0
                            ugen0.2: <Realtek 802.11ac NIC> at usbus0

                            R 1 Reply Last reply Reply Quote 0
                            • R
                              rmeskill @rmeskill
                              last edited by

                              In a hail mary tried to shut off the provider router overnight just to see if resting it somehow (heat/saved setting/etc) would fix it, but on turning it back on I'm seeing 50% loss again. Called the provider again and they confirm they're seeing no issues, but they're going to send a NID/in-line test for 24h to see if they can spot anything.

                              1 Reply Last reply Reply Quote 0
                              • stephenw10S
                                stephenw10 Netgate Administrator
                                last edited by

                                It is possible but the support for WiFi devices in FreeBSD/pfSense is very limited. That string you see there is just send to the USB subsystem by the device it doesn't imply any driver is attached.

                                You can query the device IDs using: usbconfig dump_device_desc

                                There might be a driver you cab load. It will only connect at .11n speeds though. That might be enough.

                                R 1 Reply Last reply Reply Quote 0
                                • R
                                  rmeskill @stephenw10
                                  last edited by

                                  @stephenw10 Right, well I'll see what I can find. Easiest, though, might just be to connect the secondary router (which does have integrated wireless bridging like this) to another interface on the PFSense and use it as a proper (if multi-hop) backup.

                                  R 1 Reply Last reply Reply Quote 1
                                  • R
                                    rmeskill @rmeskill
                                    last edited by

                                    Ok, so I've successfully setup the secondary router, connected via wireless to my neighbor's flat, to another port on PFSense. Haven't implemented it yet, but plan is to now setup a routing group targeting my (DIA) cabled connection directly to PFSense first, then via the secondary (GLiNet) router with failover based on loss.

                                    I'm going to try and monitor this a bit, and can now also test running a cable connection via the secondary router directly to my DIA (as it'll do multi-WAN as well) to confirm if it's an issue with my PFSense box or my internet connection once and for all

                                    ae41ce2b-ad30-4769-a67a-42e0b45ee883-image.png
                                    7b15d002-1281-458b-93c3-d4b258c77901-image.png

                                    R 1 Reply Last reply Reply Quote 0
                                    • R
                                      rmeskill @rmeskill
                                      last edited by

                                      Indeed, managed to get that working, and, I guess predictably, loss drops off when almost nothing is running across the (old, wired) WAN link:
                                      f00ed048-60ea-4708-8ea2-8e1d5f59af5e-image.png

                                      R 1 Reply Last reply Reply Quote 0
                                      • R
                                        rmeskill @rmeskill
                                        last edited by

                                        And actually, I take that back...
                                        3394900d-e290-46c0-881a-bb012326b328-image.png
                                        No reason to be seeing loss on that link (admittedly, it is connected via my neighbor next door, but I'm on it via the secondary router on a loss-free VoIP call) so I'm beginning to think it is, again, just an issue with this Topton box somehow...

                                        1 Reply Last reply Reply Quote 0
                                        • stephenw10S
                                          stephenw10 Netgate Administrator
                                          last edited by

                                          If it is a local issue you would still loss when it's monitoring just the local gateway IP rather than 8.8.4.4.

                                          R 1 Reply Last reply Reply Quote 0
                                          • T
                                            Tzvia
                                            last edited by

                                            I had a similar issue when going from an older Qotom box to my current 'Hunsn' CWWK box with 6 i226v Intel network ports. I had set it up and just expected it to work correctly once I reassigned the network ports, but I had several 'packet loss' issues a day at a minimum. Working from home using ZOOM phone, calls would drop, Citrix would drop... ugg. I set the WAN on a different port, issue followed the WAN. Never had this issue on any other port (vlans on them) just the one connected to the Spectrum cable modem. I have a limiter setup and decided to tweak it being I am on a new device with different NICs, so read up on the 'latest recommendations' but they didn't work. Can't set PASS, have to set it to 'MATCH' for it to work at all, and someone also mentioned not to set ECN ON so I set both accordingly. Then I noticed, on the main STATUS page, NTP status, that time was off and it wasn't synching (sorry can't remember the exact verbiage) so I went to NTP to look. I had left the 4 PFSense pools and had added 4 NTP.ORG pools. I started thinking... they are pools, maybe I shouldn't put too many here, and left just one NTP.ORG pool and 3 PFSense pools, with only 2 pools checked. Now the NTP widget on the status page shows I am connected to a pool server and the time is synched. I did also zip tie a fan on the side of my janky Spectrum cable modem as it gets quite warm. Bingo NO MORE DROPS. I don't get it.

                                            Some have said on other forums that the power supply that comes with these are sketchy so I had bought another online and had used it from the beginning, but switched it back at some point (don't remember if it was before or after my issues resolved). The one mine came with is 'Dajing' branded and is the same one that my old Qotom came with and it worked fine on that router.

                                            Hope some bit of my ramblings help you out but I couldn't go a day without at least 3 periods of drops, that could last from 10 seconds to a minute or more. Maybe I should have 'started new' with settings instead of importing and fixing, but I really didn't want to start over...

                                            Tzvia

                                            Current build:
                                            Hunsn/CWWK Pentium Gold 8505, 6x i226v 'micro firewall'
                                            16 gigs ram
                                            500gig WD Blue nvme
                                            Using modded BIOS (enabled CSTATES)
                                            PFSense 2.72-RELEASE
                                            Enabled Intel SpeedShift
                                            Snort
                                            PFBlockerNG
                                            LAN and 5 VLANS

                                            1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post
                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.