• Categories
  • Recent
  • Tags
  • Popular
  • Users
  • Search
  • Register
  • Login
Netgate Discussion Forum
  • Categories
  • Recent
  • Tags
  • Popular
  • Users
  • Search
  • Register
  • Login

Gateway WAN keeps on having packet lost

General pfSense Questions
6
55
7.1k
Loading More Posts
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • S
    stephenw10 Netgate Administrator
    last edited by Dec 30, 2021, 4:44 PM

    Is the client you are pinging from in the same 192.168.1.X subnet?

    If so pings between there and the NAS don't go through pfSense at all but are still spiking up to 26ms at the same time. That implies some issue locally. Or maybe something flooding the switch?
    A loop at the switch that it blocks and then re-enables could behave like this. The lagg incorrectly configured could do that. Try disconnecting one of the links.

    Steve

    C 1 Reply Last reply Dec 30, 2021, 4:54 PM Reply Quote 0
    • C
      cheapie408 @stephenw10
      last edited by Dec 30, 2021, 4:54 PM

      @stephanw10

      I run everything on a 24 ports POE switch that also powers 16 IP cameras and the PC that I'm doing all this on is also the one Blue Iris resides.

      I just factory reset the Pfsense box. As I'm typing this, with no additional configuration. Took the switch out of the equation, basically this PC directly into the Pfsense box and nothing else attached to the pfsense box. This is what I'm getting while it's timing out and while it's responding

      alt text

      alt text

      1 Reply Last reply Reply Quote 0
      • S
        stephenw10 Netgate Administrator
        last edited by Dec 30, 2021, 5:11 PM

        What hardware are you running pfSense on?

        C 1 Reply Last reply Dec 30, 2021, 5:18 PM Reply Quote 0
        • C
          cheapie408 @stephenw10
          last edited by Dec 30, 2021, 5:18 PM

          @stephenw10 said in Gateway WAN keeps on having packet lost:

          What hardware are you running pfSense on?

          T620 (i think)
          AMD GX-420CA SOC with Radeon(tm) HD Graphics
          16Gb ram

          Intel 4x NIC card don't remember the exact model

          1 Reply Last reply Reply Quote 0
          • S
            stephenw10 Netgate Administrator
            last edited by Dec 30, 2021, 5:23 PM

            Hmm, and you're running 2.5.2?

            And this just started happening? It was running without loss previously?

            C 1 Reply Last reply Dec 30, 2021, 5:27 PM Reply Quote 0
            • C
              cheapie408 @stephenw10
              last edited by Dec 30, 2021, 5:27 PM

              @stephenw10 said in Gateway WAN keeps on having packet lost:

              Hmm, and you're running 2.5.2?

              And this just started happening? It was running without loss previously?

              Yes on 2.5.2. I can't say how long has this been going on. In fact for many months I don't log onto PFsense. I recently tried to add a MyQ garage opener and realised that it won't report online (I have another thread here on this subject). which triggered me to look at PFSense.

              Now that I'm thinking of it, my kids play Roblox and they are constantly kicked out of their games. It's been quite some time now.

              1 Reply Last reply Reply Quote 0
              • S
                stephenw10 Netgate Administrator
                last edited by Dec 30, 2021, 5:46 PM

                Ok can we assume that without any other traffic using the connection the WAN gateways show as up and without packet loss?

                If you connect via ssh and run top -aSH and the command line do you see any CPU cores being run at 100%? (idle processes at or close to 0%).

                That CPU appears to be well capable of 1Gbps but it's single thread performance is not fantastic and em NICs run with 1 queue. I still wouldn't expect anything like this though.

                C 1 Reply Last reply Dec 30, 2021, 6:28 PM Reply Quote 0
                • C
                  cheapie408 @stephenw10
                  last edited by Dec 30, 2021, 6:28 PM

                  @stephenw10
                  The WAN interface never really get reported as being down which is very weird, but on the gateway status, it does show off. It also seems that IPv4 shows to be off more often than IPV6.

                  With nothing else in the mix, I still experience packet loss.

                  top -aSH didn't pull up any CPU info.

                  Diagnostics/System activity it does show that the WCPU cores are all running at or close too 100%
                  11 root 155 ki31 0B 64K CPU0 0 87:04 98.88% [idle{idle: cpu0}]
                  11 root 155 ki31 0B 64K RUN 3 87:12 98.68% [idle{idle: cpu3}]
                  11 root 155 ki31 0B 64K CPU1 1 87:31 95.36% [idle{idle: cpu1}]
                  11 root 155 ki31 0B 64K CPU2 2 87:14 93.07% [idle{idle: cpu2}]

                  1 Reply Last reply Reply Quote 0
                  • S
                    stephenw10 Netgate Administrator
                    last edited by Dec 30, 2021, 7:19 PM

                    top -aSH at the command line looks like:

                    last pid: 89077;  load averages:  0.35,  0.55,  0.54                                            up 0+01:13:06  19:15:26
                    179 threads:   3 running, 150 sleeping, 26 waiting
                    CPU:  0.8% user,  0.0% nice,  1.4% system,  0.0% interrupt, 97.9% idle
                    Mem: 55M Active, 976M Inact, 368M Wired, 195M Buf, 521M Free
                    Swap: 1894M Total, 1894M Free
                    
                      PID USERNAME    PRI NICE   SIZE    RES STATE    C   TIME    WCPU COMMAND
                       11 root        155 ki31     0B    32K RUN      0  71:51  99.09% [idle{idle: cpu0}]
                       11 root        155 ki31     0B    32K CPU1     1  71:32  96.63% [idle{idle: cpu1}]
                     1720 root         52    0   132M    55M accept   1   0:05   0.16% php-fpm: pool nginx (php-fpm)
                        0 root        -76    -     0B   464K -        1   0:05   0.09% [kernel{if_config_tqg_0}]
                    84138 root         20    0    13M  4028K CPU0     0   0:00   0.07% top -aSH
                       12 root        -60    -     0B   416K WAIT     0   0:01   0.06% [intr{swi4: clock (0)}]
                    80075 root         20    0    28M  9636K kqread   0   0:01   0.02% nginx: worker process (nginx)
                       18 root        -16    -     0B    16K pftm     1   0:01   0.02% [pf purge]
                        0 root        -76    -     0B   464K -        0   0:00   0.01% [kernel{if_io_tqg_0}]
                    41729 root         20    0    11M  2840K select   1   0:00   0.01% /usr/sbin/syslogd -s -c -c -l /var/dhcpd/var/run/log
                    

                    You can see both CPU cores are mostly idle there. If anything was using a lot of CPU it would show above that.

                    C 1 Reply Last reply Dec 30, 2021, 8:13 PM Reply Quote 0
                    • C
                      cheapie408 @stephenw10
                      last edited by cheapie408 Dec 30, 2021, 8:15 PM Dec 30, 2021, 8:13 PM

                      @stephenw10

                      NVM it's cash sensitive it seems. Here's my result

                      alt text

                      1 Reply Last reply Reply Quote 0
                      • S
                        stephenw10 Netgate Administrator
                        last edited by Dec 30, 2021, 9:29 PM

                        Hmm, that was whilst you are passing traffic? Like running a speedtest?

                        I expect to see far more CPU usage than that. Nothing there looks like an issue though.

                        C 1 Reply Last reply Dec 30, 2021, 9:36 PM Reply Quote 0
                        • C
                          cheapie408 @stephenw10
                          last edited by Dec 30, 2021, 9:36 PM

                          @stephenw10

                          sorry that was it idling here's one while I'm running a speed test. Noticed how I drop to around 500Mbps now

                          alt text

                          1 Reply Last reply Reply Quote 0
                          • S
                            stephenw10 Netgate Administrator
                            last edited by Dec 30, 2021, 9:39 PM

                            Mmm, nothing unusual there either. No CPU core maxed out.

                            I guess I would be running a packet capture on the WAN at this point to see what's actually happening. Are there a load of retransmissions or packet fragments etc.

                            Steve

                            C 1 Reply Last reply Dec 30, 2021, 9:41 PM Reply Quote 0
                            • C
                              cheapie408 @stephenw10
                              last edited by cheapie408 Dec 30, 2021, 9:43 PM Dec 30, 2021, 9:41 PM

                              @stephenw10
                              i'll try to do that...how long should I be running the packet capture for?

                              here's a screen capture of the entire speedtest process.

                              https://vimeo.com/manage/videos/661294436/f74c230e65

                              1 Reply Last reply Reply Quote 0
                              • S
                                stephenw10 Netgate Administrator
                                last edited by Dec 30, 2021, 10:10 PM

                                I would start with 1000 packets showing the beginning of the test. If there's something broken there it should show pretty quickly.

                                Steve

                                C 1 Reply Last reply Dec 30, 2021, 10:25 PM Reply Quote 0
                                • C
                                  cheapie408 @stephenw10
                                  last edited by Dec 30, 2021, 10:25 PM

                                  @stephenw10 Here's what it captured. Not sure how to decode this information.

                                  packet capture.txt

                                  1 Reply Last reply Reply Quote 0
                                  • S
                                    stephenw10 Netgate Administrator
                                    last edited by Dec 30, 2021, 10:50 PM

                                    You need to look at the actual pcap file in Wireshark to see anything useful really.

                                    C 1 Reply Last reply Dec 30, 2021, 11:08 PM Reply Quote 0
                                    • C
                                      cheapie408 @stephenw10
                                      last edited by Dec 30, 2021, 11:08 PM

                                      @stephenw10 attached is the cap file. I can see that there are icmp errors and some extremely long response time in the traffic but is not smart enough to analyze it to identify the exact issue. :(

                                      https://drive.google.com/file/d/1l-6VkFO8zfGs8sUBnX7Spltxgp10trQo/view?usp=sharing

                                      1 Reply Last reply Reply Quote 0
                                      • S
                                        stephenw10 Netgate Administrator
                                        last edited by Dec 31, 2021, 12:30 AM

                                        Hmm, well the WAN was quite busy at that time. The 1000 packets covered only 2.3s.
                                        Load of random port UDP traffic in there. Clients behind torrenting maybe?

                                        The WAN sent 5 pings in that time to what I assume is the gateway IP and no replies came back. Did you set the monitoring back to the gateway IP?

                                        Did you enable promiscuous mode when doing that? That's usually a good idea as some things can be hidden otherwise.

                                        Overall apart from the lack of ping responses it doesn't look too terrible.

                                        Are you able to retry that with the WAN in promisc mode and preferably without the LAN side client spewing UDP traffic?

                                        Steve

                                        C ? 2 Replies Last reply Dec 31, 2021, 12:32 AM Reply Quote 0
                                        • C
                                          cheapie408 @stephenw10
                                          last edited by Dec 31, 2021, 12:32 AM

                                          I'll have to try again in the morning when wife and kids don't need the internet. When they're up every one is online

                                          1 Reply Last reply Reply Quote 0
                                          24 out of 55
                                          • First post
                                            24/55
                                            Last post
                                          Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.