Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    Gateway monitor down

    Scheduled Pinned Locked Moved General pfSense Questions
    83 Posts 5 Posters 16.8k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • K
      kevindd992002 @stephenw10
      last edited by

      @stephenw10 said in Gateway monitor down:

      Looks like the graph didn't upload.

      Sorry. I edited my post above to fix this. Here's another tracert result that also shows the problem:

      alt text

      1 Reply Last reply Reply Quote 0
      • K
        kevindd992002
        last edited by kevindd992002

        Here's the latency problem that is evident even when I have my router ping the WAN interface IP (first hop from my router):

        PING 112.205.32.1 (112.205.32.1) from {my router's WAN interface IP}: 56 data bytes
        64 bytes from 112.205.32.1: icmp_seq=0 ttl=255 time=1242.815 ms
        64 bytes from 112.205.32.1: icmp_seq=1 ttl=255 time=1310.078 ms
        64 bytes from 112.205.32.1: icmp_seq=2 ttl=255 time=1457.912 ms
        64 bytes from 112.205.32.1: icmp_seq=3 ttl=255 time=473.654 ms
        64 bytes from 112.205.32.1: icmp_seq=4 ttl=255 time=2.773 ms
        64 bytes from 112.205.32.1: icmp_seq=5 ttl=255 time=2.146 ms
        64 bytes from 112.205.32.1: icmp_seq=6 ttl=255 time=1.822 ms
        64 bytes from 112.205.32.1: icmp_seq=7 ttl=255 time=4.379 ms
        64 bytes from 112.205.32.1: icmp_seq=8 ttl=255 time=455.918 ms
        64 bytes from 112.205.32.1: icmp_seq=9 ttl=255 time=424.541 ms
        
        --- 112.205.32.1 ping statistics ---
        10 packets transmitted, 10 packets received, 0.0% packet loss round-trip min/avg/max/stddev = 1.822/537.604/1457.912/557.557 ms
        

        That's a a ping from the same WAN subnet and should be just less than 1ms or maybe even 2/3ms.

        1 Reply Last reply Reply Quote 0
        • stephenw10S
          stephenw10 Netgate Administrator
          last edited by

          Mmm, that's catastrophically bad!
          If that's the first hop, and it's not just the gateway not responding to ping, there's not much that pfSense can do about it. I assume you were not saturating the link at that time?

          K 1 Reply Last reply Reply Quote 0
          • K
            kevindd992002 @stephenw10
            last edited by

            @stephenw10 said in Gateway monitor down:

            Mmm, that's catastrophically bad!
            If that's the first hop, and it's not just the gateway not responding to ping, there's not much that pfSense can do about it. I assume you were not saturating the link at that time?

            Exactly! Yes, the gateway/first hop does respond respond to ping but the latency is very unstable as you see in my last post. No saturation at all.

            That's also why I'm very convinced that it's an ISP issue. I just don't know how to dumb it down for them to understand. They all base their "knowledge" on the results of www.speedtest.net. When I do a test there, I do see a 2ms latency which I think is just one of those ping results that's normal. But what's more important is the average of continous ping latency results which is what pfsense does.

            1 Reply Last reply Reply Quote 0
            • stephenw10S
              stephenw10 Netgate Administrator
              last edited by

              The gateway itself does not have to respond to ping so results against it directly are not necessarily indicative of an issue.

              I would try running smokeping or MTR against a number of external targets and see how that varies.

              Steve

              K 1 Reply Last reply Reply Quote 0
              • K
                kevindd992002 @stephenw10
                last edited by

                @stephenw10 said in Gateway monitor down:

                The gateway itself does not have to respond to ping so results against it directly are not necessarily indicative of an issue.

                I would try running smokeping or MTR against a number of external targets and see how that varies.

                Steve

                Yeah it doesn't need to respond to ping but shouldn't that be a clear cut yes or no scenario? Since we're seeing that it responds to ping, doesn't that tell us that it is setup to respond to ping?

                I do run smokeping and it's also seeing the issue:

                1df1632f-237b-4a8f-9e28-afcf42428624-image.png

                1 Reply Last reply Reply Quote 0
                • stephenw10S
                  stephenw10 Netgate Administrator
                  last edited by

                  Ouch, yeah that's pretty bad!

                  The gateway might respond to ping but it may not be prioritised at all. When the gateway is loaded it might drop ping packets or respond with high latency and that's acceptable if traffic through it is passed as expected. That's not happening here though.

                  K 1 Reply Last reply Reply Quote 0
                  • K
                    kevindd992002 @stephenw10
                    last edited by

                    @stephenw10 said in Gateway monitor down:

                    Ouch, yeah that's pretty bad!

                    Right? And now I'm getting intermittent packet losses with my pfsense gateway monitor. I disable gateway monitoring action for now in pfsense but because there are packet losses I can definitely "feel" how sluggish my Internet browsing is. I probably need to ask my neighbors if they experience the same.

                    The gateway might respond to ping but it may not be prioritised at all. When the gateway is loaded it might drop ping packets or respond with high latency and that's acceptable if traffic through it is passed as expected. That's not happening here though.

                    Ahh, that makes sense.

                    1 Reply Last reply Reply Quote 0
                    • K
                      kevindd992002
                      last edited by

                      @stephenw10

                      I did a bit more troubleshooting and got interesting results:

                      1. Tests from my wired desktop client directly connected to the ISP ONU:
                      C:\Users\Kevin>ping 112.204.224.1 -t
                      
                      Pinging 112.204.224.1 with 32 bytes of data:
                      Reply from 112.204.224.1: bytes=32 time=3ms TTL=255
                      Reply from 112.204.224.1: bytes=32 time=3ms TTL=255
                      Reply from 112.204.224.1: bytes=32 time=3ms TTL=255
                      Reply from 112.204.224.1: bytes=32 time=3ms TTL=255
                      Reply from 112.204.224.1: bytes=32 time=1ms TTL=255
                      Reply from 112.204.224.1: bytes=32 time=3ms TTL=255
                      Reply from 112.204.224.1: bytes=32 time=3ms TTL=255
                      Reply from 112.204.224.1: bytes=32 time=13ms TTL=255
                      Reply from 112.204.224.1: bytes=32 time=17ms TTL=255
                      Reply from 112.204.224.1: bytes=32 time=3ms TTL=255
                      Reply from 112.204.224.1: bytes=32 time=3ms TTL=255
                      Reply from 112.204.224.1: bytes=32 time=3ms TTL=255
                      Reply from 112.204.224.1: bytes=32 time=3ms TTL=255
                      Reply from 112.204.224.1: bytes=32 time=3ms TTL=255
                      
                      Ping statistics for 112.204.224.1:
                          Packets: Sent = 14, Received = 14, Lost = 0 (0% loss),
                      Approximate round trip times in milli-seconds:
                          Minimum = 1ms, Maximum = 17ms, Average = 4ms
                      Control-C
                      ^C
                      C:\Users\Kevin>tracert 112.204.224.1
                      
                      Tracing route to 112.204.224.1.pldt.net [112.204.224.1]
                      over a maximum of 30 hops:
                      
                        1     2 ms     3 ms     2 ms  112.204.224.1.pldt.net [112.204.224.1]
                      
                      Trace complete.
                      
                      C:\Users\Kevin>ping 8.8.8.8 -t
                      
                      Pinging 8.8.8.8 with 32 bytes of data:
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=28ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=49ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=28ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=28ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=28ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=28ms TTL=58
                      Reply from 8.8.8.8: bytes=32 time=27ms TTL=58
                      
                      Ping statistics for 8.8.8.8:
                          Packets: Sent = 31, Received = 31, Lost = 0 (0% loss),
                      Approximate round trip times in milli-seconds:
                          Minimum = 27ms, Maximum = 49ms, Average = 27ms
                      Control-C
                      ^C
                      C:\Users\Kevin>tracert 8.8.8.8
                      
                      Tracing route to dns.google [8.8.8.8]
                      over a maximum of 30 hops:
                      
                        1     3 ms     3 ms     3 ms  112.204.224.1.pldt.net [112.204.224.1]
                        2     4 ms     3 ms     3 ms  122.2.187.142.static.pldt.net [122.2.187.142]
                        3     *        *        *     Request timed out.
                        4    27 ms    27 ms    28 ms  210.213.130.103.static.pldt.net [210.213.130.103]
                        5    29 ms    28 ms    29 ms  74.125.118.24
                        6    28 ms    28 ms    28 ms  209.85.244.25
                        7    24 ms    23 ms    23 ms  216.239.42.89
                        8    27 ms    27 ms    27 ms  dns.google [8.8.8.8]
                      
                      Trace complete.
                      
                      1. Tests from the same wired desktop client connected through my ASUS RT-AC66U (acting as a switch), the same switch connected to my pfsense box, and the ISP ONU connected to the same pfsense box:
                      C:\Users\Kevin>ping 112.205.32.1 -t
                      
                      Pinging 112.205.32.1 with 32 bytes of data:
                      Reply from 112.205.32.1: bytes=32 time=2ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=1ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=6ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=2180ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=1571ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=52ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=448ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=1ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=3ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=2ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=740ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=523ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=1275ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=1318ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=17ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=88ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=4ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=3ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=3ms TTL=254
                      Reply from 112.205.32.1: bytes=32 time=523ms TTL=254
                      
                      Ping statistics for 112.205.32.1:
                          Packets: Sent = 20, Received = 20, Lost = 0 (0% loss),
                      Approximate round trip times in milli-seconds:
                          Minimum = 1ms, Maximum = 2180ms, Average = 438ms
                      Control-C
                      ^C
                      C:\Users\Kevin>tracert 112.205.32.1
                      
                      Tracing route to 112.205.32.1.pldt.net [112.205.32.1]
                      over a maximum of 30 hops:
                      
                        1    <1 ms    <1 ms    <1 ms  pfSense.condo.arpa [192.168.20.1]
                        2  1955 ms   235 ms    94 ms  112.205.32.1.pldt.net [112.205.32.1]
                      
                      Trace complete.
                      
                      C:\Users\Kevin>tracert 8.8.8.8
                      
                      Tracing route to dns.google [8.8.8.8]
                      over a maximum of 30 hops:
                      
                        1    <1 ms    <1 ms     1 ms  pfSense.condo.arpa [192.168.20.1]
                        2  1193 ms  1072 ms  1043 ms  112.205.32.1.pldt.net [112.205.32.1]
                        3    11 ms     4 ms     3 ms  122.2.187.146.static.pldt.net [122.2.187.146]
                        4     *        *        *     Request timed out.
                        5   615 ms  1429 ms   122 ms  210.213.130.103.static.pldt.net [210.213.130.103]
                        6  1646 ms   847 ms   769 ms  72.14.195.168
                        7    24 ms    23 ms    21 ms  108.170.231.19
                        8   245 ms   917 ms    46 ms  209.85.143.37
                        9  1570 ms  1371 ms    70 ms  dns.google [8.8.8.8]
                      
                      Trace complete.
                      
                      1. Tests from the pfsense diagnostic tools when ISP ONU is connected to it. I've made sure that traceroute using ping (instead of the default UDP) so that it mirrors exactly what tracert in Windows does.
                      PING 112.205.32.1 (112.205.32.1): 56 data bytes
                      64 bytes from 112.205.32.1: icmp_seq=0 ttl=255 time=789.747 ms
                      64 bytes from 112.205.32.1: icmp_seq=1 ttl=255 time=3276.583 ms
                      64 bytes from 112.205.32.1: icmp_seq=2 ttl=255 time=2379.589 ms
                      64 bytes from 112.205.32.1: icmp_seq=3 ttl=255 time=1405.570 ms
                      64 bytes from 112.205.32.1: icmp_seq=4 ttl=255 time=407.121 ms
                      64 bytes from 112.205.32.1: icmp_seq=5 ttl=255 time=2.525 ms
                      64 bytes from 112.205.32.1: icmp_seq=6 ttl=255 time=3.033 ms
                      64 bytes from 112.205.32.1: icmp_seq=7 ttl=255 time=1.394 ms
                      64 bytes from 112.205.32.1: icmp_seq=8 ttl=255 time=314.137 ms
                      64 bytes from 112.205.32.1: icmp_seq=9 ttl=255 time=594.438 ms
                      
                      --- 112.205.32.1 ping statistics ---
                      10 packets transmitted, 10 packets received, 0.0% packet loss
                      round-trip min/avg/max/stddev = 1.394/917.414/3276.583/1058.258 ms
                      
                      1  112.205.32.1.pldt.net (112.205.32.1)  500.894 ms  66.996 ms  942.851 ms
                      
                      1  112.205.32.1.pldt.net (112.205.32.1)  1.776 ms  2.692 ms  3.052 ms
                       2  122.2.187.146.static.pldt.net (122.2.187.146)  2.664 ms  1.913 ms  3.742 ms
                       3  * * *
                       4  210.213.130.103.static.pldt.net (210.213.130.103)  534.225 ms  804.088 ms  139.527 ms
                       5  72.14.195.168 (72.14.195.168)  24.507 ms  22.051 ms  23.182 ms
                       6  108.170.231.19 (108.170.231.19)  21.379 ms  21.898 ms  21.930 ms
                       7  209.85.143.37 (209.85.143.37)  29.757 ms  46.183 ms  52.547 ms
                       8  dns.google (8.8.8.8)  53.694 ms  25.930 ms  25.661 ms
                      

                      With these tests, I think we can safely say that there's something going on with pfsense that's causing the problem. The ISP is not at fault since everything is working properly when I connect the ONU directly to the desktop client, bypassing pfsense.

                      We also know that the problem isn't the AC66U switch (or any other wired/wireless client connected to it) because the problem also exists using the pfsense diagnostic tools.

                      With this troubleshooting development, do you have any ideas where to start looking into?

                      ? 1 Reply Last reply Reply Quote 0
                      • ?
                        A Former User @kevindd992002
                        last edited by

                        @kevindd992002, you can post a screenshot of your dasboard.

                        K 1 Reply Last reply Reply Quote 0
                        • stephenw10S
                          stephenw10 Netgate Administrator
                          last edited by

                          Mmm, there are only two things I'm aware of that can produce behaviour like that in pfSense.

                          1. Active traffic shaping.

                          2. A bug in 21.05 that affected the SG-3100.

                          K 2 Replies Last reply Reply Quote 0
                          • K
                            kevindd992002 @A Former User
                            last edited by kevindd992002

                            @silence said in Gateway monitor down:

                            @kevindd992002, you can post a screenshot of your dasboard.

                            Yes. Here you go:

                            04c0019d-15e1-4cd7-bfc2-f6a881f37c91-image.png

                            5cb1349d-3fe7-41a5-8afe-05ee4a017013-image.png

                            3b52caf8-64c7-48bc-bc9b-5e91070ee6d6-image.png

                            22e52b4d-0969-46c9-8d90-ee83a8bf9711-image.png

                            4dcfd61a-79e6-42eb-8415-af5c429cb5d0-image.png

                            5db2bea1-ce4f-479c-a0f6-dfe23ac4435e-image.png

                            cfb2ebc9-a090-4dc8-9138-e8609267b35f-image.png

                            H 1 Reply Last reply Reply Quote 0
                            • K
                              kevindd992002 @stephenw10
                              last edited by

                              @stephenw10 said in Gateway monitor down:

                              Mmm, there are only two things I'm aware of that can produce behaviour like that in pfSense.

                              1. Active traffic shaping.

                              2. A bug in 21.05 that affected the SG-3100.

                              I don't have traffic shaper enabled and I also don't have an SG-3100. Additionally, my pfsense box is not anywhere near "heavy load".

                              stephenw10S 1 Reply Last reply Reply Quote 0
                              • stephenw10S
                                stephenw10 Netgate Administrator @kevindd992002
                                last edited by

                                @kevindd992002 said in Gateway monitor down:

                                Additionally, my pfsense box is not anywhere near "heavy load".

                                It showing 50% loaded there without passing any significant traffic. What do the monitoring graphs show for processor loading vs latency?

                                K 1 Reply Last reply Reply Quote 0
                                • K
                                  kevindd992002 @stephenw10
                                  last edited by

                                  @stephenw10 said in Gateway monitor down:

                                  @kevindd992002 said in Gateway monitor down:

                                  Additionally, my pfsense box is not anywhere near "heavy load".

                                  It showing 50% loaded there without passing any significant traffic. What do the monitoring graphs show for processor loading vs latency?

                                  You're right, I didn't notice that in the pictures above. When I checked again just now, it's hovering between 8~11% CPU usage. This is what I see for that graph:

                                  1e8e0638-9e58-40e0-9ec8-b98a842550d4-image.png

                                  1 Reply Last reply Reply Quote 0
                                  • K
                                    kevindd992002
                                    last edited by

                                    Here's a better view:

                                    91452513-66f4-4ea5-b989-202309230b43-image.png

                                    1 Reply Last reply Reply Quote 0
                                    • stephenw10S
                                      stephenw10 Netgate Administrator
                                      last edited by

                                      Hmm, well definitely more use when the latency goes up but that could be cause or effect.

                                      This is the same system you used with the previous ISP where you didn't have an issue?

                                      Same Coreboot version? I see you have updated it several times and there have been version that adversely effected throughput.

                                      Steve

                                      K 1 Reply Last reply Reply Quote 0
                                      • K
                                        kevindd992002 @stephenw10
                                        last edited by

                                        @stephenw10 said in Gateway monitor down:

                                        Hmm, well definitely more use when the latency goes up but that could be cause or effect.

                                        This is the same system you used with the previous ISP where you didn't have an issue?

                                        Same Coreboot version? I see you have updated it several times and there have been version that adversely effected throughput.

                                        Steve

                                        Right. Yes, the same system that I used with my previous ISP that didn't have an issue. And yes, same Coreboot version. In fact, I have another pfsense box (w/ the same Coreboot version) that's still on the previous ISP and no issue.

                                        ping quality to 8.8.4.4 from "other" pfsense box:

                                        d811bdb2-8a8f-4ff1-b892-674fe7d9eb80-image.png

                                        A few hours ago from the affected pfsense box, the ping quality to 8.8.8.8 started coming back down to lower values but still a bit on the high side:

                                        5209bda8-dc8e-4b7c-b8c0-3a274639dcd4-image.png

                                        I can try updating to Coreboot to 14.5.0.1 and see if that helps.

                                        1 Reply Last reply Reply Quote 0
                                        • stephenw10S
                                          stephenw10 Netgate Administrator
                                          last edited by

                                          You could try. I have never used an APU2 so I have no reference there.

                                          1 Reply Last reply Reply Quote 0
                                          • H
                                            hda @kevindd992002
                                            last edited by

                                            @kevindd992002 why are you running the pcscd service ?
                                            It may not be helping ...

                                            K 1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post
                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.