Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    2.3.4->2.4.4 Upgrade 100% Packet Loss on WAN Interface

    Scheduled Pinned Locked Moved Problems Installing or Upgrading pfSense Software
    18 Posts 4 Posters 4.4k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • R
      rjarratt
      last edited by

      I did check for all these things before and they all seemed fine as I recall. I won't get another chance to check again until next weekend, I will report back then.

      Thanks

      Rob

      1 Reply Last reply Reply Quote 0
      • R
        rjarratt
        last edited by

        I checked the DHCP logs and the gateway IP is in the same subnet as the leased IP address.

        I do see some IPv6 errors, but I am assuming they are benign as my ISP is IPv4.

        I looked in the system logs. I am getting some errors relating to the time of day (there is an oddity with FreeBSD not seeming to get the time from Hyper-V correctly). The errors are like this one:

        rc.bootup: The command '/usr/bin/nice -n20 /usr/local/bin/rrdtool update /var/db/rrd/ipsec-packets.rrd N:U:U:U:U:U:U:U:U' returned exit code '1', the output was 'ERROR: /var/db/rrd/ipsec-packets.rrd: illegal attempt to update using time 1541839764 when last update time is 1571980920 (minimum one second step)'

        In the Gateways part of the system log I see lots of errors from dpinger, I assume they are symptoms rather than cause though. Here they are:

        send_interval 500ms loss_interval 2000ms time_period 60000ms report_interval 0ms data_len 0 alert_interval 1000ms latency_alarm 500ms loss_alarm 20% dest_addr a.b.c.1 bind_addr a.b.c.86 identifier "WAN_DHCP "

        WAN_DHCP a.b.c.1: Alarm latency 0us stddev 0us loss 100%

        johnpozJ 1 Reply Last reply Reply Quote 0
        • stephenw10S
          stephenw10 Netgate Administrator
          last edited by

          When DHCP works and nothing else does that's usually a bad firewall rule. Thouhg here that would be somewhere upstream unless you have a blocking floating OUT rule.

          Try running a packet capture on WAN. Do you see the ping packets leaving? Do you see any packets coming back?

          Steve

          1 Reply Last reply Reply Quote 1
          • R
            rjarratt
            last edited by

            I tried a packet capture before, and I tried it again today. I don't see any packets originating from computers on the LAN going out on the WAN. I do see some DNS lookups that appear to be coming from pfSense itself. I have looked at the firewall rules and there doesn't really seem to be anything that would block traffic. Looking at the firewall logs I see traffic being blocked, but it all appears to be IPv6.

            1 Reply Last reply Reply Quote 0
            • stephenw10S
              stephenw10 Netgate Administrator
              last edited by

              Are those DNS lookups actually working? Does Diag > DNS Lookup work?

              Can you packet capture the DHCP exchange?

              What is the MAC of the gateway? Maybe it's something odd. Though that should affect 2.3.4 just the same.

              Steve

              1 Reply Last reply Reply Quote 0
              • johnpozJ
                johnpoz LAYER 8 Global Moderator @rjarratt
                last edited by

                @rjarratt said in 2.3.4->2.4.4 Upgrade 100% Packet Loss on WAN Interface:

                gateway IP is in the same subnet as the leased IP address.

                And this is public IP or private IP?? This is a VM right... Are we sure interfaces are not moving about and changing order on update? If your pfsense can not talk to your gateway your going to have a problem.

                Your gateway and public IP should be the same when on 2.3.4 as it is when you upgrade... I do not see your mac changing on your vm... So if you can ping your gateway when your on 2.3.4 and not when on 2.4.4 something really odd is going on..

                An intelligent man is sometimes forced to be drunk to spend time with his fools
                If you get confused: Listen to the Music Play
                Please don't Chat/PM me for help, unless mod related
                SG-4860 24.11 | Lab VMs 2.8, 24.11

                1 Reply Last reply Reply Quote 1
                • R
                  rjarratt
                  last edited by

                  DNS lookup from the configurator does not work. I have just noticed a small difference between the old version and the new version in ifconfig

                  Here is the output on the old version (edited because my post is being marked as spam):
                  hn1:
                  ether 00:c0:df:10:58:09
                  status: active

                  And here is the output on the new version:
                  hn1: ether 00:c0:df:10:58:09
                  hwaddr 00:15:5d:00:1f:06
                  status: active

                  There is a new line for "hwaddr". I am not sure what the difference is, but the hwaddr value is the default MAC address I have set in Hyper-V, but I have also set the option to allow the Guest OS to change the MAC address. The MAC address reported by the Configurator is always the "ether" one.

                  Note that I can ping the gateway (192.168.0.1) in 2.4.4 from the LAN. The IP address I get in the ifconfig outputs above is the same in both cases.

                  I also noticed this error, is it relevant?
                  arpresolve: can't allocate llinfo for 82.28.4.1 on hn1

                  1 Reply Last reply Reply Quote 0
                  • stephenw10S
                    stephenw10 Netgate Administrator
                    last edited by

                    The arpresolve errors on WAN imply it is trying to ARP for that IP and failing. I assume 82.28.4.1 is the WAN gateway IP?

                    That does look like what you see if you hit the DHCP issue I mentioned in my first reply.
                    https://redmine.pfsense.org/issues/8507

                    I expect to see a bad MTU if you are hitting that but it's worth adding the workaround line to your DHCP options anyway:

                    In the "Lease Requirements and Requests" section for WAN DHCP in the field "Option modifiers" add the text without quotes: "supersede interface-mtu 0"
                    

                    Or trying a 2.4.5 snapshot which has that fixed already.

                    Steve

                    1 Reply Last reply Reply Quote 0
                    • R
                      rjarratt
                      last edited by

                      I had a closer look at the issue and tried it out with no luck. Before I started I did this:

                      : netstat -4rnW
                      Routing tables
                      
                      Internet:
                      Destination        Gateway            Flags       Use    Mtu      Netif Expire
                      default            82.28.4.1          UGS         234   1500        hn1
                      82.28.4.0/22       link#6             U           186   1500        hn1
                      82.28.4.86         link#6             UHS           0  16384        lo0
                      127.0.0.1          link#2             UH           31  16384        lo0
                      192.168.0.0/24     link#5             U           124   1500        hn0
                      192.168.0.1        link#5             UHS           0  16384        lo0
                      

                      I then set the advanced option anyway, released and renewed the lease and I even rebooted after changing the option, without success. The netstat looked the same as above after setting the option and rebooting etc.

                      1 Reply Last reply Reply Quote 0
                      • BabizB
                        Babiz
                        last edited by

                        I look same issue on my APU2 pfSense upgraded to lastest release and after restoring configuration/reboot , dpinger fail trough PPPoE and monitor IP 8.8.4.4.

                        So I guess some kind of messing up related pfSense internals stuff.

                        My solution is simply: Delete all gateways / reboot /re assign interfaces from cli/ssh /reboot.

                        Yeah pfSense concept is beauty but it's not fully perfect we know. 🐨

                        1 Reply Last reply Reply Quote 0
                        • stephenw10S
                          stephenw10 Netgate Administrator
                          last edited by

                          If you were hitting that DHCP issue you would not see a change in the routing table only the interface MTU.

                          The ARP resolve errors imply the gateway is not responding to ARP. Try running a packet capture on WAN to be sure it is actually sending the ARP requests and that the gateway is really not responding.

                          Steve

                          1 Reply Last reply Reply Quote 1
                          • First post
                            Last post
                          Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.