Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    2.3.4->2.4.4 Upgrade 100% Packet Loss on WAN Interface

    Scheduled Pinned Locked Moved Problems Installing or Upgrading pfSense Software
    18 Posts 4 Posters 4.2k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • R
      rjarratt
      last edited by

      I checked the DHCP logs and the gateway IP is in the same subnet as the leased IP address.

      I do see some IPv6 errors, but I am assuming they are benign as my ISP is IPv4.

      I looked in the system logs. I am getting some errors relating to the time of day (there is an oddity with FreeBSD not seeming to get the time from Hyper-V correctly). The errors are like this one:

      rc.bootup: The command '/usr/bin/nice -n20 /usr/local/bin/rrdtool update /var/db/rrd/ipsec-packets.rrd N:U:U:U:U:U:U:U:U' returned exit code '1', the output was 'ERROR: /var/db/rrd/ipsec-packets.rrd: illegal attempt to update using time 1541839764 when last update time is 1571980920 (minimum one second step)'

      In the Gateways part of the system log I see lots of errors from dpinger, I assume they are symptoms rather than cause though. Here they are:

      send_interval 500ms loss_interval 2000ms time_period 60000ms report_interval 0ms data_len 0 alert_interval 1000ms latency_alarm 500ms loss_alarm 20% dest_addr a.b.c.1 bind_addr a.b.c.86 identifier "WAN_DHCP "

      WAN_DHCP a.b.c.1: Alarm latency 0us stddev 0us loss 100%

      johnpozJ 1 Reply Last reply Reply Quote 0
      • stephenw10S
        stephenw10 Netgate Administrator
        last edited by

        When DHCP works and nothing else does that's usually a bad firewall rule. Thouhg here that would be somewhere upstream unless you have a blocking floating OUT rule.

        Try running a packet capture on WAN. Do you see the ping packets leaving? Do you see any packets coming back?

        Steve

        1 Reply Last reply Reply Quote 1
        • R
          rjarratt
          last edited by

          I tried a packet capture before, and I tried it again today. I don't see any packets originating from computers on the LAN going out on the WAN. I do see some DNS lookups that appear to be coming from pfSense itself. I have looked at the firewall rules and there doesn't really seem to be anything that would block traffic. Looking at the firewall logs I see traffic being blocked, but it all appears to be IPv6.

          1 Reply Last reply Reply Quote 0
          • stephenw10S
            stephenw10 Netgate Administrator
            last edited by

            Are those DNS lookups actually working? Does Diag > DNS Lookup work?

            Can you packet capture the DHCP exchange?

            What is the MAC of the gateway? Maybe it's something odd. Though that should affect 2.3.4 just the same.

            Steve

            1 Reply Last reply Reply Quote 0
            • johnpozJ
              johnpoz LAYER 8 Global Moderator @rjarratt
              last edited by

              @rjarratt said in 2.3.4->2.4.4 Upgrade 100% Packet Loss on WAN Interface:

              gateway IP is in the same subnet as the leased IP address.

              And this is public IP or private IP?? This is a VM right... Are we sure interfaces are not moving about and changing order on update? If your pfsense can not talk to your gateway your going to have a problem.

              Your gateway and public IP should be the same when on 2.3.4 as it is when you upgrade... I do not see your mac changing on your vm... So if you can ping your gateway when your on 2.3.4 and not when on 2.4.4 something really odd is going on..

              An intelligent man is sometimes forced to be drunk to spend time with his fools
              If you get confused: Listen to the Music Play
              Please don't Chat/PM me for help, unless mod related
              SG-4860 24.11 | Lab VMs 2.8, 24.11

              1 Reply Last reply Reply Quote 1
              • R
                rjarratt
                last edited by

                DNS lookup from the configurator does not work. I have just noticed a small difference between the old version and the new version in ifconfig

                Here is the output on the old version (edited because my post is being marked as spam):
                hn1:
                ether 00:c0:df:10:58:09
                status: active

                And here is the output on the new version:
                hn1: ether 00:c0:df:10:58:09
                hwaddr 00:15:5d:00:1f:06
                status: active

                There is a new line for "hwaddr". I am not sure what the difference is, but the hwaddr value is the default MAC address I have set in Hyper-V, but I have also set the option to allow the Guest OS to change the MAC address. The MAC address reported by the Configurator is always the "ether" one.

                Note that I can ping the gateway (192.168.0.1) in 2.4.4 from the LAN. The IP address I get in the ifconfig outputs above is the same in both cases.

                I also noticed this error, is it relevant?
                arpresolve: can't allocate llinfo for 82.28.4.1 on hn1

                1 Reply Last reply Reply Quote 0
                • stephenw10S
                  stephenw10 Netgate Administrator
                  last edited by

                  The arpresolve errors on WAN imply it is trying to ARP for that IP and failing. I assume 82.28.4.1 is the WAN gateway IP?

                  That does look like what you see if you hit the DHCP issue I mentioned in my first reply.
                  https://redmine.pfsense.org/issues/8507

                  I expect to see a bad MTU if you are hitting that but it's worth adding the workaround line to your DHCP options anyway:

                  In the "Lease Requirements and Requests" section for WAN DHCP in the field "Option modifiers" add the text without quotes: "supersede interface-mtu 0"
                  

                  Or trying a 2.4.5 snapshot which has that fixed already.

                  Steve

                  1 Reply Last reply Reply Quote 0
                  • R
                    rjarratt
                    last edited by

                    I had a closer look at the issue and tried it out with no luck. Before I started I did this:

                    : netstat -4rnW
                    Routing tables
                    
                    Internet:
                    Destination        Gateway            Flags       Use    Mtu      Netif Expire
                    default            82.28.4.1          UGS         234   1500        hn1
                    82.28.4.0/22       link#6             U           186   1500        hn1
                    82.28.4.86         link#6             UHS           0  16384        lo0
                    127.0.0.1          link#2             UH           31  16384        lo0
                    192.168.0.0/24     link#5             U           124   1500        hn0
                    192.168.0.1        link#5             UHS           0  16384        lo0
                    

                    I then set the advanced option anyway, released and renewed the lease and I even rebooted after changing the option, without success. The netstat looked the same as above after setting the option and rebooting etc.

                    1 Reply Last reply Reply Quote 0
                    • BabizB
                      Babiz
                      last edited by

                      I look same issue on my APU2 pfSense upgraded to lastest release and after restoring configuration/reboot , dpinger fail trough PPPoE and monitor IP 8.8.4.4.

                      So I guess some kind of messing up related pfSense internals stuff.

                      My solution is simply: Delete all gateways / reboot /re assign interfaces from cli/ssh /reboot.

                      Yeah pfSense concept is beauty but it's not fully perfect we know. 🐨

                      1 Reply Last reply Reply Quote 0
                      • stephenw10S
                        stephenw10 Netgate Administrator
                        last edited by

                        If you were hitting that DHCP issue you would not see a change in the routing table only the interface MTU.

                        The ARP resolve errors imply the gateway is not responding to ARP. Try running a packet capture on WAN to be sure it is actually sending the ARP requests and that the gateway is really not responding.

                        Steve

                        1 Reply Last reply Reply Quote 1
                        • First post
                          Last post
                        Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.