Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    PfSense looses connection every 28-30 days.

    Scheduled Pinned Locked Moved General pfSense Questions
    41 Posts 7 Posters 6.6k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • DerelictD
      Derelict LAYER 8 Netgate
      last edited by Derelict

      That's why I said just let it run for 1000000 packets on WAN filtered on UDP port 67.

      If you want to tie up a laptop or something running wireshark that's cool too.

      Chattanooga, Tennessee, USA
      A comprehensive network diagram is worth 10,000 words and 15 conference calls.
      DO NOT set a source address/port in a port forward or firewall rule unless you KNOW you need it!
      Do Not Chat For Help! NO_WAN_EGRESS(TM)

      1 Reply Last reply Reply Quote 0
      • J
        jacksnack2
        last edited by jacksnack2

        Hello,

        It was about ~35 days this time.

        pcap uploaded filtered.pcap .

        The pfsense machine was not rebooted, only the modem.

        After a reboot, a new IP address was assigned. Connections are now normal.

        Thanks Again.

        1 Reply Last reply Reply Quote 0
        • DerelictD
          Derelict LAYER 8 Netgate
          last edited by Derelict

          Another case of the ISP device simply stopping responses to DHCP Requests and DHCP Discovers.

          It looks to me like there is another MAC address out there making DHCP requests.

          1051 2019-06-04 16:20:05.338922 0.0.0.0 68 255.255.255.255 67 DHCP 64 538 9.496981000 0x0000 (0) DHCP Discover - Transaction ID 0x4b38b221

          Ethernet II, Src: ac:ec:80:79:2e:77, Dst: ff:ff:ff:ff:ff:ff

          I'm assuming your WAN port MAC address is:

          Ethernet II, Src: 38:60:77:04:e8:2c, Dst: ac:ec:80:79:2e:75

          I do not know why what looks like the ISP modem would be making DHCP requests on that network but it seems fishy to me.

          If pfSense renews the lease every 2 hours for a month then it just stops getting a response when issuing identical requests, it is a problem with the modem, not pfSense. Look for yourself. What is the difference between the requests in Frames 1,3,5, and 7 that did receive a response vs the requests in 9,10,11,12, etc that did not?

          I would certainly put 192.168.100.1 in the Reject leases from area on the WAN configuration if it is not already there.

          I don't see that pfSense is doing anything wrong here. The modem simply stops responding, apparently.

          Chattanooga, Tennessee, USA
          A comprehensive network diagram is worth 10,000 words and 15 conference calls.
          DO NOT set a source address/port in a port forward or firewall rule unless you KNOW you need it!
          Do Not Chat For Help! NO_WAN_EGRESS(TM)

          1 Reply Last reply Reply Quote 0
          • stephenw10S
            stephenw10 Netgate Administrator
            last edited by

            Ok, so you can see in the pacp that it renews the IP s few times but then the DHCP server stops responding. it keep trying and eventually starts broadcasting for any DHCP servers. Then it looses it's own DHCP lease but keeps sending requests.

            Then after sometime the modem starts up it's dhcp server and gives the pfSense WAN a private IP.

            You probably want to prevent that happening by adding 192.168.100.1 to the 'Reject leases from' field on the WAN DHCP setup.

            However that doesn't explain why the remote dhcp server stopped responding.

            Steve

            1 Reply Last reply Reply Quote 0
            • chpalmerC
              chpalmer
              last edited by chpalmer

              @jacksnack2 said in PfSense looses connection every 28-30 days.:

              Arris TM822G

              Arris TM822G while not listed below it is a Puma 5 modem and has issues. Quite old actually if I remember right. Its a telephone modem so it would get an address of its own bridge mode or not.

              Badmodems.com

              POS.

              Replace the old sparkplugs before any further diagnosis is the norm in my book.

              Triggering snowflakes one by one..
              Intel(R) Core(TM) i5-4590T CPU @ 2.00GHz on an M400 WG box.

              1 Reply Last reply Reply Quote 0
              • J
                jacksnack2
                last edited by

                Thank you all for the quick feedback.

                I am a Linux Admin by trade, although networking does occasionally fall under my perview :>

                I blocked the modem IP as suggested. Also, I have re-enabled dpinger as this allows the router to re-obtain a lease. However the issue is the while obtaining a lease via dpinger, DNS resolution fails for internal clients. A router reboot is required.

                I don't see a solution here. But again, I do appreciate the help.

                1 Reply Last reply Reply Quote 0
                • johnpozJ
                  johnpoz LAYER 8 Global Moderator
                  last edited by

                  @jacksnack2 said in PfSense looses connection every 28-30 days.:

                  while obtaining a lease via dpinger,

                  Huh? Dpinger doesn't have anything to do with renewing a dhcp lease??

                  An intelligent man is sometimes forced to be drunk to spend time with his fools
                  If you get confused: Listen to the Music Play
                  Please don't Chat/PM me for help, unless mod related
                  SG-4860 24.11 | Lab VMs 2.8, 24.11

                  J 1 Reply Last reply Reply Quote 0
                  • J
                    jacksnack2 @johnpoz
                    last edited by

                    @johnpoz dpinger does not directly deal with leases, but it does fire actions:

                    /usr/local/sbin/pfSctl
                    -c "service reload dyndns ${GW}"
                    -c "service reload ipsecdns"
                    -c "service reload openvpn ${GW}"
                    -c "filter reload" >/dev/null 2>&1

                    I can state the when dpinger was enabled, the router held an IP address, even though DNS did not work internally.

                    This allowed me to ssh into the machine and reboot.

                    Once I disabled dpinger, no IP address existed for the WAN.

                    Are you saying this is a coincidence?

                    1 Reply Last reply Reply Quote 0
                    • johnpozJ
                      johnpoz LAYER 8 Global Moderator
                      last edited by

                      None of that would have anything to do with dhcp lease renew..

                      An intelligent man is sometimes forced to be drunk to spend time with his fools
                      If you get confused: Listen to the Music Play
                      Please don't Chat/PM me for help, unless mod related
                      SG-4860 24.11 | Lab VMs 2.8, 24.11

                      1 Reply Last reply Reply Quote 0
                      • stephenw10S
                        stephenw10 Netgate Administrator
                        last edited by

                        Indeed the dhclient is independent of dpinger. Something it triggered may have restarted the dhclient perhaps but if it was able to pull a lease it would have done so anyway.

                        You said the rebooting the modem also allowed it to come back up. I would try simply pulling the WAN cable from either the modem or pfSense and reconnecting it. Does that also bring back the connection?

                        Are you running 2.4.4p3 now? It's possible you're hitting this: https://redmine.pfsense.org/issues/9267

                        That is fixed in current 2.5 snapshots if you're able to test one.

                        Steve

                        J 1 Reply Last reply Reply Quote 0
                        • J
                          jacksnack2 @stephenw10
                          last edited by

                          @stephenw10 Bug #9267 does give me hope in a resolution.

                          I will look into this.

                          Thank You.

                          1 Reply Last reply Reply Quote 0
                          • DerelictD
                            Derelict LAYER 8 Netgate
                            last edited by

                            That redmine does not seem to match. The packet capture is pretty clear. At least it seems pretty clear to me.

                            Another test would be not rebooting anything and simply disconnecting the coax from the modem, letting it drop, and reconnecting it. That would eliminate any interface bounces unless the modem does one in that case.

                            Chattanooga, Tennessee, USA
                            A comprehensive network diagram is worth 10,000 words and 15 conference calls.
                            DO NOT set a source address/port in a port forward or firewall rule unless you KNOW you need it!
                            Do Not Chat For Help! NO_WAN_EGRESS(TM)

                            1 Reply Last reply Reply Quote 0
                            • stephenw10S
                              stephenw10 Netgate Administrator
                              last edited by

                              Yes, you're right. In the case of that bug the client stops requesting a new lease but here it clearly continues. Ignore me! 🙄

                              Steve

                              1 Reply Last reply Reply Quote 0
                              • J
                                jacksnack2
                                last edited by jacksnack2

                                https://redmine.pfsense.org/issues/9267
                                "...DHCP timeout occurs and the cached gateway address is not pingable. This results in a case where the cached IP is removed from the interface, but dhclient is informed via the exit status of 0 that the IP was added successfully. As a result, the impacted interface remains without an IPv4 address..."

                                Seems plausible this is the issue.

                                Why remove the cached IP?

                                1 Reply Last reply Reply Quote 0
                                • DerelictD
                                  Derelict LAYER 8 Netgate
                                  last edited by

                                  I have no doubt that might cause some people problems, but I don't see how it will make your modem stop responding to DHCPREQUEST/DHCPDISCOVER as it apparently does.

                                  Chattanooga, Tennessee, USA
                                  A comprehensive network diagram is worth 10,000 words and 15 conference calls.
                                  DO NOT set a source address/port in a port forward or firewall rule unless you KNOW you need it!
                                  Do Not Chat For Help! NO_WAN_EGRESS(TM)

                                  1 Reply Last reply Reply Quote 0
                                  • K
                                    Kimberly3475
                                    last edited by Kimberly3475

                                    What is happening when those ARP resolve messages start? You showed the end, what about the beginning? Is the MTU showing anything strange on the interface when it is not working?

                                    dpinger is trying to ping the gateway address but it cannot because it is not receiving an ARP response for it on WAN. Then it miraculously does for some reason. ttrockstars

                                    If it were me I'd packet capture for ARP on WAN and see what is happening. I'd just set interface WAN protocol ARP and a packet count of 100000 or 1000000 and let it run. Then get the times of the start and end of the can't allocate llinfo logs and see what's happening there in wireshark.

                                    J 1 Reply Last reply Reply Quote 0
                                    • J
                                      jacksnack2 @Kimberly3475
                                      last edited by jacksnack2

                                      @Kimberly3475 Thanks.

                                      I will most likely start a packet capture in the next several weeks, as this is the time for the next event to occur.

                                      1 Reply Last reply Reply Quote 0
                                      • DerelictD
                                        Derelict LAYER 8 Netgate
                                        last edited by Derelict

                                        I would run it on the command line and capture both DHCP and ARP.

                                        Something like this should work:

                                        1. Stop any running capture in the gui
                                        2. SSH or console in
                                        3. Menu option 8
                                        4. nohup /usr/sbin/tcpdump -i eth0 -c 1000000 -s 0 -w /root/packetcapture.cap arp or port 67 &
                                        5. exit

                                        eth0 needs to be your WAN interface (em0, igb1, etc). You can get that interface name from Status > Interfaces.

                                        You should be able to log out and the GUI should show the capture running there. Should be able to stop it and view it normally when the time comes.

                                        You might want to start one, let it soak, and stop it to see how much ARP there is out there. It might be a lot and will vary due to the design at the ISP. You might want to up that count to 10000000.

                                        Chattanooga, Tennessee, USA
                                        A comprehensive network diagram is worth 10,000 words and 15 conference calls.
                                        DO NOT set a source address/port in a port forward or firewall rule unless you KNOW you need it!
                                        Do Not Chat For Help! NO_WAN_EGRESS(TM)

                                        J 1 Reply Last reply Reply Quote 0
                                        • J
                                          jacksnack2
                                          last edited by jacksnack2

                                          @Derelict hello.

                                          I am going to do exactly that, but may from a Linux box on the network via ssh.

                                          I’ll post back when something comes up.

                                          Thanks again for all the help.

                                          1 Reply Last reply Reply Quote 0
                                          • J
                                            jacksnack2 @Derelict
                                            last edited by

                                            @Derelict Thanks again for the reply.

                                            However I think there is error in you command:

                                            what does '1.' represent?

                                            I ran this via ssh without the '1.'

                                            ARP captures are brutal...filling the logs. Already at 554684. Is there anyway to minimize the captures to something more specific?

                                            Thanks.

                                            1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post
                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.