Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    pfSense internet connection fails and needs manual restart of server.

    Scheduled Pinned Locked Moved General pfSense Questions
    12 Posts 2 Posters 677 Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • O
      ojosaghae @stephenw10
      last edited by

      @stephenw10 Upload done.
      Thanks for your kind assistance.

      1 Reply Last reply Reply Quote 0
      • stephenw10S
        stephenw10 Netgate Administrator
        last edited by

        Those are very different crashes.

        #1 Backtrace:

        db:0:kdb.enter.default>  bt
        Tracing pid 12 tid 100059 td 0xfffffe006425aac0
        kdb_enter() at kdb_enter+0x32/frame 0xfffffe006445b5a0
        vpanic() at vpanic+0x183/frame 0xfffffe006445b5f0
        panic() at panic+0x43/frame 0xfffffe006445b650
        trap_fatal() at trap_fatal+0x409/frame 0xfffffe006445b6b0
        trap_pfault() at trap_pfault+0x4f/frame 0xfffffe006445b710
        calltrap() at calltrap+0x8/frame 0xfffffe006445b710
        --- trap 0xc, rip = 0xffffffff80cd1c7d, rsp = 0xfffffe006445b7e0, rbp = 0xfffffe006445b860 ---
        __mtx_lock_sleep() at __mtx_lock_sleep+0xcd/frame 0xfffffe006445b860
        pf_find_state() at pf_find_state+0x1dd/frame 0xfffffe006445b8b0
        pf_test_state_tcp() at pf_test_state_tcp+0x1cc/frame 0xfffffe006445ba10
        pf_test() at pf_test+0x1102/frame 0xfffffe006445bb90
        pf_check_out() at pf_check_out+0x22/frame 0xfffffe006445bbb0
        pfil_mbuf_out() at pfil_mbuf_out+0x35/frame 0xfffffe006445bbe0
        ip_output() at ip_output+0xc3e/frame 0xfffffe006445bce0
        ip_forward() at ip_forward+0x3d5/frame 0xfffffe006445bd90
        ip_input() at ip_input+0x686/frame 0xfffffe006445bdf0
        swi_net() at swi_net+0x138/frame 0xfffffe006445be60
        ithread_loop() at ithread_loop+0x257/frame 0xfffffe006445bef0
        fork_exit() at fork_exit+0x7d/frame 0xfffffe006445bf30
        fork_trampoline() at fork_trampoline+0xe/frame 0xfffffe006445bf30
        --- trap 0xa5a5a5a5, rip = 0xa5a5a5a5a5a5a5a5, rsp = 0xa5a5a5a5a5a5a5a5, rbp = 0xa5a5a5a5a5a5a5a5 ---
        

        #1 Panic:

        Fatal trap 12: page fault while in kernel mode
        cpuid = 7; apic id = 07
        fault virtual address	= 0xbb1e
        fault code		= supervisor read data, page not present
        instruction pointer	= 0x20:0xffffffff80fd95f5
        stack pointer	        = 0x28:0xfffffe0064474720
        frame pointer	        = 0x28:0xfffffe0064474800
        code segment		= base 0x0, limit 0xfffff, type 0x1b
        			= DPL 0, pres 1, long 1, def32 0, gran 1
        processor eflags	= interrupt enabled, resume, IOPL = 0
        current process		= 12 (swi1: netisr 5)
        rdi: fffff8012464b300 rsi:               14 rdx: fffffe00644749e0
        rcx:             bb00  r8: fffff8012464b36e  r9: fffff8016afcc4e0
        rax:               40 rbx: fffff8016afcc540 rbp: fffffe0064474800
        
        
        Fatal trap 12: page fault while in kernel mode
        cpuid = 2; apic id = 02
        fault virtual address	= 0xef50
        fault code		= supervisor read data, page not present
        instruction pointer	= 0x20:0xffffffff80cd1c7d
        stack pointer	        = 0x0:0xfffffe006445b7e0
        frame pointer	        = 0x0:0xfffffe006445b860
        code segment		= base 0x0, limit 0xfffff, type 0x1b
        			= DPL 0, pres 1, long 1, def32 0, gran 1
        processor eflags	= interrupt enabled, resume, IOPL = 0
        current process		= 12 (swi1: netisr 2)
        rdi: fffffe006f4d9d70 rsi:             eb00 rdx:            50faa
        rcx: fffffe006f252000  r8:                a  r9:              602
        rax:                0 rbx: fffffe006425aac0 rbp: fffffe006445b860
        r10:          5ce37dd r11:                0 r12: fffffe006445b800
        r13: fffffe006f4d9d70 r14:             eb00 r15:                0
        trap number		= 12
        panic: page fault
        cpuid = 2
        time = 1695389621
        KDB: enter: panic
        

        #2 Backtrace:

        db:0:kdb.enter.default>  bt
        Tracing pid 0 tid 100201 td 0xfffffe0070404720
        kdb_enter() at kdb_enter+0x32/frame 0xfffffe000ab2cdb0
        vpanic() at vpanic+0x183/frame 0xfffffe000ab2ce00
        panic() at panic+0x43/frame 0xfffffe000ab2ce60
        dblfault_handler() at dblfault_handler+0x1ce/frame 0xfffffe000ab2cf20
        Xdblfault() at Xdblfault+0xd7/frame 0xfffffe000ab2cf20
        --- trap 0x17, rip = 0xffffffff8126d51f, rsp = 0xfffffe00a3343688, rbp = 0xfffffe00a3349220 ---
        done_load_dr() at done_load_dr+0x1f/frame 0xfffffe00a3349220
        

        #2 Panic:

        <118>Mounting ZFS boot environment... done.
        <118><jemalloc>: jemalloc_extent.c:1195: Failed assertion: "p[i] == 0"
        
        
        
        
        Fatal trap 12: page fault while in kernel mode
        
        Fatal trap 12: page fault while in kernel mode
        cpuid = 2; apic id = 02
        cpuid = 5; apic id = 05
        fault virtual address	= 0x78
        fault code		= supervisor read data, page not present
        instruction pointer	= 0x20:0xffffffff8362ca55
        stack pointer	        = 0x28:0xfffffe00a354a140
        Fatal double fault
        rip 0xffffffff8126d51f rsp 0xfffffe00a3343688 rbp 0xfffffe00a3349220
        frame pointer	        = 0x28:0xfffffe00a354a160
        code segment		= base 0x0, limit 0xfffff, type 0x1b
        rax 0xffffffff80d2f2e1 rdx 0xffffffff84016384 rbx 0xfffffe000aafd1c0
        rcx 0xfffffe000ab300c0 rsi 0xfffffe0070404720 rdi 0xffffffff8303ef30
        fault virtual address	= 0xfffffb2d37dc8a78
        r8 0xfffffe0070404c40 r9 0xfffffe00a3348000 r10 0
        r11 0x7ff705c8 r12 0xfffffe0070404e20 r13 0xffffffff8309d730
        r14 0xfffffe000adb5660 r15 0xffffffff83092d88 rflags 0x10046
        cs 0x20 ss 0x28 ds 0x3b es 0x3b fs 0x13 gs 0x1b
        fsbase 0 gsbase 0xffffffff84016000 kgsbase 0
        cpuid = 6; apic id = 06
        panic: double fault
        cpuid = 6
        time = 1695448028
        KDB: enter: panic
        

        I'd guess that the second crash is a result of the first one since it crashed before it finished booting, just after mounting the filesystem.

        Is this the first time that has happened?

        O 1 Reply Last reply Reply Quote 0
        • O
          ojosaghae @stephenw10
          last edited by

          @stephenw10 Yes it is.
          This was in relation to this issue. I decided to just do a clean install, so I downloaded the image from Netgate and set up a new machine, then restored the backup I did from the old machine. It worked cleanly without any errors. And then after some days, this came up.

          1 Reply Last reply Reply Quote 0
          • stephenw10S
            stephenw10 Netgate Administrator
            last edited by

            Any idea what might have been happening when it hit that? Any unusual traffic?

            O 1 Reply Last reply Reply Quote 0
            • O
              ojosaghae @stephenw10
              last edited by

              @stephenw10 There was something I noticed.
              I suddenly noticed snort was throwing up some alerts - Potentially Bad Traffic and Generic Protocol Command Decode. I uploaded a couple of sample screenshots..

              Screenshot 2023-09-30 at 20.26.29.png Screenshot 2023-09-30 at 20.25.43.png

              1 Reply Last reply Reply Quote 0
              • stephenw10S
                stephenw10 Netgate Administrator
                last edited by

                I ran it past one of our developers who said something unusual must have happened to reach that because it shouldn't be possible. That was just his first comment, hopefully there will be more when he has time to review it properly.

                Steve

                O 1 Reply Last reply Reply Quote 0
                • O
                  ojosaghae @stephenw10
                  last edited by

                  @stephenw10 Thank you so much.
                  I look forward to the feedback from you (and your developer team) ๐Ÿ’

                  1 Reply Last reply Reply Quote 0
                  • stephenw10S
                    stephenw10 Netgate Administrator
                    last edited by

                    Ok that second crash looks like it could be a hardware issue which means the first crash could also be. If you haven't seen this before or since that might also indicate hardware. Are you able to run a memtest on that hardware?

                    O 1 Reply Last reply Reply Quote 0
                    • O
                      ojosaghae @stephenw10
                      last edited by

                      @stephenw10 Hi and apologies for the delay in replying.
                      I ran memtest on the server - more than once. I ran the basic, and then the advanced (which ran for hours). There was no error at all.
                      photo_2023-10-18 15.35.06.jpeg
                      Any update concerning the first crash please ๐Ÿ™ ?

                      1 Reply Last reply Reply Quote 0
                      • stephenw10S
                        stephenw10 Netgate Administrator
                        last edited by

                        No updates I'm afraid. It still looks like harwdare, though clearly not RAM from that test.

                        1 Reply Last reply Reply Quote 0
                        • First post
                          Last post
                        Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.