Netgate Discussion Forum

    Multiple network failures after dirty upgrade to 23.01

      azdeltawye @stephenw10

      @stephenw10 said in Multiple network failures after dirty upgrade to 23.01:

      Yeah that's a drive error. It looks like the eMMC.
      Steve

      Thank you Steve for the quick response!

      One thing I forgot to mention: after the upgrade, I was troubleshooting the recent NUT package problem where it would not connect to the UPS. I plugged a USB hub into one of the USB ports on the 5100, which completely took down the 5100; all LEDs went dark. I had to disconnect the power supply to recover. Thanks to running a ZFS disk layout, everything recovered... That, to me, almost seems like a power supply issue.

      Is that expected behavior?

      I didn't think a USB 3.0 hub would draw that much power...

        stephenw10 Netgate Administrator

        Hmm, yeah I would not expect to see that ever. Are you sure that USB hub is good?

          azdeltawye @stephenw10

          @stephenw10 said in Multiple network failures after dirty upgrade to 23.01:

          Hmm, yeah I would not expect to see that ever. Are you sure that USB hub is good?

          Yeah, it's brand new. I also verified that it works on my Windows machine.

          Do you think I should submit a trouble ticket?

            stephenw10 Netgate Administrator

            Well, if it works fine without that hub attached, it's hard to say it's a problem with the 5100. I assume it works with the UPS connected directly to the 5100?
            Is the hub powered? Without external power connected, it might be exceeding the current rating of the port.

            Steve
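The port-current concern above can be sketched as a simple budget check. This is only an illustration: the per-port limits are the USB 2.0 and USB 3.0 specification defaults (500 mA and 900 mA), and the device draws in the usage lines are hypothetical figures, not measurements from the 5100, the hub, or the UPS.

```python
# Default per-port current limits from the USB specifications, in mA.
PORT_LIMIT_MA = {"usb2": 500, "usb3": 900}


def over_budget(port: str, draws_ma: list[int]) -> tuple[int, bool]:
    """Return (total draw in mA, True if the total exceeds the port limit)."""
    total = sum(draws_ma)
    return total, total > PORT_LIMIT_MA[port]


# Hypothetical draws: a bus-powered hub plus one low-power device.
print(over_budget("usb3", [100, 100]))
# Hypothetical draws: the same hub plus a device that wants a full USB 2.0 port.
print(over_budget("usb2", [100, 500]))
```

A bus-powered hub shares one upstream port's budget among everything plugged into it, which is why an externally powered hub is the usual suggestion when the total is in doubt.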

              azdeltawye @stephenw10

              @stephenw10

              Yeah the 5100 worked fine with the UPS plugged into the USB port.

              The USB hub is bus-powered: it draws its power from the port, with no external supply. I don't need to use it; I was just trying it to see if it helped with the NUT package connectivity error.

              I went ahead and tested the eMMC with the command:
              mmc extcsd read /dev/mmcsd0rpmb

              The results were:

              =============================================
                Extended CSD rev 1.7 (MMC 5.0)
              =============================================
              
              Card Supported Command sets [S_CMD_SET: 0x01]
              HPI Features [HPI_FEATURE: 0x01]: implementation based on CMD13
              Background operations support [BKOPS_SUPPORT: 0x01]
              Max Packet Read Cmd [MAX_PACKED_READS: 0x00]
              Max Packet Write Cmd [MAX_PACKED_WRITES: 0x3c]
              Data TAG support [DATA_TAG_SUPPORT: 0x01]
              Data TAG Unit Size [TAG_UNIT_SIZE: 0x03]
              Tag Resources Size [TAG_RES_SIZE: 0x00]
              Context Management Capabilities [CONTEXT_CAPABILITIES: 0x05]
              Large Unit Size [LARGE_UNIT_SIZE_M1: 0x07]
              Extended partition attribute support [EXT_SUPPORT: 0x03]
              Generic CMD6 Timer [GENERIC_CMD6_TIME: 0x19]
              Power off notification [POWER_OFF_LONG_TIME: 0xff]
              Cache Size [CACHE_SIZE] is 128 KiB
              Background operations status [BKOPS_STATUS: 0x00]
              1st Initialisation Time after programmed sector [INI_TIMEOUT_AP: 0x64]
              Power class for 52MHz, DDR at 3.6V [PWR_CL_DDR_52_360: 0x00]
              Power class for 52MHz, DDR at 1.95V [PWR_CL_DDR_52_195: 0x00]
              Power class for 200MHz at 3.6V [PWR_CL_200_360: 0x00]
              Power class for 200MHz, at 1.95V [PWR_CL_200_195: 0x00]
              Minimum Performance for 8bit at 52MHz in DDR mode:
               [MIN_PERF_DDR_W_8_52: 0x00]
               [MIN_PERF_DDR_R_8_52: 0x00]
              TRIM Multiplier [TRIM_MULT: 0x11]
              Secure Feature support [SEC_FEATURE_SUPPORT: 0x55]
              Boot Information [BOOT_INFO: 0x07]
               Device supports alternative boot method
               Device supports dual data rate during boot
               Device supports high speed timing during boot
              Boot partition size [BOOT_SIZE_MULTI: 0x20]
              Access size [ACC_SIZE: 0x07]
              High-capacity erase unit size [HC_ERASE_GRP_SIZE: 0x01]
               i.e. 512 KiB
              High-capacity erase timeout [ERASE_TIMEOUT_MULT: 0x11]
              Reliable write sector count [REL_WR_SEC_C: 0x01]
              High-capacity W protect group size [HC_WP_GRP_SIZE: 0x10]
               i.e. 8192 KiB
              Sleep current (VCC) [S_C_VCC: 0x08]
              Sleep current (VCCQ) [S_C_VCCQ: 0x08]
              Sleep/awake timeout [S_A_TIMEOUT: 0x13]
              Sector Count [SEC_COUNT: 0x00e90000]
               Device is block-addressed
              Minimum Write Performance for 8bit:
               [MIN_PERF_W_8_52: 0x08]
               [MIN_PERF_R_8_52: 0x08]
               [MIN_PERF_W_8_26_4_52: 0x08]
               [MIN_PERF_R_8_26_4_52: 0x08]
              Minimum Write Performance for 4bit:
               [MIN_PERF_W_4_26: 0x08]
               [MIN_PERF_R_4_26: 0x08]
              Power classes registers:
               [PWR_CL_26_360: 0x00]
               [PWR_CL_52_360: 0x00]
               [PWR_CL_26_195: 0x00]
               [PWR_CL_52_195: 0x00]
              Partition switching timing [PARTITION_SWITCH_TIME: 0x03]
              Out-of-interrupt busy timing [OUT_OF_INTERRUPT_TIME: 0x04]
              I/O Driver Strength [DRIVER_STRENGTH: 0x1f]
              Enhanced Strobe mode [STROBE_SUPPORT: 0x00]
              Card Type [CARD_TYPE: 0x57]
               HS400 Dual Data Rate eMMC @200MHz 1.8VI/O
               HS200 Single Data Rate eMMC @200MHz 1.8VI/O
               HS Dual Data Rate eMMC @52MHz 1.8V or 3VI/O
               HS eMMC @52MHz - at rated device voltage(s)
               HS eMMC @26MHz - at rated device voltage(s)
              CSD structure version [CSD_STRUCTURE: 0x02]
              Command set [CMD_SET: 0x00]
              Command set revision [CMD_SET_REV: 0x00]
              Power class [POWER_CLASS: 0x00]
              High-speed interface timing [HS_TIMING: 0x01]
              Erased memory content [ERASED_MEM_CONT: 0x00]
              Boot configuration bytes [PARTITION_CONFIG: 0x03]
               Not boot enable
               R/W Replay Protected Memory Block (RPMB)
              Boot config protection [BOOT_CONFIG_PROT: 0x00]
              Boot bus Conditions [BOOT_BUS_CONDITIONS: 0x00]
              High-density erase group definition [ERASE_GROUP_DEF: 0x01]
              Boot write protection status registers [BOOT_WP_STATUS]: 0x00
              Boot Area Write protection [BOOT_WP]: 0x00
               Power ro locking: possible
               Permanent ro locking: possible
               partition 0 ro lock status: not locked
               partition 1 ro lock status: not locked
              User area write protection register [USER_WP]: 0x00
              FW configuration [FW_CONFIG]: 0x00
              RPMB Size [RPMB_SIZE_MULT]: 0x20
              Write reliability setting register [WR_REL_SET]: 0x1f
               user area: the device protects existing data if a power failure occurs during a write operation
               partition 1: the device protects existing data if a power failure occurs during a write operation
               partition 2: the device protects existing data if a power failure occurs during a write operation
               partition 3: the device protects existing data if a power failure occurs during a write operation
               partition 4: the device protects existing data if a power failure occurs during a write operation
              Write reliability parameter register [WR_REL_PARAM]: 0x04
               Device supports the enhanced def. of reliable write
              Enable background operations handshake [BKOPS_EN]: 0x00
              H/W reset function [RST_N_FUNCTION]: 0x00
              HPI management [HPI_MGMT]: 0x00
              Partitioning Support [PARTITIONING_SUPPORT]: 0x07
               Device support partitioning feature
               Device can have enhanced tech.
              Max Enhanced Area Size [MAX_ENH_SIZE_MULT]: 0x0001d2
               i.e. 3817472 KiB
              Partitions attribute [PARTITIONS_ATTRIBUTE]: 0x00
              Partitioning Setting [PARTITION_SETTING_COMPLETED]: 0x00
               Device partition setting NOT complete
              General Purpose Partition Size
               [GP_SIZE_MULT_4]: 0x000000
               [GP_SIZE_MULT_3]: 0x000000
               [GP_SIZE_MULT_2]: 0x000000
               [GP_SIZE_MULT_1]: 0x000000
              Enhanced User Data Area Size [ENH_SIZE_MULT]: 0x000000
               i.e. 0 KiB
              Enhanced User Data Start Address [ENH_START_ADDR]: 0x00000000
               i.e. 0 bytes offset
              Bad Block Management mode [SEC_BAD_BLK_MGMNT]: 0x00
              Periodic Wake-up [PERIODIC_WAKEUP]: 0x00
              Program CID/CSD in DDR mode support [PROGRAM_CID_CSD_DDR_SUPPORT]: 0x01
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[127]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[126]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[125]]: 0x20
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[124]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[123]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[122]]: 0x20
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[121]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[120]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[119]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[118]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[117]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[116]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[115]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[114]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[113]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[112]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[111]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[110]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[109]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[108]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[107]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[106]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[105]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[104]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[103]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[102]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[101]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[100]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[99]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[98]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[97]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[96]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[95]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[94]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[93]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[92]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[91]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[90]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[89]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[88]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[87]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[86]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[85]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[84]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[83]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[82]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[81]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[80]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[79]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[78]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[77]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[76]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[75]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[74]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[73]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[72]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[71]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[70]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[69]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[68]]: 0x01
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[67]]: 0x00
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[66]]: 0x07
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[65]]: 0xa9
              Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[64]]: 0x03
              Native sector size [NATIVE_SECTOR_SIZE]: 0x00
              Sector size emulation [USE_NATIVE_SECTOR]: 0x00
              Sector size [DATA_SECTOR_SIZE]: 0x00
              1st initialization after disabling sector size emulation [INI_TIMEOUT_EMU]: 0x00
              Class 6 commands control [CLASS_6_CTRL]: 0x00
              Number of addressed group to be Released[DYNCAP_NEEDED]: 0x00
              Exception events control [EXCEPTION_EVENTS_CTRL]: 0x0000
              Exception events status[EXCEPTION_EVENTS_STATUS]: 0x0000
              Extended Partitions Attribute [EXT_PARTITIONS_ATTRIBUTE]: 0x0000
              Context configuration [CONTEXT_CONF[51]]: 0x00
              Context configuration [CONTEXT_CONF[50]]: 0x00
              Context configuration [CONTEXT_CONF[49]]: 0x00
              Context configuration [CONTEXT_CONF[48]]: 0x00
              Context configuration [CONTEXT_CONF[47]]: 0x00
              Context configuration [CONTEXT_CONF[46]]: 0x00
              Context configuration [CONTEXT_CONF[45]]: 0x00
              Context configuration [CONTEXT_CONF[44]]: 0x00
              Context configuration [CONTEXT_CONF[43]]: 0x00
              Context configuration [CONTEXT_CONF[42]]: 0x00
              Context configuration [CONTEXT_CONF[41]]: 0x00
              Context configuration [CONTEXT_CONF[40]]: 0x00
              Context configuration [CONTEXT_CONF[39]]: 0x00
              Context configuration [CONTEXT_CONF[38]]: 0x00
              Context configuration [CONTEXT_CONF[37]]: 0x00
              Packed command status [PACKED_COMMAND_STATUS]: 0x00
              Packed command failure index [PACKED_FAILURE_INDEX]: 0x00
              Power Off Notification [POWER_OFF_NOTIFICATION]: 0x00
              Control to turn the Cache ON/OFF [CACHE_CTRL]: 0x01
              eMMC Firmware Version: R
              eMMC Life Time Estimation A [EXT_CSD_DEVICE_LIFE_TIME_EST_TYP_A]: 0x0b
              eMMC Life Time Estimation B [EXT_CSD_DEVICE_LIFE_TIME_EST_TYP_B]: 0x0b
              eMMC Pre EOL information [EXT_CSD_PRE_EOL_INFO]: 0x01
              Secure Removal Type [SECURE_REMOVAL_TYPE]: 0x01
               information is configured to be removed by an erase of the physical memory
               Supported Secure Removal Type:
                information removed by an erase of the physical memory
              

              So yeah, this eMMC is EOL: both life time estimates read 0x0b, which means the device has exceeded its estimated life.
              Ironically, running the memory test caused it to crash.
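For anyone else reading a dump like the one above, here is a minimal Python sketch that pulls out the three wear-related fields and translates them. The value meanings follow the JEDEC eMMC 5.x definitions as I understand them (0x01 to 0x0a are 10% steps of life used, 0x0b is past estimated life; pre-EOL 0x01 is normal). The function names and the `SAMPLE` text are just for illustration; the sample lines are copied from the output above.

```python
import re

# Pre-EOL values: how much of the reserved (spare) block pool is consumed.
PRE_EOL = {
    0x01: "normal",
    0x02: "warning (about 80% of reserved blocks consumed)",
    0x03: "urgent",
}

# Matches the three wear-related lines printed by `mmc extcsd read`.
FIELD_RE = re.compile(
    r"\[EXT_CSD_(DEVICE_LIFE_TIME_EST_TYP_[AB]|PRE_EOL_INFO)\]:\s*(0x[0-9a-fA-F]+)"
)


def describe_life_time(value: int) -> str:
    """Translate a DEVICE_LIFE_TIME_EST_TYP_A/B value into words."""
    if 0x01 <= value <= 0x0A:
        return f"{(value - 1) * 10}-{value * 10}% of estimated life used"
    if value == 0x0B:
        return "exceeded maximum estimated life"
    return "not defined / reserved"


def summarize(extcsd_text: str) -> dict:
    """Return {field name: human-readable meaning} for the wear fields found."""
    result = {}
    for name, raw in FIELD_RE.findall(extcsd_text):
        value = int(raw, 16)
        if name == "PRE_EOL_INFO":
            result[name] = PRE_EOL.get(value, "not defined / reserved")
        else:
            result[name] = describe_life_time(value)
    return result


SAMPLE = """\
eMMC Life Time Estimation A [EXT_CSD_DEVICE_LIFE_TIME_EST_TYP_A]: 0x0b
eMMC Life Time Estimation B [EXT_CSD_DEVICE_LIFE_TIME_EST_TYP_B]: 0x0b
eMMC Pre EOL information [EXT_CSD_PRE_EOL_INFO]: 0x01
"""

for field, meaning in summarize(SAMPLE).items():
    print(f"{field}: {meaning}")
```

Note that the pre-EOL field can still read "normal" while the life time estimates are maxed out, as in this dump, so it is worth looking at all three values together.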

                stephenw10 Netgate Administrator @azdeltawye

                @azdeltawye said in Multiple network failures after dirty upgrade to 23.01:

                eMMC Life Time Estimation B [EXT_CSD_DEVICE_LIFE_TIME_EST_TYP_B]: 0x0b

                Yes, I would get an SSD in there ASAP.

                  PacketSniffer @azdeltawye

                  @azdeltawye I have a Netgate 2100 and I'm experiencing issues similar to yours. DHCP seems to fail, I can't log into the firewall, and the console is unresponsive. After a very long time, the console spat out the following: "Solaris: WARNING: Pool 'pfSense' has encountered an uncorrectable I/O failure and has been suspended.", which is how I found your post. This all happened after the 23.01 update. For my unit, a reboot doesn't fix the issue; a reflash of the firmware and a reload of the config gets the firewall going for another 2-3 days. I've opened a ticket with Netgate to troubleshoot this issue and they basically said that the internal eMMC is dying. I provided these logs from when even flashing the firmware was giving me a hard time:

                  (100s of these lines)
                  mmcsd0: failed to flush cache
                  mmcsd0: failed to flush cache
                  mmcsd0: failed to flush cache
                  mmcsd0: failed to flush cache
                  mmcsd0: failed to flush cache
                  mmcsd0: failed to flush cache
                  mmcsd0: failed to flush cache

                  Loader variables:
                  vfs.root.mountfrom=zfs:pfSense/ROOT/default

                  Manual root filesystem specification:
                  <fstype>:<device> [options]
                  Mount <device> using filesystem <fstype>
                  and with the specified (optional) option list.

                  eg. ufs:/dev/da0s1a
                  zfs:zroot/ROOT/default
                  cd9660:/dev/cd0 ro
                  (which is equivalent to: mount -t cd9660 -o ro /dev/cd0 /)

                  ? List valid disk boot devices
                  . Yield 1 second (for background tasks)
                  <empty line> Abort manual input

                  mountroot>

                  So yeah, I guess this is confirmation that I need to get a new SSD in there. Not sure why this only happened after 23.01 tho.
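If you have a saved console capture and want a quick count of these symptoms, something like the following Python sketch would do it. The three signature strings are taken from the messages quoted in this post; the `SIGNATURES` keys, the function name, and the `SAMPLE` text are made up for illustration, and this is not a Netgate tool.

```python
from collections import Counter

# Substrings to look for, copied from the console messages quoted above.
SIGNATURES = {
    "emmc_flush": "mmcsd0: failed to flush cache",
    "zfs_suspended": "has encountered an uncorrectable I/O failure and has been suspended",
    "mountroot_prompt": "mountroot>",
}


def count_signatures(log_text: str) -> Counter:
    """Count how many log lines contain each known failure signature."""
    counts = Counter()
    for line in log_text.splitlines():
        for key, needle in SIGNATURES.items():
            if needle in line:
                counts[key] += 1
    return counts


SAMPLE = """\
mmcsd0: failed to flush cache
mmcsd0: failed to flush cache
Solaris: WARNING: Pool 'pfSense' has encountered an uncorrectable I/O failure and has been suspended.
mountroot>
"""

counts = count_signatures(SAMPLE)
print(dict(counts))
```

Repeated flush failures followed by a suspended pool or a mountroot prompt all point at the storage device rather than at the pfSense configuration.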

                    azdeltawye @PacketSniffer

                    @packetsniffer
                    Yeah, my Netgate appliance eventually became unreachable via both the RJ45 ports and the serial console port. I added the approved SATA SSD in an attempt to recover the unit, but that did not help. The unit was toast...

                    Takeaway lesson: Next Netgate appliance purchase - pony up the extra $$ for the MAX version. Preferably something without eMMC!

                      SteveITS Rebel Alliance @PacketSniffer

                      @packetsniffer Most likely because an update writes a lot to disk. For the record there is this:
                      https://docs.netgate.com/pfsense/en/latest/troubleshooting/disk-lifetime.html
                      and RAM disks can reduce disk writes: https://docs.netgate.com/pfsense/en/latest/troubleshooting/disk-writes.html

                      https://www.netgate.com/supported-pfsense-plus-packages has remarks on whether an SSD is recommended for certain packages that can have intense logging depending on configuration.

                      Only install packages for your version, or risk breaking it. Select your branch in System/Update/Update Settings.
                      When upgrading, allow 10-15 minutes to reboot, or more depending on packages, and device or disk speed.
                      Upvote 👍 helpful posts!

                        PacketSniffer @SteveITS

                        @SteveITS Thank you.

                        I followed a few links to test the onboard eMMC, and it turns out mine was pretty dead.

                        I threw a new Transcend 512GB (TS512GMTS430S) in the Netgate 2100, flashed 23.01 on there, restored my config, and I've been solid for 2+ weeks.

                        I will be looking into cleaning up my logging to reduce wear and tear.

                        Thanks everyone!

                        Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.