Netgate Discussion Forum

    Multiple network failures after dirty upgrade to 23.01

    General pfSense Questions
      stephenw10 Netgate Administrator

      Yeah that's a drive error. It looks like the eMMC.

      You could fit an m.2 SSD and reinstall to it instead:
      https://docs.netgate.com/pfsense/en/latest/solutions/sg-5100/m-2-sata-installation.html

      Steve

        azdeltawye @stephenw10

        @stephenw10 said in Multiple network failures after dirty upgrade to 23.01:

        Yeah that's a drive error. It looks like the eMMC.
        Steve

        Thank you Steve for the quick response!

        One thing I forgot to mention: after the upgrade, I was troubleshooting the recent NUT package problem where it would not connect to the UPS. I plugged a USB hub into one of the USB ports on the 5100, which completely took down the 5100; all LEDs went dark. I had to disconnect the power supply to recover. Thanks to running a ZFS disk structure, everything recovered... That, to me, almost seems like a power supply issue.

        Is that expected behavior?

        I didn't think a USB 3.0 hub would draw that much power...

          stephenw10 Netgate Administrator

          Hmm, yeah I would not expect to see that ever. Are you sure that USB hub is good?

            azdeltawye @stephenw10

            @stephenw10 said in Multiple network failures after dirty upgrade to 23.01:

            Hmm, yeah I would not expect to see that ever. Are you sure that USB hub is good?

            Yeah, it's brand new. I also verified that it works on my Windows machine.

            Do you think I should submit a trouble ticket?

              stephenw10 Netgate Administrator

              Well, if it works fine without that hub attached, it's hard to say it's a problem with the 5100. I assume it works with the UPS connected directly to the 5100?
              Is the hub powered? It might be exceeding the current rating of the port without external power connected.

              Steve

                azdeltawye @stephenw10

                @stephenw10

                Yeah, the 5100 worked fine with the UPS plugged directly into the USB port.

                Yes, this USB hub is powered from the port; it has no external power supply. I don't need to use it. I was just trying it to see if it helped with the NUT pkg connectivity error.

                I went ahead and tested the eMMC with the command:
                mmc extcsd read /dev/mmcsd0rpmb

                The results were:

                =============================================
                  Extended CSD rev 1.7 (MMC 5.0)
                =============================================
                
                Card Supported Command sets [S_CMD_SET: 0x01]
                HPI Features [HPI_FEATURE: 0x01]: implementation based on CMD13
                Background operations support [BKOPS_SUPPORT: 0x01]
                Max Packet Read Cmd [MAX_PACKED_READS: 0x00]
                Max Packet Write Cmd [MAX_PACKED_WRITES: 0x3c]
                Data TAG support [DATA_TAG_SUPPORT: 0x01]
                Data TAG Unit Size [TAG_UNIT_SIZE: 0x03]
                Tag Resources Size [TAG_RES_SIZE: 0x00]
                Context Management Capabilities [CONTEXT_CAPABILITIES: 0x05]
                Large Unit Size [LARGE_UNIT_SIZE_M1: 0x07]
                Extended partition attribute support [EXT_SUPPORT: 0x03]
                Generic CMD6 Timer [GENERIC_CMD6_TIME: 0x19]
                Power off notification [POWER_OFF_LONG_TIME: 0xff]
                Cache Size [CACHE_SIZE] is 128 KiB
                Background operations status [BKOPS_STATUS: 0x00]
                1st Initialisation Time after programmed sector [INI_TIMEOUT_AP: 0x64]
                Power class for 52MHz, DDR at 3.6V [PWR_CL_DDR_52_360: 0x00]
                Power class for 52MHz, DDR at 1.95V [PWR_CL_DDR_52_195: 0x00]
                Power class for 200MHz at 3.6V [PWR_CL_200_360: 0x00]
                Power class for 200MHz, at 1.95V [PWR_CL_200_195: 0x00]
                Minimum Performance for 8bit at 52MHz in DDR mode:
                 [MIN_PERF_DDR_W_8_52: 0x00]
                 [MIN_PERF_DDR_R_8_52: 0x00]
                TRIM Multiplier [TRIM_MULT: 0x11]
                Secure Feature support [SEC_FEATURE_SUPPORT: 0x55]
                Boot Information [BOOT_INFO: 0x07]
                 Device supports alternative boot method
                 Device supports dual data rate during boot
                 Device supports high speed timing during boot
                Boot partition size [BOOT_SIZE_MULTI: 0x20]
                Access size [ACC_SIZE: 0x07]
                High-capacity erase unit size [HC_ERASE_GRP_SIZE: 0x01]
                 i.e. 512 KiB
                High-capacity erase timeout [ERASE_TIMEOUT_MULT: 0x11]
                Reliable write sector count [REL_WR_SEC_C: 0x01]
                High-capacity W protect group size [HC_WP_GRP_SIZE: 0x10]
                 i.e. 8192 KiB
                Sleep current (VCC) [S_C_VCC: 0x08]
                Sleep current (VCCQ) [S_C_VCCQ: 0x08]
                Sleep/awake timeout [S_A_TIMEOUT: 0x13]
                Sector Count [SEC_COUNT: 0x00e90000]
                 Device is block-addressed
                Minimum Write Performance for 8bit:
                 [MIN_PERF_W_8_52: 0x08]
                 [MIN_PERF_R_8_52: 0x08]
                 [MIN_PERF_W_8_26_4_52: 0x08]
                 [MIN_PERF_R_8_26_4_52: 0x08]
                Minimum Write Performance for 4bit:
                 [MIN_PERF_W_4_26: 0x08]
                 [MIN_PERF_R_4_26: 0x08]
                Power classes registers:
                 [PWR_CL_26_360: 0x00]
                 [PWR_CL_52_360: 0x00]
                 [PWR_CL_26_195: 0x00]
                 [PWR_CL_52_195: 0x00]
                Partition switching timing [PARTITION_SWITCH_TIME: 0x03]
                Out-of-interrupt busy timing [OUT_OF_INTERRUPT_TIME: 0x04]
                I/O Driver Strength [DRIVER_STRENGTH: 0x1f]
                Enhanced Strobe mode [STROBE_SUPPORT: 0x00]
                Card Type [CARD_TYPE: 0x57]
                 HS400 Dual Data Rate eMMC @200MHz 1.8VI/O
                 HS200 Single Data Rate eMMC @200MHz 1.8VI/O
                 HS Dual Data Rate eMMC @52MHz 1.8V or 3VI/O
                 HS eMMC @52MHz - at rated device voltage(s)
                 HS eMMC @26MHz - at rated device voltage(s)
                CSD structure version [CSD_STRUCTURE: 0x02]
                Command set [CMD_SET: 0x00]
                Command set revision [CMD_SET_REV: 0x00]
                Power class [POWER_CLASS: 0x00]
                High-speed interface timing [HS_TIMING: 0x01]
                Erased memory content [ERASED_MEM_CONT: 0x00]
                Boot configuration bytes [PARTITION_CONFIG: 0x03]
                 Not boot enable
                 R/W Replay Protected Memory Block (RPMB)
                Boot config protection [BOOT_CONFIG_PROT: 0x00]
                Boot bus Conditions [BOOT_BUS_CONDITIONS: 0x00]
                High-density erase group definition [ERASE_GROUP_DEF: 0x01]
                Boot write protection status registers [BOOT_WP_STATUS]: 0x00
                Boot Area Write protection [BOOT_WP]: 0x00
                 Power ro locking: possible
                 Permanent ro locking: possible
                 partition 0 ro lock status: not locked
                 partition 1 ro lock status: not locked
                User area write protection register [USER_WP]: 0x00
                FW configuration [FW_CONFIG]: 0x00
                RPMB Size [RPMB_SIZE_MULT]: 0x20
                Write reliability setting register [WR_REL_SET]: 0x1f
                 user area: the device protects existing data if a power failure occurs during a write operation
                 partition 1: the device protects existing data if a power failure occurs during a write operation
                 partition 2: the device protects existing data if a power failure occurs during a write operation
                 partition 3: the device protects existing data if a power failure occurs during a write operation
                 partition 4: the device protects existing data if a power failure occurs during a write operation
                Write reliability parameter register [WR_REL_PARAM]: 0x04
                 Device supports the enhanced def. of reliable write
                Enable background operations handshake [BKOPS_EN]: 0x00
                H/W reset function [RST_N_FUNCTION]: 0x00
                HPI management [HPI_MGMT]: 0x00
                Partitioning Support [PARTITIONING_SUPPORT]: 0x07
                 Device support partitioning feature
                 Device can have enhanced tech.
                Max Enhanced Area Size [MAX_ENH_SIZE_MULT]: 0x0001d2
                 i.e. 3817472 KiB
                Partitions attribute [PARTITIONS_ATTRIBUTE]: 0x00
                Partitioning Setting [PARTITION_SETTING_COMPLETED]: 0x00
                 Device partition setting NOT complete
                General Purpose Partition Size
                 [GP_SIZE_MULT_4]: 0x000000
                 [GP_SIZE_MULT_3]: 0x000000
                 [GP_SIZE_MULT_2]: 0x000000
                 [GP_SIZE_MULT_1]: 0x000000
                Enhanced User Data Area Size [ENH_SIZE_MULT]: 0x000000
                 i.e. 0 KiB
                Enhanced User Data Start Address [ENH_START_ADDR]: 0x00000000
                 i.e. 0 bytes offset
                Bad Block Management mode [SEC_BAD_BLK_MGMNT]: 0x00
                Periodic Wake-up [PERIODIC_WAKEUP]: 0x00
                Program CID/CSD in DDR mode support [PROGRAM_CID_CSD_DDR_SUPPORT]: 0x01
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[127]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[126]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[125]]: 0x20
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[124]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[123]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[122]]: 0x20
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[121]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[120]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[119]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[118]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[117]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[116]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[115]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[114]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[113]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[112]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[111]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[110]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[109]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[108]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[107]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[106]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[105]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[104]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[103]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[102]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[101]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[100]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[99]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[98]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[97]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[96]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[95]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[94]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[93]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[92]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[91]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[90]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[89]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[88]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[87]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[86]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[85]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[84]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[83]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[82]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[81]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[80]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[79]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[78]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[77]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[76]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[75]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[74]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[73]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[72]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[71]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[70]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[69]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[68]]: 0x01
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[67]]: 0x00
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[66]]: 0x07
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[65]]: 0xa9
                Vendor Specific Fields [VENDOR_SPECIFIC_FIELD[64]]: 0x03
                Native sector size [NATIVE_SECTOR_SIZE]: 0x00
                Sector size emulation [USE_NATIVE_SECTOR]: 0x00
                Sector size [DATA_SECTOR_SIZE]: 0x00
                1st initialization after disabling sector size emulation [INI_TIMEOUT_EMU]: 0x00
                Class 6 commands control [CLASS_6_CTRL]: 0x00
                Number of addressed group to be Released[DYNCAP_NEEDED]: 0x00
                Exception events control [EXCEPTION_EVENTS_CTRL]: 0x0000
                Exception events status[EXCEPTION_EVENTS_STATUS]: 0x0000
                Extended Partitions Attribute [EXT_PARTITIONS_ATTRIBUTE]: 0x0000
                Context configuration [CONTEXT_CONF[51]]: 0x00
                Context configuration [CONTEXT_CONF[50]]: 0x00
                Context configuration [CONTEXT_CONF[49]]: 0x00
                Context configuration [CONTEXT_CONF[48]]: 0x00
                Context configuration [CONTEXT_CONF[47]]: 0x00
                Context configuration [CONTEXT_CONF[46]]: 0x00
                Context configuration [CONTEXT_CONF[45]]: 0x00
                Context configuration [CONTEXT_CONF[44]]: 0x00
                Context configuration [CONTEXT_CONF[43]]: 0x00
                Context configuration [CONTEXT_CONF[42]]: 0x00
                Context configuration [CONTEXT_CONF[41]]: 0x00
                Context configuration [CONTEXT_CONF[40]]: 0x00
                Context configuration [CONTEXT_CONF[39]]: 0x00
                Context configuration [CONTEXT_CONF[38]]: 0x00
                Context configuration [CONTEXT_CONF[37]]: 0x00
                Packed command status [PACKED_COMMAND_STATUS]: 0x00
                Packed command failure index [PACKED_FAILURE_INDEX]: 0x00
                Power Off Notification [POWER_OFF_NOTIFICATION]: 0x00
                Control to turn the Cache ON/OFF [CACHE_CTRL]: 0x01
                eMMC Firmware Version: R
                eMMC Life Time Estimation A [EXT_CSD_DEVICE_LIFE_TIME_EST_TYP_A]: 0x0b
                eMMC Life Time Estimation B [EXT_CSD_DEVICE_LIFE_TIME_EST_TYP_B]: 0x0b
                eMMC Pre EOL information [EXT_CSD_PRE_EOL_INFO]: 0x01
                Secure Removal Type [SECURE_REMOVAL_TYPE]: 0x01
                 information is configured to be removed by an erase of the physical memory
                 Supported Secure Removal Type:
                  information removed by an erase of the physical memory
                

                So yeah, this eMMC is EOL...
                Ironically, running the memory test caused it to crash.
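
For anyone reading the wear fields in a dump like the one above, here is a minimal sketch that pulls out the three lifetime-related values and translates them. The value meanings are my reading of the JEDEC eMMC 5.0 EXT_CSD definitions (0x01 to 0x0a are 10% wear bands, 0x0b means the estimate has been exceeded), so check them against the spec; the function names are made up for illustration.

```python
import re

# Assumed meanings, based on the JEDEC eMMC 5.0 EXT_CSD definitions.
PRE_EOL = {
    0x01: "normal",
    0x02: "warning (80% of reserved blocks consumed)",
    0x03: "urgent",
}

FIELD_RE = re.compile(
    r"\[(EXT_CSD_DEVICE_LIFE_TIME_EST_TYP_[AB]|EXT_CSD_PRE_EOL_INFO)\]:\s*(0x[0-9a-fA-F]+)"
)


def life_time_estimate(value: int) -> str:
    """Translate a DEVICE_LIFE_TIME_EST_TYP_A/B value into a wear band."""
    if 0x01 <= value <= 0x0A:
        return f"{(value - 1) * 10}-{value * 10}% of estimated life used"
    if value == 0x0B:
        return "exceeded maximum estimated life"
    return "not defined"


def decode_wear(extcsd_text: str) -> dict:
    """Extract and decode the wear-related fields from `mmc extcsd read` output."""
    report = {}
    for name, raw in FIELD_RE.findall(extcsd_text):
        value = int(raw, 16)
        if name == "EXT_CSD_PRE_EOL_INFO":
            report[name] = PRE_EOL.get(value, "not defined")
        else:
            report[name] = life_time_estimate(value)
    return report


# The three relevant lines copied from the output above.
sample = """
eMMC Life Time Estimation A [EXT_CSD_DEVICE_LIFE_TIME_EST_TYP_A]: 0x0b
eMMC Life Time Estimation B [EXT_CSD_DEVICE_LIFE_TIME_EST_TYP_B]: 0x0b
eMMC Pre EOL information [EXT_CSD_PRE_EOL_INFO]: 0x01
"""

for field, meaning in decode_wear(sample).items():
    print(f"{field}: {meaning}")
```

If those meanings are right, both life time estimates here are past the end of the scale even though the pre-EOL field still reads normal, so it is the wear estimate rather than the reserved-block indicator that flags this device.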

                  stephenw10 Netgate Administrator @azdeltawye

                  @azdeltawye said in Multiple network failures after dirty upgrade to 23.01:

                  eMMC Life Time Estimation B [EXT_CSD_DEVICE_LIFE_TIME_EST_TYP_B]: 0x0b

                  Yes, I would get an SSD in there ASAP.

                    PacketSniffer @azdeltawye

                    @azdeltawye I have a Netgate 2100 and I'm experiencing issues similar to yours. DHCP seems to fail, I can't log into the firewall, and the console is unresponsive. After a very long time, the console spit out the following: "Solaris: WARNING: Pool 'pfSense' has encountered an uncorrectable I/O failure and has been suspended.", which is how I found your post. This all happened after the 23.01 update. For my unit, a reboot doesn't fix the issue; a reflash of the firmware and a reload of the config is what gets the firewall going for an additional 2-3 days. I've opened a ticket with Netgate to troubleshoot this issue and they basically said that the internal eMMC is dying. I provided these logs when even flashing the firmware was giving me a hard time:

                    (100s of these lines)
                    mmcsd0: failed to flush cache
                    mmcsd0: failed to flush cache
                    mmcsd0: failed to flush cache
                    mmcsd0: failed to flush cache
                    mmcsd0: failed to flush cache
                    mmcsd0: failed to flush cache
                    mmcsd0: failed to flush cache

                    Loader variables:
                    vfs.root.mountfrom=zfs:pfSense/ROOT/default

                    Manual root filesystem specification:
                    <fstype>:<device> [options]
                    Mount <device> using filesystem <fstype>
                    and with the specified (optional) option list.

                    eg. ufs:/dev/da0s1a
                    zfs:zroot/ROOT/default
                    cd9660:/dev/cd0 ro
                    (which is equivalent to: mount -t cd9660 -o ro /dev/cd0 /)

                    ? List valid disk boot devices
                    . Yield 1 second (for background tasks)
                    <empty line> Abort manual input

                    mountroot>

                    So yeah, I guess this is confirmation that I need to get a new SSD in there. Not sure why this only happened after 23.01 tho.
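
When a console capture contains hundreds of identical lines like the ones above, it can help to tally them before attaching the log to a ticket. This is only a rough sketch: the function name is invented for illustration, and the sample text stands in for whatever capture you saved.

```python
from collections import Counter


def count_repeated_messages(log_text: str, prefix: str = "mmcsd0:") -> Counter:
    """Count identical log lines that start with the given device prefix."""
    counts = Counter()
    for line in log_text.splitlines():
        line = line.strip()
        if line.startswith(prefix):
            counts[line] += 1
    return counts


# Stand-in for a saved console capture.
sample_log = """
mmcsd0: failed to flush cache
mmcsd0: failed to flush cache
mmcsd0: failed to flush cache
Loader variables:
vfs.root.mountfrom=zfs:pfSense/ROOT/default
"""

for message, count in count_repeated_messages(sample_log).most_common():
    print(f"{count:5d}  {message}")
```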

                      azdeltawye @PacketSniffer

                      @packetsniffer
                      Yeah, my Netgate appliance eventually became unreachable via both the RJ45 ports and the serial console port. I replaced, or actually added, the approved SATA SSD in an attempt to recover the unit, but that did not help. The unit was toast...

                      Takeaway lesson: Next Netgate appliance purchase - pony up the extra $$ for the MAX version. Preferably something without eMMC!

                        SteveITS Galactic Empire @PacketSniffer

                        @packetsniffer Most likely because an update writes a lot to disk. For the record there is this:
                        https://docs.netgate.com/pfsense/en/latest/troubleshooting/disk-lifetime.html
                        and RAM disks can save writing: https://docs.netgate.com/pfsense/en/latest/troubleshooting/disk-writes.html

                        https://www.netgate.com/supported-pfsense-plus-packages has remarks on whether an SSD is recommended for certain packages that can have intense logging depending on configuration.


                          PacketSniffer @SteveITS

                          @SteveITS Thank you.

                          I followed a few links to test the onboard memory, and it turns out mine was pretty dead.

                          I threw a new Transcend 512GB (TS512GMTS430S) in the Netgate 2100, flashed 23.01 on there, restored my config, and I've been solid for 2+ weeks.

                          I will be looking into cleaning up my logging to reduce wear and tear.

                          Thanks everyone!

                          Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.