Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    Another Netgate with storage failure, 6 in total so far

    Scheduled Pinned Locked Moved Official Netgate® Hardware
    304 Posts 37 Posters 72.7k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • A
      andrew_cb @jwt
      last edited by

      @jwt

      In response to your points:

      When I search for "enterprise-ready" on store.netgate.com, the only two devices that come up are the 8300 Base and 8300 Max. Neither has an eMMC.
      The only device in the Netgate catalog that does not have a SSD/NVMe option is the 1100. Every other device (2100, 4200, 6100) has an option for NVMe storage or only comes with NVMe storage (8200, 8300).

      I said "Enterprise-ready," not as a literal search term/description. Do you prefer "business-ready" instead? If you search for "enterprise," I think you will find it under the heading "Best For" on the 4100, 6100, and 7100. All models are Best For "Managed Service Provider / Managed Security Service Provider (MSP/MSSP) On-Premises Appliance."

      What's the problem here, other than at one point, they said "42%" when the figure is 48%.

      There have been four posts about storage failure in the last two weeks, and that's just in this subforum. This shows that storage failure is a frequent and ongoing issue. Particularly concerning is the storage failure on 4200, which has only been out for nine months!

      I am unaware of any package which is marked as religious rather than secular.

      I referred to the package list as "sacred" since in most of the threads about storage failure, someone nearly always replies, "Check the package list and see that package XYZ requires an SSD." The package list and package documentation are used to explain and justify most storage failures. If the package documentation is the source of truth for storage requirements, then it needs to be much more prominent and up-to-date with all packages.

      In a word: No. There is no statement about safety on that page. In fact, neither the word "safety", or "safe" occurs on that page.

      If that is the case, then there should clearly be a warning, even for packages maintained by Netgate.

      Even more useful would be redmine bug reports.

      Can you clarify what sort of bug reports (storage requirements, update package documentation, update package list, etc) are?

      Not if the trailer hitch you install isn't rated for 10 tons.

      That's a good point. Hopefully, the truck, hitch, and/or trailer manual would clearly mention this so I would be aware of it before purchasing.

      I don't want to quote Steve Jobs, but... you're holding it wrong.

      Addressed in my previous post.

      Used within its limitations, eMMC is a good solution.

      What are the limitations? I have been unable to locate any information on store.netgate.com, and docs.netgate.com only has "Troubleshooting Disk Lifetime."

      Please show me the Netgate device that came (only) with eMMC that cost(s) "thousands of dollars".

      "Thousands of dollars" was not the main point of that sentence. But for reference, the 7100 was $999 USD in this announcement.

      I suggest you tone down rhetoric such as this. "White-labeled" does disservice to our level of effort and engagement with Silicom. You suggest that all we do is slap a label on the box. Are you aware that Silicom also builds (quite similar) devices for Dell and others? Are you suggesting that these are also "white label"? Does Apple using Foxconn to build iPhones mean that iPhones are also "white label"?

      My point was that concerning the storage, unless Netate is speccing something different, Silicom appears to use
      eMMC storage in many of its devices (such as the unit the 4200 is based on). Also, I understand that the products that Foxconn builds for Apple are designed by Apple, manufactured exclusively for Apple, and are not resold under other brand names.

      TRIM (or an equivalent such as DISCARD) are required by JEDEC standards as far back as 2010.

      I may be wrong about TRIM support on eMMC devices. I saw it mentioned in a few posts. TRIM support should help increase eMMC life (or, inversely, it would be even worse without it).

      Yet you do not show these calculations, such that we may assess their accuracy. Specifically, I'm curious what you used for a Write Amplification Factor and how you determined same.

      I used the calculations from here but increased the device size to 8GB, which doubled the result as expected. The WAF used is 4.5. If you can access better information, I would like to see how it changes the calculations. I also gave an example of the values for a 10x life increase.

      An 8GB device with 3000 P/E cycles per cell works out to 24TB of total writes, assuming that every P/E cycle allows user data to be stored. That works out to ~22GB per day, ~900MB per hour, 15MB per minute, 252KB per second, to last for 3 years.

      If pfSsense could use the disk stats in FreeBSD to track the data written (per second/minute and cumulatively) and display a warning when it exceeds a certain threshold, that would be very helpful.

      I found it interesting that you did not acknowledge my frustration or express concern that a Netgate customer is experiencing such a high failure rate, regardless of the cause. I noticed you did not even address my assertion that we are experiencing a 30-40% failure rate of devices under 3 years old. Your focus seems to be on debating specific technicalities and semantics rather than the elephant in the room of why I and many others experience storage failures and what Netgate is doing to address this, whether through technical means or improved documentation and awareness.

      1 Reply Last reply Reply Quote 0
      • A
        andrew_cb @jwt
        last edited by

        @jwt said in Another Netgate with storage failure, 6 in total so far:

        I actually came back to this thread to see if you had responded, but apparently not.

        Unfortunately, I have been busy planning the storage replacement of the 10 Netgate devices that are showing 100% and 110% storage wear. Hopefully, we can complete the replacements before they suddenly die and leave our customers without internet service.

        1 Reply Last reply Reply Quote 0
        • A
          andrew_cb
          last edited by

          Another idea: Could pfSense display a warning when a user attempts to install certain packages, either always or just when eMMC storage is detected?

          Warning: Package xyz is write-intensive and can cause accelerated wear and/or premature storage failure. It is recommended to use mmc-utils or the SMART tool and widget to monitor storage health. Further information is available here, here and here.

          Maybe something like this on the Packages menu in pfSense?

          Some packages are resource-intensive and/or require additional storage considerations. Review the supported packages list for any specific requirements.

          I noticed that the documentation for the SMART status page is not in the Diagnostics menu documentation category - I had to search for it and found it under "System Monitoring" - it should probably be linked in both places.

          1 Reply Last reply Reply Quote 3
          • A
            andrew_cb @jwt
            last edited by

            @jwt I came back to this thread to see if you had responded, but apparently not.

            1 Reply Last reply Reply Quote 0
            • A andrew_cb referenced this topic on
            • A
              andrew_cb
              last edited by andrew_cb

              It seems that @stbellcom is in a similar situation of the eMMC storage on his Netgate 6100's dying, some in a little as 6 months.

              One device refused to power on after the eMMC failed, which I have also experienced. After he desoldered the eMMC chips, the unit started working.

              https://forum.netgate.com/topic/196028/6100-failed-emmc-replaced-with-nvme-but-now-no-longer-reboots

              M 1 Reply Last reply Reply Quote 0
              • M
                michmoor LAYER 8 Rebel Alliance @andrew_cb
                last edited by

                @andrew_cb thread went silent………

                Firewall: NetGate,Palo Alto-VM,Juniper SRX
                Routing: Juniper, Arista, Cisco
                Switching: Juniper, Arista, Cisco
                Wireless: Unifi, Aruba IAP
                JNCIP,CCNP Enterprise

                1 Reply Last reply Reply Quote 0
                • stephenw10S
                  stephenw10 Netgate Administrator
                  last edited by

                  Well it is the weekend. Even I try to clock out occasionally. 😉

                  1 Reply Last reply Reply Quote 1
                  • w0wW
                    w0w
                    last edited by

                    The problem with eMMC would hardly ever have occurred if the eMMC size had been chosen appropriately. After all, TBW is proportional to this parameter. Essentially, in this sense, the difference from SSDs is minimal, except for the type of connection and the fact that SSDs can be replaced without hot air soldering skills . What seems most odd to me is the choice of such small eMMC sizes, especially considering that eMMC prices in those years didn’t significantly increase the final cost of the product.

                    1 Reply Last reply Reply Quote 1
                    • A
                      andrew_cb
                      last edited by

                      I was directed to this post from 2021 about pfSense with ZFS writing 14-20GB per day.
                      This got me curious, so I checked our fleet and found that the range is 3-28GB per day, with an average of 18GB. That's about 2-3 DWPD.

                      2025-01-27_1807.png

                      This works out to an average of 6.5TB per year, and 19.7TB after 3 years.

                      Assuming each cell in the eMMC storage can be written 3000 times, 8GB can handle 24TB of writes before the storage is worn out.
                      This is the best-case scenario. Given the background filesystem behavior plus internal drive processing, writing 20TB of data of a 24TB maximum seems "good".
                      A 32GB eMMC could handle 96TB TBW, which would be 52GB per day maximum, and probably about 43GB (80%) per day real-world.

                      Comparatively, a 250GB Samsung 860 EVO has a TBW value of 150TB, which works out to 82GB per day for 5years. Halving that to approximate a 120GB drive gives a TBW of 75TB and 41 GB per day for 5 years. This fits with the typical 0.3 DWPD rating of most SSDs.

                      Clearly, a larger storage drive, whether eMMC or SSD, greatly increases the lifespan of the storage.
                      If eMMC itself is not the problem, then the small 8GB appears to be a significant factor. Increasing the onboard eMMC storage to 16 or 32GB would greatly increase its lifespan and significantly reduce the number of premature device failures.

                      1 Reply Last reply Reply Quote 1
                      • A
                        andrew_cb
                        last edited by

                        Good morning @jwt . Just checking to see if Netgate is still looking into this and if you have any further comments to add.

                        1 Reply Last reply Reply Quote 0
                        • D
                          dstaylor
                          last edited by

                          My 2 1/2-year-old 6100 is showing over 100% usage, so I've ordered an NVMe to install. I did not have any logging-intensive applications running, so it's frustrating. The cheaper mini PCs I bought for other family members' houses are just chugging along with their SSDs, more memory, and faster CPUs.

                          Is there a guide on installing an NVMe and reinstalling pfSense on that drive out there somewhere? I think my drive arrives on Sunday, so that will be one of my weekend tasks.

                          S A 2 Replies Last reply Reply Quote 0
                          • S
                            SteveITS Galactic Empire @dstaylor
                            last edited by

                            @dstaylor There isn't a guide for the 6100. Try to find some forum threads on that since I seem to recall discussions. It's possible it's not an NVMe drive, too, I don't know, but some other models use SATA. Just in case you want to check before opening it.

                            Pre-2.7.2/23.09: Only install packages for your version, or risk breaking it. Select your branch in System/Update/Update Settings.
                            When upgrading, allow 10-15 minutes to restart, or more depending on packages and device speed.
                            Upvote 👍 helpful posts!

                            D 1 Reply Last reply Reply Quote 0
                            • D
                              dstaylor @SteveITS
                              last edited by

                              @SteveITS I did see mention of NVMe for the 6100, and even a couple of pictures others had posted showing the drive installed. I just have not found any guide on doing an install and making sure it selects the new drive.

                              Guess I'll have my VM of 2.7.2 standing by in case the whole thing dies.

                              1 Reply Last reply Reply Quote 0
                              • stephenw10S
                                stephenw10 Netgate Administrator
                                last edited by stephenw10

                                It is NVMe in the 4100/6100/8200. It's not hard to fit if you have any experience assembling PCs.

                                1 Reply Last reply Reply Quote 0
                                • A
                                  andrew_cb @dstaylor
                                  last edited by

                                  @dstaylor I feel your pain. The storage on your 6100 should not wear out so quickly!

                                  Netgate staff say that it is not possible/supported/recommended for an end-user to install an SSD in the 4100 and 6100, so there is no official documentation.

                                  "It's a time of purchase upgrade option."

                                  "Upgrading, replacing, or adding, an SSD to the 6100 is not supported and may void your warranty. We do not provide an SSD for sale that can be added nor do we provide instructions on how to make the replacement as the entire system is very tightly fit into the case and you can break connections opening and re-securing the enclosure. It happened to a number of our staff during early testing and I have personally confirmed the tight fit on production models of the 4100 (which is basically the same form factor). At present time there are no uses for the additional ports/slots on the 6100."

                                  "For those who are considering upgrading the base model this is not supported by TAC - and may void the warranty (if there is any remaining)."

                                  Why does Netgate misleadingly advertise the slots when they have no use and the user is not supposed to touch them? Well, that is a real puzzler!

                                  So if your Netgate device is 31 to 365 days old, you are SOL. Fortunately, this limitation does not apply to you since your device is out of warranty.

                                  The actual SSD installation process is easy:

                                  • Remove the rubber feet and then remove all 8 torx screws (2 front, 2 back, 4 under the feet).
                                  • Remove the plastic filler panel that was held in by the screws.
                                  • Gently separate the top and bottom half of your 6100.
                                  • Install the M.2 NVMe drive into the slot.
                                  • Carefully put the top and bottom halves together - pay attention to the LED lights and the 3 plastic "shrouds" that bend around the circuit board.
                                  • Reinstall the screws and attach the feet.

                                  The instructions from the 4200 are pretty similar and cover the software re-installation process.

                                  This video of opening a 6100 should be helpful.

                                  D D 2 Replies Last reply Reply Quote 1
                                  • A
                                    andrew_cb
                                    last edited by

                                    @jimp @stephenw10 @kphillips @marcosm @cmcdonald Would any of you care to comment on this thread? With 4.3k views here and over 60k views on Reddit, I hope that @jwt's comments do not represent Netgate's official and only response to the issues that have been raised.

                                    M 1 Reply Last reply Reply Quote 1
                                    • M
                                      marcosm Netgate @andrew_cb
                                      last edited by

                                      @andrew_cb Some good points have been raised along with actionable suggestions to mitigate the issue. Thanks for the constructive feedback - the issue has our attention.

                                      A 1 Reply Last reply Reply Quote 4
                                      • D
                                        dstaylor @andrew_cb
                                        last edited by

                                        @andrew_cb I installed a KingSpec NVMe 256GB drive from Amazon this morning. I followed the guidance you provided and the procedure to wipe the MMC drive in the documents.

                                        Screenshot 2025-01-31 103924.png

                                        It's back up and running now. The only thing that seems to be different is I no longer have the slowly flashing diamond that shows "boot complete/ready". I do get the flashing square during boot, and the orange circle when I put it in standby. I checked that none of the "light pipes" were bent, so that is a head-scratcher. Now when it's running all the LEDs are dark. I took away the memory disk I had implemented to try and prolong the mmc drive because it filled up during my reinstall and package reload.

                                        Best I can tell it's all good except for the LED issue.

                                        1 Reply Last reply Reply Quote 0
                                        • stephenw10S
                                          stephenw10 Netgate Administrator
                                          last edited by

                                          Hmm, odd. Try running: pfSense-led.sh ready

                                          D 1 Reply Last reply Reply Quote 0
                                          • D
                                            dstaylor @stephenw10
                                            last edited by

                                            @stephenw10 I'm running that command from where? And what output am I expecting?

                                            D 1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post
                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.