Netgate Discussion Forum

    Another Netgate with storage failure, 6 in total so far

    Official Netgate® Hardware
    284 Posts 36 Posters 43.3k Views
    • fireodoF
      fireodo @stephenw10
      last edited by

      @stephenw10 said in Another Netgate with storage failure, 6 in total so far:

      Be good if you could set it for /tmp only..... 🤔

      Maybe better, because some inconsistencies in /var could cause trouble ...🤔

      Kettop Mi4300YL CPU: i5-4300Y @ 1.60GHz RAM: 8GB Ethernet Ports: 4
      SSD: SanDisk pSSD-S2 16GB (ZFS) WiFi: WLE200NX
      pfsense 2.7.2 CE
      Packages: Apcupsd Cron Iftop Iperf LCDproc Nmap pfBlockerNG RRD_Summary Shellcmd Snort Speedtest System_Patches.

      • stephenw10S
        stephenw10 Netgate Administrator
        last edited by

        Well you can lose /var and /tmp entirely (if they are ramdisks) and the system will still boot back and replace them.

        • fireodoF
          fireodo @stephenw10
          last edited by fireodo

          @stephenw10 said in Another Netgate with storage failure, 6 in total so far:

          Well you can lose /var and /tmp entirely (if they are ramdisks) and the system will still boot back and replace them.

          So, you agree with me, letting /tmp AND /var "out" of sync would do no harm?

          Fact is, since 2021 I have had reboots but no problems with these settings. I have not had a power failure, because my pfSense is on a UPS.

          • JonathanLeeJ
            JonathanLee
            last edited by JonathanLee

            Side note: I keep thinking about a smart Optane NVMe drive that leverages RAM for the constant rewrites and uses a capacitor/battery to write to the SSD on power failure. Some Symantec and Veritas systems, as well as Tintri systems, have such a capacitor. 4200 systems need the NVMe drive. The needs of zfs are burning up onboard components.

            IMG_1747.png

            Put the specific items that burn up drives under ZFS in this type of environment into the RAM of the NVMe.

            Make sure to upvote

            • dennypageD
              dennypage @fireodo
              last edited by

              @stephenw10 said in Another Netgate with storage failure, 6 in total so far:

              Well you can lose /var and /tmp entirely (if they are ramdisks) and the system will still boot back and replace them.

              @fireodo said in Another Netgate with storage failure, 6 in total so far:

              So, you agree with me, letting /tmp AND /var "out" of sync would do no harm?

              That's not the same thing. Use of ramdisks is a configuration driven thing, and the system knows that /var must be recreated at boot. There is a lot of data stored in /var, and if it's corrupted you could encounter any number of problems.

              IMO, you're fine letting /tmp go to pot, but not /var.

              • stephenw10S
                stephenw10 Netgate Administrator
                last edited by

                Yup, when ramdisks are enabled it triggers a bunch of things at boot and shutdown. But it might be possible....

                • M
                  mer @JonathanLee
                  last edited by

                  @JonathanLee said in Another Netgate with storage failure, 6 in total so far:

                  needs of zfs are burning up onboard components

                  I'm not trying to say people aren't having problems, I'm trying to understand "why" they are.

                  What needs of ZFS are causing this?

                  Burning up these kinds of components is related to the number of erase cycles, which circles back around to writes.
                  What parts of pfSense are doing a lot of writes? I can see persistent logging, maybe "check for updates", maybe the atime property being enabled? Updating the persistent store for block lists?

                  The way I look at it, once the system is configured and running, it should be doing mostly reads from the filesystem and running from memory.
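
To put the erase-cycle point in rough numbers, here is a back-of-the-envelope sketch. The helper name `flash_days` and every input figure are hypothetical examples, not measurements from any Netgate device; real endurance also depends on controller behaviour that this simple formula ignores.

```shell
#!/bin/sh
# Rough flash-endurance estimate (all inputs are hypothetical examples):
#   days = capacity_GB * rated_PE_cycles / (host_GB_per_day * write_amplification)
flash_days() {
    awk -v cap="$1" -v pe="$2" -v daily="$3" -v wa="$4" \
        'BEGIN { printf "%.0f\n", cap * pe / (daily * wa) }'
}

# Example: 16 GB part, 3000 P/E cycles, 10 GB/day of host writes,
# write amplification of 4 (none of these figures come from Netgate).
flash_days 16 3000 10 4    # prints 1200
```

The takeaway is only the shape of the relationship: halving daily writes roughly doubles the estimate, which is why the sources of writes matter.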

                  • dennypageD
                    dennypage @stephenw10
                    last edited by

                    @stephenw10 said in Another Netgate with storage failure, 6 in total so far:

                    Be good if you could set it for /tmp only..... 🤔

                    It would be really good if you could simply do ramdisk for /tmp only. No need to save/restore.

                    • S
                      SteveITS Galactic Empire @mer
                      last edited by

                      @mer said in Another Netgate with storage failure, 6 in total so far:

                      What parts of pfSense are doing a lot of writes?

                      Some packages (https://www.netgate.com/supported-pfsense-plus-packages), logging of default block rules, IGMP block logging, logging set in packages, updates of block lists and country lists, nginx access log (dashboard widgets), and similar.

                      Pre-2.7.2/23.09: Only install packages for your version, or risk breaking it. Select your branch in System/Update/Update Settings.
                      When upgrading, allow 10-15 minutes to restart, or more depending on packages and device speed.
                      Upvote 👍 helpful posts!

                      • JonathanLeeJ
                        JonathanLee @SteveITS
                        last edited by

                        @SteveITS What files can use a linker file to redirect to a USB drive? That model can use a USB drive, right?

                        • w0wW
                          w0w @SteveITS
                          last edited by

                          @SteveITS
                          Hmm, so enabling compression "on the fly" in case of logs can significantly reduce writes, yes?

                          zfs set compression=lz4 pfsense/var/log
                          

                          Some log compression options can be enabled via the GUI, but I don’t think they use "on-the-fly" compression.
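
Before changing the property, it may be worth checking what the dataset already reports. A minimal sketch, assuming the dataset name `pfsense/var/log` from the command above (it may differ on a given install); the helper `ratio_to_saving` is hypothetical and only converts a `compressratio` value into an approximate percentage of data writes saved, ignoring metadata and transaction overhead.

```shell
#!/bin/sh
# Convert a ZFS compressratio value such as "2.00x" into an approximate
# percentage of data writes saved: (1 - 1/ratio) * 100.
ratio_to_saving() {
    ratio=${1%x}    # strip the trailing "x"
    awk -v r="$ratio" 'BEGIN { printf "%.0f\n", (1 - 1 / r) * 100 }'
}

# On a live system (dataset name is an assumption):
#   zfs get -H -o value compression pfsense/var/log
#   ratio_to_saving "$(zfs get -H -o value compressratio pfsense/var/log)"
ratio_to_saving 2.00x    # prints 50
```

If `compression` already shows a value other than `off`, setting it again would not be expected to change anything.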

                          • dennypageD
                            dennypage @SteveITS
                            last edited by

                            @SteveITS said in Another Netgate with storage failure, 6 in total so far:

                            logging of default block rules, IGMP block logging

                            These two can be quite voluminous, but are easy to address:

                            • Add a rule on Local to pass IPv4+IPv6 IGMP with IP options set. I think this should actually be a default rule in pfSense.
                            • Disable logging of packets blocked by the default rule in Firewall Logs. There are often thousands of these every day, and the individual log entries really aren't of much value.
                            • S
                              SteveITS Galactic Empire @JonathanLee
                              last edited by

                              @JonathanLee Oh I have no idea. :)

                              @w0w said in Another Netgate with storage failure, 6 in total so far:

                              Some log compression options can be enabled via the GUI, but I don’t think they use "on-the-fly" compression.

                              Yes it does: https://docs.netgate.com/pfsense/en/latest/monitoring/logs/index.html#log-format

                              "ZFS already compresses this data"

                              @dennypage said in Another Netgate with storage failure, 6 in total so far:

                              easy to address

                              Yep, mentioned above. In a link maybe, it's been a long thread. We actually don't pass the IGMP, since it's "supposed" to be blocked (always has been) we add a block rule that is set to not log. Otherwise IGMP is logged even if the logging for the default block rule is off.

                              • dennypageD
                                dennypage @SteveITS
                                last edited by

                                @SteveITS said in Another Netgate with storage failure, 6 in total so far:

                                We actually don't pass the IGMP, since it's "supposed" to be blocked (always has been) we add a block rule that is set to not log.

                                I would not say IGMP is supposed to be blocked, and it's rather inefficient to do. Multicast flooding is not desirable, even if it's only mDNS.

                                Of course, if IGMP is completely disabled in your switches, it doesn't matter. But if it is disabled in your switches, you won't see the IGMP messages to begin with. 😊

                                • S
                                  SteveITS Galactic Empire @dennypage
                                  last edited by

                                  @dennypage rephrasing, pfSense blocks them by default.
                                  https://docs.netgate.com/pfsense/en/latest/troubleshooting/log-filter-blocked.html#packets-with-ip-options

                                  • dennypageD
                                    dennypage @SteveITS
                                    last edited by

                                    @SteveITS said in Another Netgate with storage failure, 6 in total so far:

                                    rephrasing, pfSense blocks them by default.

                                    Yea, that's why I called out that pfSense should add a default pass rule for IGMP.

                                    Blocking packets with IP options that are to be forwarded is a good default; however, IGMP isn't forwarded. Blocking IGMP by default makes little sense.

                                    • A
                                      andrew_cb @dennypage
                                      last edited by andrew_cb

                                      @dennypage @SteveITS I had commented on redmine 15400 but since it was closed I guess that my message went unnoticed.

                                      I have created a new redmine 16068 for adding options to disable logging of packets with IP options.

                                      • K
                                        kingsleyadam
                                        last edited by

                                        Thanks to @andrew_cb and others for bringing awareness to this. I had no idea my 6100 has limitations due to the eMMC. I went out of my way to buy a 6100 over my own router build because I just wanted to setup my router and forget about it. As someone who is fully remote the router is the last thing I can have fail.

                                        I saw the threads on Reddit and did a quick check. Just over 1.5 years of having my 6100, it’s already at 70% wear.
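
For anyone wanting to run a similar check, here is a minimal sketch. It assumes the `mmc` tool from the mmc-utils package is installed and that the eMMC shows up as `/dev/mmcsd0rpmb`; both are assumptions and may differ per device. The eMMC reports its life-time estimate as a hex step, each step covering 10% of rated life; the helper `emmc_life` is hypothetical and only decodes that value.

```shell
#!/bin/sh
# Hypothetical helper: decode an eMMC DEVICE_LIFE_TIME_EST value
# (0x01..0x0B, per the JEDEC eMMC 5.x EXT_CSD definition) into a
# human-readable "percent of rated life used" range.
emmc_life() {
    step=$(( $1 ))                     # accepts hex such as 0x07
    if [ "$step" -ge 1 ] && [ "$step" -le 10 ]; then
        echo "$(( (step - 1) * 10 ))-$(( step * 10 ))% used"
    elif [ "$step" -eq 11 ]; then
        echo "exceeded rated life"
    else
        echo "not reported"
    fi
}

# On a device (assumed command and device path):
#   mmc extcsd read /dev/mmcsd0rpmb | egrep 'LIFE|EOL'
emmc_life 0x08    # prints "70-80% used"
```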

                                        I bought 2x16GB Intel Optane drives, which you can get for less than 5 euro apiece, and managed to get them installed and set up in a mirror for redundancy (the drives are so cheap, I think it’s silly not to). I also zeroed out my eMMC drive to ensure it doesn’t cause any conflicts.

                                        I’m not thrilled that I had to do this, but I’m thankful there were M.2 ports on the 6100 that I could use. My biggest takeaway is that it is unacceptable that installing your own SSD is not “supported” and could void your warranty. I think there should be a well-documented SSD upgrade for any device that has an available slot; it should not void the warranty, and most definitely shouldn’t be discouraged.

                                        As a side note, I really wish the installer was offline. I was sweating bullets attempting to configure the WAN in the installer with PPPoE and VLAN tagging (don’t get me wrong, it was easy, but if it didn’t work I’d be SOL).

                                        • A
                                          andrew_cb @kingsleyadam
                                          last edited by

                                          @kingsleyadam I am glad you discovered the storage wear on your 6100 and installed an SSD before you experienced a sudden failure!

                                          I had no idea my 6100 has limitations due to the eMMC. I went out of my way to buy a 6100 over my own router build because I just wanted to setup my router and forget about it. As someone who is fully remote the router is the last thing I can have fail.

                                          Your comment is exactly what this thread is about.

                                          There have been many good suggestions in this thread on ways to reduce the wear of the onboard eMMC, but they do not address the main point of this thread:

                                          If any usage assumptions or limitations are not clearly stated upfront or in the documentation, then it does not matter what the technical reasons are, how valid they are, or what workarounds are available!

                                          You cannot advertise a ladder as great for construction work and then not disclose that it has a 100-pound weight limit, just like you cannot sell a manual transmission vehicle without an instrument panel and then say it is the user's fault when the engine blows up.

                                          If Netgate sold ladders like they sell firewalls, what kind of chaos would result from people using the Netgate Ladder-4100 BASE version?

                                          If there are limitations, recommendations, or "best practices" regarding firewalls with eMMC storage, then state them clearly on the product page and conspicuously in the documentation! That would significantly reduce or even eliminate this entire problem.

                                          It has been nearly two months since Netgate acknowledged the issue, and there have been no changes. I do not understand why Netgate refuses to spend an hour copying and pasting an informational blurb to the store product pages.

                                          • J
                                            jwt Netgate @andrew_cb
                                            last edited by

                                            It has been nearly two months since Netgate acknowledged the issue, and there have been no changes.

                                            [literally responding from a Starbucks in So Colorado at 5:30 on a Friday]

                                            You are wrong, there are a lot of changes in progress, but I’m not getting into this with you, here, right now.

                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.