Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    Enhanced Intel SpeedStep / Speed Shift - Are they fully supported?

    Scheduled Pinned Locked Moved Hardware
    74 Posts 8 Posters 11.1k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • RobbieTTR
      RobbieTT @stephenw10
      last edited by RobbieTT

      @stephenw10 said in Enhanced Intel SpeedStep / Speed Shift - Are they fully supported?:

      I came to the conclusion it's because simply running sysctl to read the cpu state causes it to bump the frequency.

      Thankfully I don't see that, at least on this cpu:

      [23.09-RELEASE]/root: sysctl dev.cpu.0.freq dev.cpu.1.freq dev.cpu.2.freq dev.cpu.3.freq dev.cpu.4.freq dev.cpu.5.freq dev.cpu.6.freq dev.cpu.7.freq
      dev.cpu.0.freq: 799
      dev.cpu.1.freq: 799
      dev.cpu.2.freq: 799
      dev.cpu.3.freq: 799
      dev.cpu.4.freq: 799
      dev.cpu.5.freq: 799
      dev.cpu.6.freq: 799
      dev.cpu.7.freq: 799
      [23.09-RELEASE]/root: 
      

      Heisenberg defeated.

      Less luck with PPPoE handling though (920 Mbps download test):

      [23.09-RELEASE]/root: sysctl dev.cpu.0.freq dev.cpu.1.freq dev.cpu.2.freq dev.cpu.3.freq dev.cpu.4.freq dev.cpu.5.freq dev.cpu.6.freq dev.cpu.7.freq
      dev.cpu.0.freq: 3199
      dev.cpu.1.freq: 799
      dev.cpu.2.freq: 799
      dev.cpu.3.freq: 799
      dev.cpu.4.freq: 799
      dev.cpu.5.freq: 799
      dev.cpu.6.freq: 799
      dev.cpu.7.freq: 799
      [23.09-RELEASE]/root: 
      

      ☕️

      1 Reply Last reply Reply Quote 0
      • stephenw10S
        stephenw10 Netgate Administrator
        last edited by

        Hmm, I guess as long as the load sysctl imposes doesn't push it over whatever the threshold is you wouldn't see that. You CPU is likely a lot more powerful than what I'm seeing that on.

        1 Reply Last reply Reply Quote 0
        • B
          bigjohns97
          last edited by

          Per core working just fine here.

          d6f84ec9-5882-402d-b412-148cbf269650-image.png

          62503d1d-ba5d-4e5a-b0b6-35f00d7c07d4-image.png

          1 Reply Last reply Reply Quote 0
          • B
            bigjohns97
            last edited by

            BTW here is a great article showing why this is so important.

            https://pcper.com/2015/11/intel-speed-shift-tested-significant-user-experience-improvements/

            For those who weren't utilizing powerd it's no big deal while those of us who were welcome this update with open arms.

            In these examples you can see that it really only take a couple of milliseconds now to ramp clock speed.

            Essentially meaning there is no reason to not run this on a modern CPU that supports it when concerned about the best possible performance. HUGE for those of us with power hungry x86 CPU's that are running 24/7.

            RobbieTTR 1 Reply Last reply Reply Quote 2
            • RobbieTTR
              RobbieTT @bigjohns97
              last edited by RobbieTT

              @bigjohns97

              I appear to get higher (better) PPPoE throughput on my Xeon-D, which I didn't expect. I need to think a bit more as to why.

              Anyway, just Intel Speed Select Technology to come... 🐥

              ☕️

              B 1 Reply Last reply Reply Quote 0
              • B
                bigjohns97 @RobbieTT
                last edited by bigjohns97

                @RobbieTT What is the model of your CPU?

                Were you using PowerD before?

                Also any virtualization involved?

                RobbieTTR 1 Reply Last reply Reply Quote 0
                • RobbieTTR
                  RobbieTT @bigjohns97
                  last edited by

                  @bigjohns97

                  It's an Intel Xeon D-1736NT and yes, I was using powerd previously.

                  ☕️

                  B 1 Reply Last reply Reply Quote 0
                  • B
                    bigjohns97 @RobbieTT
                    last edited by

                    @RobbieTT This is way better than PowerD, PowerD = SpeedStep

                    Check out my link above.

                    1 Reply Last reply Reply Quote 0
                    • Q
                      q54e3w
                      last edited by

                      pfsensecpu80vs60.jpg

                      1 Reply Last reply Reply Quote 0
                      • T
                        tman222
                        last edited by

                        After testing / monitoring for another week, I have concluded that a Speed Shift setting of "60" works provides a pretty good performance / efficiency trade off on the Intel Xeon D-1718T CPU in one of my systems. If I increase the value further to "80", I find that the low CPU frequencies become too sticky (i.e. it seems to take too long to ramp up), while not really resulting in incremental power savings. If I lower to "50" the CPU ramps up too quickly to top frequencies, resulting in a temperature increase. What's interesting - on another system with a Intel Core i3-10100 CPU, a setting of "60" appears not conservative enough and the CPU still ramps up very quickly to higher frequencies. Could there be some differences between how different Intel CPU architectures handle / implement Speed Shift?

                        I 1 Reply Last reply Reply Quote 2
                        • I
                          InstanceExtension @tman222
                          last edited by

                          @tman222

                          I just completed a number of WAN latency tests on my Xeon D-1718T system and had different results. I had to go down to a setting of 25 for Speed Shift to prevent the router from introducing latency on the WAN connection. 30 might have been ok, but 25 seemed to provide the best throughput and power results. A value of 60 increased the latency back to the same values I had when running on my Atom C3758 based router with PowerD set at Max values.

                          1 Reply Last reply Reply Quote 0
                          • stephenw10S
                            stephenw10 Netgate Administrator
                            last edited by

                            How much latency was that?

                            I 1 Reply Last reply Reply Quote 0
                            • I
                              InstanceExtension @stephenw10
                              last edited by

                              @stephenw10 Lowering the Speed Shift value from 60 to 30 dropped the loaded download latency by ~20ms on average based upon the Waveform Bufferbloat test: https://www.waveform.com/tools/bufferbloat

                              This is with a Comcast 1.2Gb download speed (real value is 1.4Gb).

                              RobbieTTR 1 Reply Last reply Reply Quote 0
                              • stephenw10S
                                stephenw10 Netgate Administrator
                                last edited by

                                Hmm, significant then. 🤔

                                1 Reply Last reply Reply Quote 0
                                • RobbieTTR
                                  RobbieTT @InstanceExtension
                                  last edited by

                                  @InstanceExtension

                                  I've not experienced anything like that, albeit I have more cores. I do have HyperThreading off though, so I wonder if that makes a difference?

                                  ☕️

                                  I 1 Reply Last reply Reply Quote 0
                                  • I
                                    InstanceExtension @RobbieTT
                                    last edited by

                                    @RobbieTT On the LAN side everything is 10Gb high powered devices, so maybe I'm just pushing the router a bit more.

                                    RobbieTTR 1 Reply Last reply Reply Quote 0
                                    • RobbieTTR
                                      RobbieTT @InstanceExtension
                                      last edited by

                                      @InstanceExtension Perhaps it is a difference in workload but my LANs are also 10GbE, plus I use bi-directional FQ_CoDel and I have the additional pfSense burden of limiting my PPPoE WAN to a single core.

                                      ☕️

                                      1 Reply Last reply Reply Quote 1
                                      • T
                                        tman222
                                        last edited by

                                        It's definitely a trade off and and the type of workload / environment matters as well. By favoring power savings over performance I also noticed an increase in latency, but it was very small: maybe an additional 200 microseconds (0.2ms) or so to the first hop (i.e. the dpinger gateway ping), which is probably due to the CPU sitting an min frequency the majority of the time. On the WAN side I have a symmetric 2Gbit Fiber circuit and have not seen any increase in latency or decrease in performance there. Where I have noticed a decrease in performance is when running an iperf3 test between two 10Gbit hosts located on different internals network segments (i.e. basically just L3 routing). Prior to adjusting speed shift settings, I could max out the bandwidth (~9.4Gbit/s) between the hosts with one iperf3 stream (P=1). By favoring power savings, I see closer to 8.5-9.0Gbit/s (on average) now with a single stream. Increasing the streams to two or greater results in the bandwidth maxed out again. Perhaps the load of just one stream (or maybe iperf3 in general) isn't great enough to push the CPU into the highest frequencies when power saving is favored. In any case, it's currently not limiting in any way because I don't have a need to route at 10Gbit line speed. Once WAN speeds increase to that level I will probably have to adjust / tweak the speed shift settings again. In the meantime, with a system that's fairly lightly loaded most of the time, I'm fine with accepting a slight increase in latency and slight decrease in performance in exchange for 2-3 degrees C lower temperatures (on average), along with decreased power consumption.

                                        RobbieTTR 1 Reply Last reply Reply Quote 0
                                        • RobbieTTR
                                          RobbieTT @tman222
                                          last edited by

                                          @tman222
                                          With mine set at 80% I don't see any change in throughput when routing, with 1 or 4 streams. It is a pure flat line at 9.90 Gbps.

                                          If I generate 10GbE using the router as the server or client I can detect a small ripple in the graph with 1 stream but the 9.90 Gbps average remains:

                                           2023-12-16 at 12.10.38.png

                                          4 streams is still a flat line though:

                                           2023-12-16 at 12.02.03.png

                                          If I simultaneously ping a gateway using the same link I do see a small increase, as you would expect when saturating it with an iPerf test:

                                           2023-12-16 at 12.14.11.png

                                          ☕️

                                          Sergei_ShablovskyS 2 Replies Last reply Reply Quote 0
                                          • T
                                            tman222
                                            last edited by

                                            Hi @RobbieTT - that's interesting! Are those tests done using 1500 byte packets or a larger MTU? Do you get similar results if testing with a client behind the router vs. from the router?

                                            RobbieTTR 1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post
                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.