Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    What is the biggest attack in GBPS you stopped

    Scheduled Pinned Locked Moved General pfSense Questions
    737 Posts 33 Posters 816.4k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • F Offline
      firewalluser
      last edited by

      @Harvy66:

      I just realized that when ping flooding my firewall, the only thing that changed was the IRQ CPU time. I looked up ICMP and it seems the ICMP response is built right into the network stack. From what I can tell, the ICMP responses are being handled on that same realtime kernel thread. If this is the case and my admin interface stopped responding to pings because of load, that means the realtime thread on a different interface couldn't even run.

      One of the features of MSI-X is interrupt masks. When a "hard" interrupt occurs, the current context thread gets interrupted and the real time thread gets scheduled. Then the thread can set a mask and block all other interrupts. It can process all of its data. Of course blocking interrupts is bad if you don't have a way to indicate there is new work, so there are "soft" interrupts. If the hardware supports it, the hardware can flag a shared memory location that new data is ready. When the current thread is done processing it's current data, it can do one last check to see if the soft interrupt was signaled. If not, unmask the interrupts and return. If it was flagged, then continue processing until all the data is work is done and no more soft interrupts have been signaled.

      It may be possible that the WAN interface is in a constant state of backlog and the realtime kernel thread never unschedules because of constant backlog, starving my admin interface from CPU time.

      With that in mind, some basic cheap nics ie realtek might actually be less hassle compared to say an intel nic when solving this sort of problem

      With a minimum ping of 0.003ms and an average of 0.176, yet only ~250 interrupts per second, the i350 NIC is doing some nifty magic.

      Having some devices do some of the work can be useful, but I suspect its also helping create the problem being seen here in this instance.

      http://en.wikipedia.org/wiki/Message_Signaled_Interrupts#MSI-X
      https://forums.freebsd.org/threads/msi-msi-x-on-intel-em-nic.27736/
      http://people.freebsd.org/~jhb/papers/bsdcan/2007/article/node8.html

      Might be useful as its looking at possibles areas for MSI-X failures.
      http://comments.gmane.org/gmane.os.freebsd.stable/71699
      http://christopher-technicalmusings.blogspot.co.uk/2012/12/passthrough-pcie-devices-from-esxi-to.html

      Capitalism, currently The World's best Entertainment Control System and YOU cant buy it! But you can buy this, or some of this or some of these

      Asch Conformity, mainly the blind leading the blind.

      1 Reply Last reply Reply Quote 0
      • F Offline
        firewalluser
        last edited by

        @Supermule:

        Yes… but why can other firewalls keep up with the traffic and pf cant?

        Out of the box or have they been tuned?

        If hardware was the limit here, then all tested should suffer the same faith. They dont…

        Fortigates VMware Appliance and Mikrotik handles the same traffic no issues. Tell me why.... on the same ressources and in the same Hypervisor.

        Tuning?

        What makes pfsense's default settings suitable for your datacentre setup compared to my home use setup?

        Its like tuning a F1 race car, its not going to do well in World Rally Championship setting is it on snow, across deserts or in forests, likewise a rally car is not going to do so well on a race track against F1 cars is it.

        Capitalism, currently The World's best Entertainment Control System and YOU cant buy it! But you can buy this, or some of this or some of these

        Asch Conformity, mainly the blind leading the blind.

        1 Reply Last reply Reply Quote 0
        • S Offline
          Supermule Banned
          last edited by

          Out of the box.

          Tuning no… It should default to the behaviour of handling the traffic received and shouldnt fail on 3mbit/s traffic.

          You cant compare a F1 car to a WRC car. Its like comparing apples to bananas.

          The only they have in common is that they are fruits :D

          As does the firewalls. They handle traffic and blocks it if needed. Default behaviour. EOD.

          @firewalluser:

          @Supermule:

          Yes… but why can other firewalls keep up with the traffic and pf cant?

          Out of the box or have they been tuned?

          If hardware was the limit here, then all tested should suffer the same faith. They dont…

          Fortigates VMware Appliance and Mikrotik handles the same traffic no issues. Tell me why.... on the same ressources and in the same Hypervisor.

          Tuning?

          What makes pfsense's default settings suitable for your datacentre setup compared to my home use setup?

          Its like tuning a F1 race car, its not going to do well in World Rally Championship setting is it on snow, across deserts or in forests, likewise a rally car is not going to do so well on a race track against F1 cars is it.

          1 Reply Last reply Reply Quote 0
          • S Offline
            Supermule Banned
            last edited by

            A little weird thing I noticed…

            I tried running the VM with an odd amount of cores.

            It didnt change a bit, but behaviour did. 2 of the Cores are switching between 0% and 100% after the attack was stopped but none of them is 0% if the other one is...

            Anybody care to explain why?

            I havent seen this before and its like it wont let go.

            vmware.PNG
            vmware.PNG_thumb

            1 Reply Last reply Reply Quote 0
            • F Offline
              firewalluser
              last edited by

              Look at the code in the OS that determines what to assign to the available cores.

              The OS might be "smart" enough to work out which cores are already under load and also under load for long periods of time, and thus it might assign other cores to be used.

              This is windows thread scheduling https://msdn.microsoft.com/en-us/library/ms685100%28VS.85%29.aspx

              Freebsd
              https://calomel.org/freebsd_network_tuning.html
              Things you need to look at include processor affinity & thread scheduling as mentioned in the link above and some of it shown below.

              "######################################### net.isr. tuning begin ##############

              NOTE regarding "net.isr.*" : Processor affinity can effectively reduce cache

              problems but it does not curb the persistent load-balancing problem.[1]

              Processor affinity becomes more complicated in systems with non-uniform

              architectures. A system with two dual-core hyper-threaded CPUs presents a

              challenge to a scheduling algorithm. There is complete affinity between two

              virtual CPUs implemented on the same core via hyper-threading, partial

              affinity between two cores on the same physical chip (as the cores share

              some, but not all, cache), and no affinity between separate physical chips.

              It is possible that net.isr.bindthreads="0" and net.isr.maxthreads="3" can

              cause more slowdown if your system is not cpu loaded already. We highly

              recommend getting a more efficient network card instead of setting the

              "net.isr.*" options. Look at the Intel i350 for gigabit or the Myricom

              10G-PCIE2-8C2-2S for 10gig. These cards will reduce the machines nic

              processing to 12% or lower."

              This might also be useful albeit for an earlier version of freebsd.
              http://www.icir.org/gregor/tools/pthread-scheduling.html

              Capitalism, currently The World's best Entertainment Control System and YOU cant buy it! But you can buy this, or some of this or some of these

              Asch Conformity, mainly the blind leading the blind.

              1 Reply Last reply Reply Quote 0
              • F Offline
                firewalluser
                last edited by

                @Supermule:

                Out of the box.

                Tuning no… It should default to the behaviour of handling the traffic received and shouldnt fail on 3mbit/s traffic.

                You cant compare a F1 car to a WRC car. Its like comparing apples to bananas.

                The only they have in common is that they are fruits :D

                As does the firewalls. They handle traffic and blocks it if needed. Default behaviour. EOD.

                I disagree, but thats because we dont know if the other products out of the box have additional code in order for it to adapt or change to requirements which may not be built into pfsense.

                Can you provide examples of out of the box fw's which do work out of the box and can you show me what is different to pfsense in order for them to work out of the box?

                The phrase, you get what you pay for springs to mind at the moment.  :)

                Capitalism, currently The World's best Entertainment Control System and YOU cant buy it! But you can buy this, or some of this or some of these

                Asch Conformity, mainly the blind leading the blind.

                1 Reply Last reply Reply Quote 0
                • S Offline
                  Supermule Banned
                  last edited by

                  Mikrotik, Fortigate, ISA Server and Windows Firewall.

                  No other of what we have tested passed the tests.

                  1 Reply Last reply Reply Quote 0
                  • H Offline
                    Harvy66
                    last edited by

                    FreeBSD is getting more SMP love for its network stack in 11. Each major release seems to have better core scaling for IO in general. There are some major plans to allow the network stack to both receive and send flows stickied to a single core and have flows randomly distributed among the cores.

                    One thing that I do not know about SMP loving is NAT. I know the NAT implementation has been single threaded for a while. It's possible it may be able to get a re-write once some of the new SMP network stack APIs are finalized. Or just got IPv6.

                    1 Reply Last reply Reply Quote 0
                    • F Offline
                      firewalluser
                      last edited by

                      @Supermule:

                      Mikrotik, Fortigate, ISA Server and Windows Firewall.

                      No other of what we have tested passed the tests.

                      Mikrotik - RouterOS based on Linux, so try some linux hacks on it for stability testing.
                      http://en.wikipedia.org/wiki/MikroTik#RouterOS

                      Fortigate - FortiOS based on Linux, so as above…
                      http://en.wikipedia.org/wiki/Fortinet#GPL_violations

                      Windows ISA Server Forefront no longer available as MS have announced they are dropping it so support will be gone in time. Try some windows hacks for stability testing.

                      This matters because freebsd is primarily aimed at stability although it has pioneered some features yet to be seen in other OS platforms and also holds an unofficial world record for the most amount of data transmitted
                      https://www.freebsd.org/advocacy/whyusefreebsd.html

                      http://www.serverwatch.com/tutorials/article.php/10825_3393051_2/Differentiating-Among-BSD-Distros.htm
                      "FreeBSD holds the unofficial record for transferring data, having achieved more than 2 Terabytes of data from one server running the OS. It follows from this statistic that FreeBSD is also one of the most stable OSes available."

                      The last part above is not what you want to hear considering what you are experiencing but it goes back to my point about tuning.

                      You can tune a little ford fiesta engine to compete on a 1/4 mile with similar performance as a bigger engined car, but that ford fiesta engine will then have no reliability and will likely explode after completing the 1/4 mile.

                      I guess what you need to do is define your aim's then select the correct fw according to those aim's.

                      Capitalism, currently The World's best Entertainment Control System and YOU cant buy it! But you can buy this, or some of this or some of these

                      Asch Conformity, mainly the blind leading the blind.

                      1 Reply Last reply Reply Quote 0
                      • S Offline
                        Supermule Banned
                        last edited by

                        Attack is currently scaled down to 2mbit/s and the FW still dies despite limiting states pr. second and states pr. host.

                        Ran Top command and here is some info.

                        1 core is still blasting away at full speed. (Core 6) if you pay for IOPS/CPU in a datacenter, this is not good news.

                        Advanced_options_rule.PNG_thumb
                        Advanced_options_rule.PNG
                        top1.PNG_thumb
                        top1.PNG
                        top.PNG_thumb
                        top.PNG
                        vmware.PNG_thumb
                        vmware.PNG

                        1 Reply Last reply Reply Quote 0
                        • S Offline
                          Supermule Banned
                          last edited by

                          It seems to need it bad.

                          But when will pfSense have it, is another interesting question.

                          @Harvy66:

                          FreeBSD is getting more SMP love for its network stack in 11. Each major release seems to have better core scaling for IO in general. There are some major plans to allow the network stack to both receive and send flows stickied to a single core and have flows randomly distributed among the cores.

                          One thing that I do not know about SMP loving is NAT. I know the NAT implementation has been single threaded for a while. It's possible it may be able to get a re-write once some of the new SMP network stack APIs are finalized. Or just got IPv6.

                          1 Reply Last reply Reply Quote 0
                          • S Offline
                            Supermule Banned
                            last edited by

                            Last one for today. Still packetloss and an unresponsive GUI. Traffic stateless is around 15mbit/s.

                            I see filterlog comsuming a lot of CPU during the attack. Big difference in Vmware CPU wise… Core nr. 6 is still blasting away, but its not attack dependant. If it wasnt going at a 100% then it could have survived (maybe).

                            vmware.PNG
                            vmware.PNG_thumb
                            top_before_attack_stateless.PNG
                            top_before_attack_stateless.PNG_thumb
                            top_attack_stateless.PNG
                            top_attack_stateless.PNG_thumb
                            traffic.PNG
                            traffic.PNG_thumb

                            1 Reply Last reply Reply Quote 0
                            • S Offline
                              Supermule Banned
                              last edited by

                              SYN Flood recording stateless running TOP on Console

                              Youtube Video

                              SYN Flood recording running SYN Proxy states with limiters and TOP on Console

                              Youtube Video

                              If you wonder why you cant see core nr4 in TOP, you are not the only one.

                              It runs 100% in VmWare.

                              core_missing.PNG
                              core_missing.PNG_thumb

                              1 Reply Last reply Reply Quote 0
                              • F Offline
                                firewalluser
                                last edited by

                                So have you identified the code thats running on the core that maxes out when this happens?

                                If you havent, how can you fix the problem?

                                At the moment you are just reporting symptoms which as you can see by the length of the thread its not been that useful at fixing the problem so far has it?

                                Capitalism, currently The World's best Entertainment Control System and YOU cant buy it! But you can buy this, or some of this or some of these

                                Asch Conformity, mainly the blind leading the blind.

                                1 Reply Last reply Reply Quote 0
                                • S Offline
                                  Supermule Banned
                                  last edited by

                                  I havent got a clue of where to begin and where to look.

                                  I cant see whats using the core…

                                  What do you make of this? Core nr. 4 says its idle at 100%

                                  core4.PNG
                                  core4.PNG_thumb

                                  1 Reply Last reply Reply Quote 0
                                  • T Offline
                                    tim.mcmanus
                                    last edited by

                                    @firewalluser:

                                    So have you identified the code thats running on the core that maxes out when this happens?

                                    I'm almost certain the issue is with the network driver in FreeBSD and it's also being contributed to by PF.

                                    When my state table is low (394K) the attack cripples the entire box with the exception of the console.  PF alerts that it hit its state table max in the console.  I'm not sure why a full state table creates a more significant impact on the box, but it does.

                                    When I increase the state table I get the IRQ warning; the interrupt storm.  This disables the interface being attacked and it's most likely due to the interface grabbing one CPU/core and filling it with software interrupts.  PF takes all packets and puts them through the CPU, and in this case it would/should grab only one CPU.  This makes sense because you don't want IRQ polling across all CPUs (I have a link to an excellent article regarding this design, I'll find it in a few).  So generating an interrupt storm on any interface should max out one CPU and take that interface down because of the interrupts being generated.  The CPU does not have the resources to process legit request because it's overwhelmed with interrupts.

                                    I guess that the network driver would have to include code to drop these packets before they got to the OS/kernel.  Once the kernel gets involved in processing these packets, it generates the interrupt storm, bogs down one CPU, and the interface goes down.

                                    I also assume that there is probably some performance tuning I can do in pfSense, but I think the issue is at a lower level than that.  If I have time this weekend, I'll pin up a FreeBSD 10.1 box running PF to validate these assumptions, but I have a strong feeling it's the networking driver that's creating this issue by passing every packet off to the kernel and PF for processing.

                                    1 Reply Last reply Reply Quote 0
                                    • S Offline
                                      Supermule Banned
                                      last edited by

                                      I disabled Device Polling and the 100% usage of the Core nr. 4 went away instantly.

                                      This is console with Top -p running when no attack and then the SYN attack.

                                      First one is with no portforward and the box is fine.

                                      2nd is with portforward and it dies.

                                      ![top-p_no portforward_SYN flood.PNG](/public/imported_attachments/1/top-p_no portforward_SYN flood.PNG)
                                      ![top-p_no portforward_SYN flood.PNG_thumb](/public/imported_attachments/1/top-p_no portforward_SYN flood.PNG_thumb)
                                      ![top-p_WITH_portforward_SYN flood.PNG](/public/imported_attachments/1/top-p_WITH_portforward_SYN flood.PNG)
                                      ![top-p_WITH_portforward_SYN flood.PNG_thumb](/public/imported_attachments/1/top-p_WITH_portforward_SYN flood.PNG_thumb)

                                      1 Reply Last reply Reply Quote 0
                                      • H Offline
                                        Harvy66
                                        last edited by

                                        It's possible that when the state table is full and a new packet for yet another new state comes in, if it's all being processed on the same thread, maybe a new state with a full table triggers some sort of "clean up" in an attempt to make room, and this clean up is really expensive to be doing per packet.

                                        1 Reply Last reply Reply Quote 0
                                        • H Offline
                                          Harvy66
                                          last edited by

                                          I think you mentioned this before, but I just want to make sure. When you say with/without forwarding, do you mean when targeting the forwarded port or just forwarding in general?

                                          If it's forwarding in general, maybe it's NAT that's causing some/all of the issues. Since NAT is single threaded, if a new state is coming in and in order to forward ports, NAT needs to first re-write the header information prior to the firewall seeing the packet, now we have a single chunk of code that is acting as gatekeeper to the firewall, and it's single threaded to boot.

                                          When you have no forwarding rules, NAT doesn't even need to be checked. But if you have one more more rules, NAT has to check the new state packet, for every new state packet that comes in, single threaded, lots of locking.

                                          edit: I see syslogd using a lot of CPU, are you logging blocked packets? May want to disable that during the test.

                                          1 Reply Last reply Reply Quote 0
                                          • T Offline
                                            tim.mcmanus
                                            last edited by

                                            It's also important to note that SM is running pfSense as a VM and I am running it on bare metal.  This can impact the way it handles network traffic.

                                            https://lists.freebsd.org/pipermail/freebsd-net/2015-March/041657.html

                                            1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post
                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.