Netgate Discussion Forum

    What is the biggest attack in GBPS you stopped

    Scheduled Pinned Locked Moved General pfSense Questions
    737 Posts 33 Posters 816.4k Views
    • F Offline
      firewalluser
      last edited by

      @almabes:

      @firewalluser:

      What if you could throttle the packets coming in before the states were processed? Would that prevent the firewall from crashing/hanging?

      Isn't that the core idea of "a DDoS shouldn't be dealt with by the firewall, but upstream?"

      Yes & no.

      We have our internet feeds, and we know what speed and how much data we can get from them; for example, some might be fast fibre but with a 10GB data cap. So surely it is up to us to ensure the fw can handle data arriving at that speed? It also depends on what services the network provider gives us for the money we pay for that feed, although, depending on what country you are in, the spooks may also have a hand in what arrives at your fw.

      Capitalism, currently The World's best Entertainment Control System and YOU cant buy it! But you can buy this, or some of this or some of these

      Asch Conformity, mainly the blind leading the blind.

      • F Offline
        firewalluser
        last edited by

        http://en.wikipedia.org/wiki/Interrupt_storm

        "In operating systems, an interrupt storm is an event during which a processor receives an inordinate number of interrupts that consume the majority of the processor's time. Interrupt storms are typically caused by hardware devices that do not support interrupt rate limiting."

        It doesn't seem unlike what I described earlier https://forum.pfsense.org/index.php?topic=91856.msg523964#msg523964, especially considering that chips just run code stored on the chip instead of on the hard drive, i.e. like a BIOS, and that some Intel NICs provide some network processing capabilities, unlike, say, a USB NIC running on an rpi.  ;)

        Might be able to glean some info & solutions from these links.
        https://forums.freebsd.org/threads/tp-link-tl-wn781nd-version-2-works-with-10-1-but-with-one-caveat.49667/
        2014

        https://forums.freebsd.org/threads/interrupt-storm-detected-on-irq10.17192/
        2010

        http://lists.freebsd.org/pipermail/freebsd-questions/2011-August/232647.html
        "Interrupt storms (an olde but a goode)" Bit like rootkits, which a lot of people forgot about.

        https://forums.freebsd.org/threads/intel-dq77kb-high-interrupt-rate-when-using-hdmi.39210/
        2013

        https://forums.freenas.org/index.php?threads/getting-lots-if-interrupt-storm-on-irq16.3425/
        2011

        http://freebsd.1045724.n5.nabble.com/how-to-fix-quot-interrupt-storm-quot-td3819772.html
        2009

        http://daemonforums.org/showthread.php?t=500
        2008

        • F Offline
          firewalluser
          last edited by

          What's on IRQ 267, out of interest?

          • T Offline
            tim.mcmanus
            last edited by

            @firewalluser:

            What's on IRQ 267, out of interest?

            irq267: em1:rx 0                3529974        22

            em1 is my WAN2 interface.

            Intel 82574L Gigabit Ethernet Controller.

            • H Offline
              Harvy66
              last edited by

              Interrupts are almost entirely driven by packets per second; it's how the hardware talks to the OS. Complete guess, but I would assume (assumptions can be dangerous) that interrupts would not be high because of anything the firewall is doing, only because of lots of packets.

              • H Offline
                hda
                last edited by

                @Harvy66:

                …
                If the issue involves states, a good test would be to try a few extreme combinations. With 1mil max states and a target of 10k, the table should never get much past 10k, and it shouldn't hit the max state limit.

                But what if the back-off rate is still too low?

                Something else to test for flushing states during a storm, ../ Firewall Adaptive Timeouts:
                Leave [adaptive.start] at the default (60%), but set [adaptive.end] to 101% of your max states (instead of the default 120%).
                As can be calculated, over the last 5% (above 95% of max states) this flushes far more aggressively than the default setting.

                Hypothesis: if adaptation gets stronger as the remaining headroom -> 0, then pfSense hardly chokes. True || False?
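A minimal sketch of the calculation hinted at above. It assumes pf scales every timeout linearly by (adaptive.end - states) / (adaptive.end - adaptive.start) once the state table passes adaptive.start, which is how the pf.conf man page describes it. The 1,000,000 max states figure is borrowed from the quoted test, and the function name is made up for illustration.

```python
def adaptive_scale(states: int, start: int, end: int) -> float:
    """Fraction of the normal timeout that pf would apply at a given state count."""
    if states <= start:
        return 1.0          # below adaptive.start: timeouts are untouched
    if states >= end:
        return 0.0          # at adaptive.end: timeouts drop to zero
    return (end - states) / (end - start)


MAX_STATES = 1_000_000                 # assumption: value from the quoted test
START = MAX_STATES * 60 // 100         # adaptive.start at 60% of max states
DEFAULT_END = MAX_STATES * 120 // 100  # adaptive.end at the default 120%
PROPOSED_END = MAX_STATES * 101 // 100 # adaptive.end at the proposed 101%

for pct in (80, 95, 100):
    states = MAX_STATES * pct // 100
    default = adaptive_scale(states, START, DEFAULT_END)
    proposed = adaptive_scale(states, START, PROPOSED_END)
    print(f"{pct:3d}% full: default x{default:.3f}, proposed x{proposed:.3f}")
```

With these inputs, at 95% of max states the default end leaves timeouts at roughly 42% of normal, while an end of 101% cuts them to about 15%. At 100% it is about 33% versus about 2%.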

                • F Offline
                  firewalluser
                  last edited by

                  @Harvy66:

                  Interrupts are almost entirely driven by packets per second; it's how the hardware talks to the OS. Complete guess, but I would assume (assumptions can be dangerous) that interrupts would not be high because of anything the firewall is doing, only because of lots of packets.

                  https://www.freebsd.org/doc/en_US.ISO8859-1/books/arch-handbook/smp-design.html

                  " FreeBSD deals with interrupt handlers by giving them their own thread context. Providing a context for interrupt handlers allows them to block on locks. To help avoid latency, however, interrupt threads run at real-time kernel priority. Thus, interrupt handlers should not execute for very long to avoid starving other kernel threads. In addition, since multiple handlers may share an interrupt thread, interrupt handlers should not sleep or use a sleepable lock to avoid starving another interrupt handler."

                  HW interrupts differ from SW interrupts in that HW interrupts are treated as more important than most, but not all, SW interrupts.

                  https://www.freebsd.org/cgi/man.cgi?query=swi&apropos=0&sektion=9
                  "These functions are used to register and schedule software interrupt handlers.  Software interrupt handlers are attached to a software interrupt thread, just as hardware interrupt handlers are attached to a hardware interrupt thread. Multiple handlers can be attached to the same thread. Software interrupt handlers can be used to queue up less critical processing inside of hardware interrupt handlers so that the work can be done at a later time.  Software interrupt threads are different from other kernel threads in that they are treated as an interrupt thread. This means that time spent executing these threads is counted as interrupt time, and that they can be run via a lightweight context switch."

                  Windows is not immune to them either; to a certain extent it's all in the drivers. https://msdn.microsoft.com/en-us/library/windows/hardware/ff540586%28v=vs.85%29.aspx

                  So in some respects, any basic NIC with little or no processing capability relies more on the OS to do the packet processing. Because computers are just glorified clockwork Turk machines, the OS is less likely to get bent out of shape with a basic NIC: everything just runs like clockwork, ignoring those electrons at the socket that the OS is physically incapable of processing because it is tied up elsewhere. A smart NIC, by contrast, has various built-in processing capabilities which, whilst making the packet processing quicker, then cause a flood upstream in the OS itself. It's like CPU caches (L1, L2 & L3), which can be a boon or a hindrance in certain circumstances as well.

                  With that in mind, some basic cheap NICs, e.g. Realtek, might actually be less hassle than, say, an Intel NIC when solving this sort of problem, which would explain why some expensive hw in amateur hands can cause embarrassment when customers are advised to go with the latest and greatest. The trend is generally towards more sophisticated hack attempts, so this sort of thing will only become more common. And with the trend to employ younger talent straight out of uni, the experience is lost and the cycles repeat, considering the timescales of things like SYN floods (1990s), rootkits (1990s) and interrupt storms (1990s), all of which were seen in the DOS days before life became hidden behind GUIs.

                  • F Offline
                    firewalluser
                    last edited by

                    https://doc.pfsense.org/index.php/Tuning_and_Troubleshooting_Network_Cards#Intel_ix.284.29_Cards

                    "On releases prior to pfSense 2.2, the following may be necessary. If using VLANs with Intel 10 Gb ix(4) cards, some features of the driver for VLANs may need to be disabled to work correctly. For instance, to apply these settings on NIC ix0, run the following. "

                    I wonder if it's worth increasing hw.intr_storm_threshold=10000 to something higher?

                    This is showing freebsd 10.1  https://calomel.org/freebsd_network_tuning.html

                    "For 10gig NIC's set to 9000 and use large MTU. (default 1000)
                    #hw.intr_storm_threshold="9000""

                    • H Offline
                      Harvy66
                      last edited by

                      System idle, 208 interrupts per second for the one queue that seems to process ICMP from my desktop. Pinging the interface with 67.3k/sec ICMP packets, 250 interrupts per second.

                      Packets: sent=1422309, rcvd=1422309, error=0, lost=0 (0.0% loss) in 21.131114 sec
                      RTTs in ms: min/avg/max/dev: 0.003 / 0.176 / 20.716 / 0.294
                      Bandwidth in kbytes/sec: sent=4038.525, rcvd=4038.525

                      33Mb/s of 64byte ICMP packets barely made a dent.

                      An increase of 12% cpu time(48% cpu time of the core the queue is on) and a 19% increase in interrupts doesn't seem that bad for that many packets.

                      From a PPS and interrupt point of view, my system really doesn't care. Whatever the earlier attack was, 30Mb/s of it was bad enough that my admin interface went offline.
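For anyone wanting to check the numbers, here is a rough sketch that recomputes the rates from the ping output above. It assumes "kbytes" means 1000 bytes and that the ~250 interrupts per second figure applies to the whole flood; the variable names are only illustrative.

```python
# Figures copied from the ping flood output in this post.
packets = 1_422_309
seconds = 21.131114
kbytes_per_s = 4038.525
flood_irq_per_s = 250          # approximate, as reported

pps = packets / seconds                         # packets per second
mbit_per_s = kbytes_per_s * 8 / 1000            # assumes 1 kbyte = 1000 bytes
bytes_per_packet = kbytes_per_s * 1000 / pps    # payload counted by the tool
packets_per_interrupt = pps / flood_irq_per_s   # how much work each interrupt covers

print(f"{pps:,.0f} packets/s")
print(f"{mbit_per_s:.1f} Mbit/s")
print(f"{bytes_per_packet:.1f} bytes/packet")
print(f"{packets_per_interrupt:.0f} packets per interrupt")
```

That works out to about 67.3k packets per second and roughly 270 packets handled per interrupt, which suggests the NIC and driver are batching heavily rather than interrupting per packet.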

                      • H Offline
                        Harvy66
                        last edited by

                        I just realized that when ping flooding my firewall, the only thing that changed was the IRQ CPU time. I looked up ICMP and it seems the ICMP response is built right into the network stack. From what I can tell, the ICMP responses are being handled on that same realtime kernel thread. If this is the case and my admin interface stopped responding to pings because of load, that means the realtime thread on a different interface couldn't even run.

                        One of the features of MSI-X is interrupt masks. When a "hard" interrupt occurs, the current context thread gets interrupted and the real-time thread gets scheduled. Then the thread can set a mask and block all other interrupts while it processes all of its data. Of course, blocking interrupts is bad if you don't have a way to indicate there is new work, so there are "soft" interrupts. If the hardware supports it, the hardware can flag a shared memory location to say that new data is ready. When the current thread is done processing its current data, it can do one last check to see if the soft interrupt was signaled. If not, it unmasks the interrupts and returns. If it was flagged, it continues processing until all the work is done and no more soft interrupts have been signaled.

                        It may be possible that the WAN interface is in a constant state of backlog and the real-time kernel thread never unschedules because of that backlog, starving my admin interface of CPU time.
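A toy model of the mask, drain, re-check, unmask loop described above. This is only a simulation to illustrate the idea, not how any FreeBSD driver is actually written; `ToyNic`, `make_handler` and the packet names are all invented for the example.

```python
from collections import deque


class ToyNic:
    """Toy model of a NIC receive ring with a maskable interrupt (not a driver API)."""

    def __init__(self):
        self.ring = deque()      # packets waiting to be processed
        self.masked = False      # True while the handler is draining the ring
        self.interrupts = 0      # number of "hard" interrupts raised

    def receive(self, packet, handler):
        """Queue a packet; raise an interrupt only if interrupts are unmasked."""
        self.ring.append(packet)
        if not self.masked:
            self.interrupts += 1
            handler(self)


def make_handler(processed, arriving):
    """Build a handler that drains the ring while `arriving` keeps feeding it."""

    def handler(nic):
        nic.masked = True                      # block further hard interrupts
        while nic.ring:                        # the "soft" check: more work queued?
            processed.append(nic.ring.popleft())
            if arriving:                       # a packet lands while we are busy
                nic.receive(arriving.popleft(), handler)
        nic.masked = False                     # ring empty: unmask and return

    return handler


processed = []
arriving = deque(f"pkt{i}" for i in range(1, 1000))  # backlog arriving during service
nic = ToyNic()
nic.receive("pkt0", make_handler(processed, arriving))
print(f"interrupts={nic.interrupts} packets={len(processed)}")
```

In this run a single interrupt ends up covering all 1000 packets. The flip side is that if `arriving` never ran dry, the loop would never exit, which is the starvation scenario suggested above.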

                        With that in mind, some basic cheap nics ie realtek might actually be less hassle compared to say an intel nic when solving this sort of problem

                        With a minimum ping of 0.003ms and an average of 0.176, yet only ~250 interrupts per second, the i350 NIC is doing some nifty magic.

                        • S Offline
                          Supermule Banned
                          last edited by

                          I will test with some offloading and other tunables in pfsense later when online again.

                          It's SYN packets that are spoofed and of various sizes. Most of them don't have the ACK the FW needs, and the states remain open.

                          It's easy to fend off an ICMP flood since it's predictable traffic. In my case, when 1 core hits 100% the FW goes offline and packet loss occurs. I can't see what that specific CPU is doing, and it would be interesting to dig deeper into that and into which process consumes the CPU. It doesn't happen when there is no port forward enabled, but as soon as it routes, it goes ballistic.

                          Why can 1 core hold back everything else??
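A back-of-envelope sketch of why half-open states pile up: in steady state the number of entries is roughly arrival rate times how long each entry lives. Everything here is an assumption for illustration: the 3 Mbit/s rate is the figure quoted elsewhere in this thread, 60 bytes per SYN is a guess (the real packets vary in size), and 120 s / 30 s are what I recall as pf's stock tcp.first and tcp.opening timeouts, so check your own settings.

```python
def syn_pps_from_bandwidth(mbit_per_s: float, bytes_per_packet: float) -> float:
    """Packets per second for a flood of fixed-size packets at a given bit rate."""
    return mbit_per_s * 1_000_000 / 8 / bytes_per_packet


def steady_state_entries(pps: float, timeout_s: float) -> float:
    """Entries alive at once if every packet creates a state that lasts timeout_s."""
    return pps * timeout_s


pps = syn_pps_from_bandwidth(3, 60)   # assumed: 3 Mbit/s of 60-byte SYNs
for name, timeout_s in (("tcp.first", 120), ("tcp.opening", 30)):
    entries = steady_state_entries(pps, timeout_s)
    print(f"{name}={timeout_s}s -> about {entries:,.0f} open states at {pps:,.0f} pps")
```

Under those assumptions a few Mbit/s of spoofed SYNs is enough to hold hundreds of thousands of states open, before any adaptive timeout scaling kicks in.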

                          • S Offline
                            Supermule Banned
                            last edited by

                            procstat -ka from Command Prompt during a SYN attack.

                            No packet loss this time. No port forward.

                            procstat.PNG
                            • S Offline
                              Supermule Banned
                              last edited by

                              Same but with port forward.

                              procstat.PNG
                              • F Offline
                                firewalluser
                                last edited by

                                @Supermule:

                                I will test with some offloading and other tunables in pfsense later when online again.

                                It's SYN packets that are spoofed and of various sizes. Most of them don't have the ACK the FW needs, and the states remain open.

                                It's easy to fend off an ICMP flood since it's predictable traffic. In my case, when 1 core hits 100% the FW goes offline and packet loss occurs. I can't see what that specific CPU is doing, and it would be interesting to dig deeper into that and into which process consumes the CPU. It doesn't happen when there is no port forward enabled, but as soon as it routes, it goes ballistic.
                                Why can 1 core hold back everything else??

                                Software was always written for single-core CPUs; programmers & designers never thought we'd get multicore CPUs in this timeframe or at as little cost as we have, so, just like the Y2K bug, there was no planning for the future.

                                Now a multi-core CPU still has to share resources, like L2 cache, hard disks and RAM. You can't have two cores working on the same shared resource at the same time without locking, so one core ends up waiting on the other. So the programmer needs to make decisions about what is acceptable to offload to another core and what is not.

                                If you want speed, keep as much as possible in a tight loop on one core; if you want it to multitask at the expense of speed, offload more work to other cores, knowing that too much swapping between cores increases the lock time, and in the extreme the lock time could be greater than the processing time, ergo nothing is achieved.

                                Although this is framed from a software perspective, the points about multithreading are still relevant at the hw level; it's just a different level of abstraction.
                                https://forum.pfsense.org/index.php?topic=91856.msg517843#msg517843

                                • S Offline
                                  Supermule Banned
                                  last edited by

                                  Yes… but why can other firewalls keep up with the traffic and pf can't?

                                  If hardware was the limit here, then all those tested should suffer the same fate. They don't...

                                  Fortigate's VMware appliance and Mikrotik handle the same traffic with no issues. Tell me why.... on the same resources and in the same hypervisor.

                                  • F Offline
                                    firewalluser
                                    last edited by

                                    @Harvy66:

                                    I just realized that when ping flooding my firewall, the only thing that changed was the IRQ CPU time. I looked up ICMP and it seems the ICMP response is built right into the network stack. From what I can tell, the ICMP responses are being handled on that same realtime kernel thread. If this is the case and my admin interface stopped responding to pings because of load, that means the realtime thread on a different interface couldn't even run.

                                    One of the features of MSI-X is interrupt masks. When a "hard" interrupt occurs, the current context thread gets interrupted and the real-time thread gets scheduled. Then the thread can set a mask and block all other interrupts while it processes all of its data. Of course, blocking interrupts is bad if you don't have a way to indicate there is new work, so there are "soft" interrupts. If the hardware supports it, the hardware can flag a shared memory location to say that new data is ready. When the current thread is done processing its current data, it can do one last check to see if the soft interrupt was signaled. If not, it unmasks the interrupts and returns. If it was flagged, it continues processing until all the work is done and no more soft interrupts have been signaled.

                                    It may be possible that the WAN interface is in a constant state of backlog and the real-time kernel thread never unschedules because of that backlog, starving my admin interface of CPU time.

                                    With that in mind, some basic cheap nics ie realtek might actually be less hassle compared to say an intel nic when solving this sort of problem

                                    With a minimum ping of 0.003ms and an average of 0.176, yet only ~250 interrupts per second, the i350 NIC is doing some nifty magic.

                                    Having some devices do some of the work can be useful, but I suspect it's also helping create the problem being seen in this instance.

                                    http://en.wikipedia.org/wiki/Message_Signaled_Interrupts#MSI-X
                                    https://forums.freebsd.org/threads/msi-msi-x-on-intel-em-nic.27736/
                                    http://people.freebsd.org/~jhb/papers/bsdcan/2007/article/node8.html

                                    These might be useful, as they look at possible areas for MSI-X failures.
                                    http://comments.gmane.org/gmane.os.freebsd.stable/71699
                                    http://christopher-technicalmusings.blogspot.co.uk/2012/12/passthrough-pcie-devices-from-esxi-to.html

                                    • F Offline
                                      firewalluser
                                      last edited by

                                      @Supermule:

                                      Yes… but why can other firewalls keep up with the traffic and pf can't?

                                      Out of the box or have they been tuned?

                                      If hardware was the limit here, then all those tested should suffer the same fate. They don't…

                                      Fortigate's VMware appliance and Mikrotik handle the same traffic with no issues. Tell me why.... on the same resources and in the same hypervisor.

                                      Tuning?

                                      What makes pfsense's default settings suitable for your datacentre setup compared to my home use setup?

                                      It's like tuning an F1 race car: it's not going to do well in a World Rally Championship setting, on snow, across deserts or in forests, is it? Likewise, a rally car is not going to do so well on a race track against F1 cars, is it?

                                      • S Offline
                                        Supermule Banned
                                        last edited by

                                        Out of the box.

                                        Tuning, no… It should default to handling the traffic it receives and shouldn't fail on 3mbit/s of traffic.

                                        You can't compare an F1 car to a WRC car. It's like comparing apples to bananas.

                                        The only thing they have in common is that they are fruits :D

                                        Same with the firewalls. They handle traffic and block it if needed. Default behaviour. EOD.

                                        • S Offline
                                          Supermule Banned
                                          last edited by

                                          A little weird thing I noticed…

                                          I tried running the VM with an odd number of cores.

                                          It didn't change a bit, but the behaviour did. 2 of the cores keep switching between 0% and 100% after the attack was stopped, but neither of them is at 0% while the other one is...

                                          Anybody care to explain why?

                                          I haven't seen this before, and it's like it won't let go.

                                          vmware.PNG
                                          • F Offline
                                            firewalluser
                                            last edited by

                                            Look at the code in the OS that determines what to assign to the available cores.

                                            The OS might be "smart" enough to work out which cores are already under load and also under load for long periods of time, and thus it might assign other cores to be used.

                                            This is Windows thread scheduling: https://msdn.microsoft.com/en-us/library/ms685100%28VS.85%29.aspx

                                            FreeBSD:
                                            https://calomel.org/freebsd_network_tuning.html
                                            Things you need to look at include processor affinity & thread scheduling as mentioned in the link above and some of it shown below.

                                            "######################################### net.isr. tuning begin ##############

                                            NOTE regarding "net.isr.*" : Processor affinity can effectively reduce cache problems but it does not curb the persistent load-balancing problem.[1] Processor affinity becomes more complicated in systems with non-uniform architectures. A system with two dual-core hyper-threaded CPUs presents a challenge to a scheduling algorithm. There is complete affinity between two virtual CPUs implemented on the same core via hyper-threading, partial affinity between two cores on the same physical chip (as the cores share some, but not all, cache), and no affinity between separate physical chips. It is possible that net.isr.bindthreads="0" and net.isr.maxthreads="3" can cause more slowdown if your system is not cpu loaded already. We highly recommend getting a more efficient network card instead of setting the "net.isr.*" options. Look at the Intel i350 for gigabit or the Myricom 10G-PCIE2-8C2-2S for 10gig. These cards will reduce the machines nic processing to 12% or lower."

                                            This might also be useful, albeit for an earlier version of FreeBSD.
                                            http://www.icir.org/gregor/tools/pthread-scheduling.html

                                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.