Playing with fq_codel in 2.4

Pentangle

@dtaht Porting it into pfsense as a package would be nice too.

dtaht

@pentangle It's in openbsd... please point a freebsd packager at it. https://repology.org/metapackage/flent/versions

Pentangle

@dtaht Unfortunately I'm one of those Windows experts who has little or no knowledge of BSD aside from what the GUI gives. I'll leave that to someone competent in it.

sciencetaco

@dtaht said in Playing with fq_codel in 2.4:

@pentangle It's in openbsd... please point a freebsd packager at it. https://repology.org/metapackage/flent/versions

ahh, this would be ideal. The only other hardwired hosts I have to test from are either raspberry pi or an ancient mac mini - pretty sure the mini is the reason my flent tests show such a different picture than when i test using netperf on the pfsense router.

here was my latest: charter can't tell me what i'm provisioned for, so i don't know if i have 400mb or 300mb, so i adjusted my shaper down to 290 from 390:

flent rrul run

dtaht

@sciencetaco Is that a live link? What's your "before" result? That would give you an exact idea as to what you are getting from them, and an even more exact would be running the tcp_download test instead of rrul. What's a dslreports on a before setting? You can try watching your link quality on the cablemodem itself also. Bad SNR there can be a problem.

(helps to have the flent.gz files)

Still don't know why you aren't getting close to the set rate. (can the limiter burst size be tuned?) Another thing to try is setting the codel algo to kick in (for test purposes) much later, like target 30ms, but I'm still pointing fingers at the limiter/cpu context switch latency, out of cpu, etc.........

dtaht

Going back to my alice's restaurant quote (which is an arcane bit of americana - you can lose 20 minutes to this https://www.youtube.com/watch?v=W5_8U4j51lI&start_radio=1&list=RDW5_8U4j51lI , easily, it's a thanksgiving tradition here)

(I think y'all are mostly in england or the continent?)

and being sad about how few views the various core bufferbloat presos jim, I and others have done ( https://www.youtube.com/results?search_query=bufferbloat ) - even the highly entertaining one by stephen hemminger ( https://www.youtube.com/watch?v=y5KPryOHwk8&t=828s ) only has 3k views (take 10 minutes out of your life, it's great, but not as great as alice's restaurant)

and seeing https://www.youtube.com/results?search_query=fq_codel+pfsense have 30k views...

I'd love to try to get some of what we've discussed and solved above with flent into a script and talk with a good presenter like lawrence or Mark Furneaux to get more folk to pay attention....

video is not my forte. But you can get anything you want at alice's restaurant.

sciencetaco

@dtaht said in Playing with fq_codel in 2.4:

@sciencetaco Is that a live link? What's your "before" result? That would give you an exact idea as to what you are getting from them, and an even more exact would be running the tcp_download test instead of rrul. What's a dslreports on a before setting? You can try watching your link quality on the cablemodem itself also. Bad SNR there can be a problem.

(helps to have the flent.gz files)

Still don't know why you aren't getting close to the set rate. (can the limiter burst size be tuned?) Another thing to try is setting the codel algo to kick in (for test purposes) much later, like target 30ms, but I'm still pointing fingers at the limiter/cpu context switch latency, out of cpu, etc.........

link is live, only traffic is 1 streaming client.

I disbled my limiters and ran a few tests, as you suggested. Here are the outputs:

tcp_down limited:

tcp_down limit

rrul46 without a limiter:

rrul46 no limit

tcp_down no limit:

tcp_download no limit

i really appreciate everyone that's taken the time to take a look at my output and offer suggestions. thank you all.
flent .gz files: (4 of them in the tarball)
0_1539269433292_flent.tar

dtaht

@sciencetaco your limited and unlimited tcp_download plots are essentially the same. latency is good. Sure you turned it off? Otherwise your latency is not bad

Otherwise the fact that four flows do a bit better than one implies that we have a flow per CMTS channel (take a look at your cmts config and see how many channels are operational), or a limit on my tcp stack on this server (I usually test with four flows) or your receive window... (which btw, I'd not considered before on this thread - I can't do much testing of the real world at real rtts outside the lab. At short RTTs I can certainly do 1gb on various hardware)

There's some hint that powerboost might be on with charter (first 20 sec solid then a major drop (yea, optimizing for speedtest)).

but heck, rather than fiddle with the tcp config, try killing it with a hammer. try using

--te=download_streams=16 tcp_ndown

with the limiter on and off. Cool that you just tested both ipv6 and ipv4 at the same time, also. I kind of hope we end up with a "superscript" that has all 10 tests nicely done in a row... against someone elses server.

The totals plot is much easier to deal with.

thx for sharing the flent data! However I meant to go to bed about 8 hours ago... bbl

gsakes

@ dtaht Here are the results with ack filter on the outbound interface, and ecn enabled:

alt text

dtaht

@gsakes That result is a thing of beauty. If only the whole internet worked like that.

gsakes

@dtaht lol - i must be reading that wrong - that result is actually better?

dtaht

@gsakes well post the flent files? and tc -s stats? to my eye - tcp up/download occilations are less, bandwidth ruler flat, uplink bandwidth improved by a few percent, and induced latency cut from 5ms to about 4. Love to do a comparison plot.

Another flent trick is you can lock the scaling to one plot when flipping between them, so your first plot had more occilations than you see here. I think... let me scroll back

5 years of work for those few percentage points compared to the orders of magnitude win fq_codel was... but I'll take it, and frame it.

gsakes

@dtaht Yes- frame it - it was worth it, no - really. Ok, here's without ack-filter, and ecn, I'll post those flent files next:

alt text

dtaht

a comparison plot is easier if you make your -t 'text-to-be-included-in-plot' actually meaningful. :)

gsakes

Flent files for ack-filter/ecn:

0_1539273758800_rrul-2018-10-11T155505.344180.text-to-be-included-in-plot.flent.gz

gsakes

@dtaht lol - ok - if you don't mind, give me the command line for what you want me to run, it'll be easier than for me looking through the gazillion test/combinations/options. Sure, text is easy, but let me know what other tests you want me to run:)

dtaht

I can fix it in post, but for a future run of any given new thing, it helps minute, hours, or months later if you run it with a meaningful title...

-t 'gsakes_cake_noecn_noack-filter'

or use the --notes option so you capture more of the basic test parameters.

I'm personally sitting on top of a few hundred thousand rrul tests....

Anyway, would love the prior noecn noackfilter test flent file before I try to get to bed

gsakes

@dtaht said in Playing with fq_codel in 2.4:

gsakes_cake_noecn_noack-filter'

Sure thing:
0_1539274464800_rrul-2018-10-11T161222.683165.gsakes_cake_noecn_noack-filter.flent.gz

gsakes

@dtaht (facepalm) - Oh, I see - Flent's got a rather nice gui, now it makes sense:)

dtaht

going back to the fios example earlier. having fq_codel shaping at 48mbit appears to get more bandwidth than fios with 120ms worth of buffering at 50mbit does. Why? You get a drop, it takes 120ms to start recovery, you get a drop at the end of that recovery and the process repeats. Someone tell FIOS that engineering their fiber network to speedtest only works for 20 seconds.....

It's kind of hard to see at this resolution but the green line's median and whiskers are locked solid at 48mbit while the other... (btw, I should go pull the actual data transferred out of flent, I didn't, but you can't actually trust it due to 120ms worth of occilation, anyway, but improved bandwidth here is real. As is the lowered latency.

0_1539274813463_rrul_-_c2558_pfsense2.4.4_u10mbit_d48.5mbit_fios_shaped.png