Qotom J1900 4-port mini-PC getting WAN "errors in" on a 1 Gbps fiber connection
-
The CPU in my box seems to be a little more powerful than in yours
I don't think that's true in general. The one edge it has is AES-NI but that doesn't matter unless you're comparing crypto performance. The Intel CPU has higher clock speed (both base and turbo) and will at least match the AMD in IPC, if not exceed it. That alone can explain the difference in LAN to WAN throughput.
-
You may want to read a review of my box on AnandTech and also run a comparison of these two CPUs using one (or more) of the sites that offer that information.
-
Some tunables here: https://ashbyte.com/ashbyte/wiki/pfSense/Tuning
-
I have tried the suggested tunables and I am still getting errors, with only about 850 Mbps max throughput across the WAN. The CPU spikes to about 70% at most, and it's hardly using any RAM.
I suspect that if I can eliminate the errors, I'd see a marked increase in throughput.
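For reference, I'm watching the error counters from the shell with plain netstat (em0 as the WAN interface name is specific to my box):
# one-shot view of per-interface packet and error counters
netstat -i
# watch the WAN NIC live, refreshing every second
netstat -I em0 -w 1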
-
Did you configure flow control in pfSense? If so, you also need to enable flow control on the LAN switch port connected to the pfSense LAN interface. With flow control configured in pfSense (don't forget to reboot pfSense), pfSense will send flow control messages to the upstream switch port to pause incoming traffic, so the switch port must understand and obey those messages. This will stop the input errors on the LAN interface. In my case, once I configured flow control in pfSense, input errors also stopped incrementing on the pfSense WAN interface. pfSense is currently still in the lab environment being tested, so a Mac Mini is connected directly to the WAN interface for running iperf3; macOS apparently has flow control enabled by default. I did, however, have to enable flow control on the Cisco switch port connected to the pfSense LAN interface before the input errors on the pfSense LAN interface stopped.
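For what it's worth, here is roughly what that looks like on the pfSense side; dev.em.0 matches the OP's NIC and is an assumption for anyone else's box, and the value should also be persisted under System > Advanced > System Tunables so it survives a reboot:
# check the NIC's current flow-control mode
# (0 = off, 1 = rx pause only, 2 = tx pause only, 3 = full)
sysctl dev.em.0.fc
# enable full flow control on that NIC
sysctl dev.em.0.fc=3
On the Cisco side, the matching piece is "flowcontrol receive on" on the switch port facing the pfSense LAN interface.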
As for CPU utilization: when I run iperf3 through pfSense, CPU utilization on the box spikes to about 50%. When I run iperf3 against the pfSense LAN interface IP itself, however, it spikes to about 70%.
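For context, the two tests were along these lines (the addresses are placeholders for my lab setup, and the second test assumes an iperf3 server running on pfSense itself):
# server on the Mac Mini connected directly to the WAN interface
iperf3 -s
# routed test: LAN host through pfSense to the Mac Mini
iperf3 -c 203.0.113.10 -t 30
# local test: same LAN host straight at the pfSense LAN interface IP
iperf3 -c 192.168.1.1 -t 30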
In my opinion, you are getting good throughput. You need to eliminate the input errors, though.
-
I have no control over the upstream flow control settings: the WAN is an Ethernet handoff from the demarc box containing the ISP-supplied fiber-to-Ethernet media converter. All of my errors show up as "in" errors on the WAN interface.
On my old hardware I do not have to rely on flow control; I routinely see over 930 Mbps throughput with no tuning.
-
I have no control over the upstream flow control settings: the WAN is an Ethernet handoff from the demarc box containing the ISP-supplied fiber-to-Ethernet media converter. All of my errors show up as "in" errors on the WAN interface.
On my old hardware I do not have to rely on flow control; I routinely see over 930 Mbps throughput with no tuning.
I guess this is the price you pay for a cheap box to run pfSense. The weak CPU cannot keep up with the input packets flooding the input buffers of the gigabit NICs. I have the same problem with my Fitlet box and am considering returning it and getting a Check Point instead, since it seems a Check Point with similar performance would cost about the same.
-
You may want to read a review of my box on AnandTech and also run a comparison of these two CPUs using one (or more) of the sites that offer that information.
I did. Not trying to impugn the device, just suggesting why the OP may be getting higher throughput (errors notwithstanding). I'm in a similar boat: I'm running a low-power AMD SoC and even LAN-to-LAN throughput seems suspiciously low, at around 700 Mbps with iperf. Even my lowly SheevaPlug (a 1.2 GHz single-core ARM chip from 2009) can do that. Fortunately my WAN speed is low enough that I don't have to deal with what you guys are running up against. Best of luck getting it figured out.
-
I did a sysctl -a on the em.0 device and have been noticing these two counters:
dev.em.0.mac_stats.recv_no_buff: 4876
dev.em.0.mac_stats.missed_packets: 2019
These suggest the incoming packets may be flowing in too fast for this unit to handle. In the dashboard, the interface statistics report dev.em.0.mac_stats.missed_packets for the interface.
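I've been re-checking them like this to confirm they are still incrementing (standard sysctl usage; the OIDs are the ones shown above):
# query both counters in one shot
sysctl dev.em.0.mac_stats.recv_no_buff dev.em.0.mac_stats.missed_packets
# or poll one of them every 5 seconds during an iperf3 run
while true; do sysctl dev.em.0.mac_stats.missed_packets; sleep 5; done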
Any suggestions on how to tune these buffers?
Here is the full output of sysctl -a | grep em.0:
dev.em.0.wake: 0
dev.em.0.interrupts.rx_overrun: 0
dev.em.0.interrupts.rx_desc_min_thresh: 0
dev.em.0.interrupts.tx_queue_min_thresh: 0
dev.em.0.interrupts.tx_queue_empty: 0
dev.em.0.interrupts.tx_abs_timer: 8
dev.em.0.interrupts.tx_pkt_timer: 2
dev.em.0.interrupts.rx_abs_timer: 0
dev.em.0.interrupts.rx_pkt_timer: 690
dev.em.0.interrupts.asserts: 2858628
dev.em.0.mac_stats.tso_ctx_fail: 0
dev.em.0.mac_stats.tso_txd: 0
dev.em.0.mac_stats.tx_frames_1024_1522: 2879367
dev.em.0.mac_stats.tx_frames_512_1023: 33878
dev.em.0.mac_stats.tx_frames_256_511: 47852
dev.em.0.mac_stats.tx_frames_128_255: 59479
dev.em.0.mac_stats.tx_frames_65_127: 2185545
dev.em.0.mac_stats.tx_frames_64: 210228
dev.em.0.mac_stats.mcast_pkts_txd: 5
dev.em.0.mac_stats.bcast_pkts_txd: 42
dev.em.0.mac_stats.good_pkts_txd: 5416349
dev.em.0.mac_stats.total_pkts_txd: 5416349
dev.em.0.mac_stats.good_octets_txd: 4593379613
dev.em.0.mac_stats.good_octets_recvd: 11719132980
dev.em.0.mac_stats.rx_frames_1024_1522: 7626467
dev.em.0.mac_stats.rx_frames_512_1023: 72768
dev.em.0.mac_stats.rx_frames_256_511: 54886
dev.em.0.mac_stats.rx_frames_128_255: 94095
dev.em.0.mac_stats.rx_frames_65_127: 1216069
dev.em.0.mac_stats.rx_frames_64: 157708
dev.em.0.mac_stats.mcast_pkts_recvd: 0
dev.em.0.mac_stats.bcast_pkts_recvd: 1428
dev.em.0.mac_stats.good_pkts_recvd: 9221993
dev.em.0.mac_stats.total_pkts_recvd: 9224012
dev.em.0.mac_stats.xoff_txd: 0
dev.em.0.mac_stats.xoff_recvd: 0
dev.em.0.mac_stats.xon_txd: 0
dev.em.0.mac_stats.xon_recvd: 0
dev.em.0.mac_stats.coll_ext_errs: 0
dev.em.0.mac_stats.alignment_errs: 0
dev.em.0.mac_stats.crc_errs: 0
dev.em.0.mac_stats.recv_errs: 0
dev.em.0.mac_stats.recv_jabber: 0
dev.em.0.mac_stats.recv_oversize: 0
dev.em.0.mac_stats.recv_fragmented: 0
dev.em.0.mac_stats.recv_undersize: 0
dev.em.0.mac_stats.recv_no_buff: 4876
dev.em.0.mac_stats.missed_packets: 2019
dev.em.0.mac_stats.defer_count: 0
dev.em.0.mac_stats.sequence_errors: 0
dev.em.0.mac_stats.symbol_errors: 0
dev.em.0.mac_stats.collision_count: 0
dev.em.0.mac_stats.late_coll: 0
dev.em.0.mac_stats.multiple_coll: 0
dev.em.0.mac_stats.single_coll: 0
dev.em.0.mac_stats.excess_coll: 0
dev.em.0.queue_rx_0.rx_irq: 0
dev.em.0.queue_rx_0.rxd_tail: 873
dev.em.0.queue_rx_0.rxd_head: 874
dev.em.0.queue_tx_0.no_desc_avail: 0
dev.em.0.queue_tx_0.tx_irq: 0
dev.em.0.queue_tx_0.txd_tail: 801
dev.em.0.queue_tx_0.txd_head: 801
dev.em.0.fc_low_water: 16932
dev.em.0.fc_high_water: 18432
dev.em.0.rx_control: 67141658
dev.em.0.device_control: 1074790984
dev.em.0.watchdog_timeouts: 0
dev.em.0.rx_overruns: 3
dev.em.0.tx_dma_fail: 0
dev.em.0.mbuf_defrag_fail: 0
dev.em.0.link_irq: 0
dev.em.0.dropped: 0
dev.em.0.eee_control: 1
dev.em.0.rx_processing_limit: 100
dev.em.0.itr: 488
dev.em.0.tx_abs_int_delay: 66
dev.em.0.rx_abs_int_delay: 66
dev.em.0.tx_int_delay: 66
dev.em.0.rx_int_delay: 0
dev.em.0.fc: 3
dev.em.0.debug: -1
dev.em.0.nvm: -1
dev.em.0.%parent: pci1
dev.em.0.%pnpinfo: vendor=0x8086 device=0x150c subvendor=0x8086 subdevice=0x0000 class=0x020000
dev.em.0.%location: pci0:1:0:0 handle=\_SB_.PCI0.RP01.PXSX
dev.em.0.%driver: em
dev.em.0.%desc: Intel(R) PRO/1000 Network Connection 7.6.1-k
-
I did a sysctl -a on the em.0 device and have been noticing these two counters:
dev.em.0.mac_stats.recv_no_buff: 4876
dev.em.0.mac_stats.missed_packets: 2019
These suggest the incoming packets may be flowing in too fast for this unit to handle. In the dashboard, the interface statistics report dev.em.0.mac_stats.missed_packets for the interface.
Any suggestions on how to tune these buffers?
I've used these settings in the /boot/loader.conf.local file.
kern.ipc.nmbclusters=1000000
hw.pci.enable_msix=0
hw.igb.fc_setting=2
hw.igb.rxd=4096
hw.igb.txd=4096
You must reboot your pfSense box after adding these settings and saving the file.
The hw.igb.rxd=4096 setting (after I rebooted the pfSense box) eliminated the "recv_no_buff" errors.
For your NICs you will have to replace .igb. with .em. in the settings above, as shown below.
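For an em NIC, that would look something like this; a sketch only, since I'm not sure hw.pci.enable_msix=0 is needed on your board or that hw.igb.fc_setting has a direct em equivalent, so verify each tunable exists on your build before relying on it:
# /boot/loader.conf.local, em(4) equivalents of the settings above
kern.ipc.nmbclusters=1000000
hw.em.rxd=4096
hw.em.txd=4096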
-
Well, I gave that a shot (replacing igb with em) and didn't see much of an improvement.
However, I noticed that when I enable TRIM on my SSD, the WAN "errors in" seem to decrease somewhat, but they are still there. Any other suggestions for tuning this thing?