pfSense 22.05 breaks VLANS, restoring pfSense 22.01 fixes the issue
-
@stephenw10
Sorry to bother you, did you had time to look over those files? Or, more time is needed? I can wait, no worries.
Thank you -
Sorry I haven't had time yet. I should be able to review that shortly....
-
@stephenw10
take your time
Thank you -
@stephenw10
Investigating the config file was indeed a good idea, and, in the end I found the culprit.
It's not pfSense, it's the DriverIn my config, I have done some NIC optimization a while ago, and for that I used shellcmd, to run it after boot.
You can see it here:
<shellcmd>ifconfig ix2 -txcsum -rxcsum -tso4 -tso6 -lro -txcsum6 -rxcsum6 -vlanhwtso -vlanhwfilter -vlanhwtag -vlanhwcsum</shellcmd>
Now, to explain why I did the above.
In 2.6.0 or earlier, I don't remember exactly,(at least for my driver) Suricata did not pass traffic for VLANs, enabled on the parent interface. In order to solve that I wrote the above.In 22.05 though, I had to add "vlanhwtag" again, otherwise even with Suricata uninstalled, no VLANs traffic will pass through.
I played with others tags, the only one that I must leave enabled is "vlanhwtag".
So the issues with the VLANs are solved, but now I cannot run Suricata in Inline mode with "vlanhwtag" enabled,(it will not pass traffic) or at least that is how it worked for me in previous versions.
Many thanks to @johnpoz and @stephenw10 for investigating with me.
-
Huh, that's fascinating. So as long as you do not disable hardware VLAN tagging on ix2 the VLANs work? And presumably a pcap no longer shows VLAN0 tagged traffic?
That's almost the opposite of what I'd expect....
I was never able to replicate it here.
-
@stephenw10 said in pfSense 22.05 breaks VLANS, restoring pfSense 22.01 fixes the issue:
Huh, that's fascinating. So as long as you do not disable hardware VLAN tagging on ix2 the VLANs work? And presumably a pcap no longer shows VLAN0 tagged traffic?
That's almost the opposite of what I'd expect....
I was never able to replicate it here.
The results are like this:
- Disable "vlanhwtag" - the VLAN traffic will cease almost immediately, and also the pcap shows VLAN0 again.
- Enable "vlanhwtag" - the VLAN traffic will work, and pcap will not show any VLAN0 anymore.
In 22.01 this did not happen with or without "vlanhwtag" .
The only inconvenient now is that I cannot run Suricata in Inline mode. I mean, I can, but the traffic for VLANS will not work. Only in legacy mode.
Other than that no complaints.
-
Wow. Boom!
20:30:18.349238 00:51:82:11:22:02 > ff:ff:ff:ff:ff:ff, ethertype 802.1Q (0x8100), length 68: vlan 0, p 0, ethertype 802.1Q, vlan 1001, p 0, ethertype ARP, Ethernet (len 6), IPv4 (len 4), Request who-has 10.101.0.1 tell 10.101.0.12, length 46
That's crazy! Time to dig deeper...
-
@stephenw10 so you were able to duplicate it then?
-
Yup. Bizarre. I would have sworn it wasn't real but...
-
@stephenw10 well guess you got a new "toy" to play with then - figuring out.. Can't wait to hear what you discover on what/how/why this happens..
So does this happen on other then ix nics? Could I remove the vlanhw setting on say an interface on my sg4860 and see this happening with vlan0? I assume you have already tried that?
-
Not yet, feel free
The same patch or something close to it went into all the Intel drivers so...possibly. Though it seems unlikely no-one would have hit it in igb.Edit: igb in current 2.7 snapshots appears unaffected.
-
@stephenw10 said in pfSense 22.05 breaks VLANS, restoring pfSense 22.01 fixes the issue:
Not yet, feel free
The same patch or something close to it went into all the Intel drivers so...possibly. Though it seems unlikely no-one would have hit it in igb.Edit: igb in current 2.7 snapshots appears unaffected.
Wow, I'm glad that this ended up with a defect. It took a while :) Do you think this is something that can be fixed by Netgate or we should wait for an upstream fix?
-
Almost certainly upstream. Need to confirm it still exists in FreeBSD head first though.
-
@stephenw10 said in pfSense 22.05 breaks VLANS, restoring pfSense 22.01 fixes the issue:
Almost certainly upstream. Need to confirm it still exists in FreeBSD head first though.
Thanks again for the support, let's hope it will not take 5 years :) . Yes I know, it's out of your hands.
-
Mmm, this is the most interesting issue I've seen in a while!
You need to disable vlan hardware tagging for Suricata to be able to see the traffic? Is that is you run Suricata on the parent NIC only?
-
@stephenw10 said in pfSense 22.05 breaks VLANS, restoring pfSense 22.01 fixes the issue:
Mmm, this is the most interesting issue I've seen in a while!
You need to disable vlan hardware tagging for Suricata to be able to see the traffic? Is that is you run Suricata on the parent NIC only?
Exactly, I'll try to explain better.
For 22.01 I found a workaround in order for the Suricata package to not break VLAN traffic.For that I disabled the vlanhwtag.
If I left vlanhwtag enabled, only traffic from the parrent interface will pass.
Suricata is enabled only on the parent interface, as @bmeeks recommends.
On 22.05 I cannot apply the same workaround anymore, due to, when I disable vlanhwtag, that VLAN0 will appear and VLANs traffic will cease, even with Suricata uninstalled.
-
@stephenw10
I saw you commented on this defect
https://redmine.pfsense.org/issues/13381that you tested on FreeBSD 14
The behavior is the same?
-
It is the same in FreeBSD 14 snapshots yes. We are looking at the best way to handle it.
-
@stephenw10
Was this https://reviews.freebsd.org/D36139
rejected and closed or rejected and closed ?
Just wanted to make sure it wasn't rejected and closed.
Thank you -
@nrgia Our redmine is still open and active.