peterdh44,
We run multiple PF boxes at work (A Data Center) and about a dozen pair of PFSense boxes. On similar hardware a PFSense box has similar throughput. But it has a VERY nice GUI. PF is designed to be more efficient with a Last Rule Matching ruleset. but that is generally not an easy rule set for people to think their way through. The rule sets in the GUI for PFSense are first rule matching, But either there is little performance advantage, or the PFSense kernel mods are such that the performance is made up.
From testing I can tell you with certainty that there is VERY little difference in performance, And the difference I do see is small enough to be non noticeable in a real world situation.
Basically, First rule matching is a better fit in this case. So the Quick option gets used.