pfSense 2.1 on VMware: high host CPU usage
-
I seem to have the same problem. 1.5 GHz in ESX, ~14% in pfSense, at about 1 Mbit/s (!) of traffic… :(
It happens to both VMs of an HA pair. Using Intel NICs, E1000 vNIC.
- What is the ESXi host machine and processor? Supermicro X8DTU / E5620
- Which version of pfSense and whether 32 or 64-bit? 64-bit
- How many vCPUs have you allocated to the VM? 1
- How much memory have you allocated to the VM? 1 GB
- Have you installed the pfSense packaged VM tools or the VMware-supplied tools? Open-VM-Tools
- Are you using the e1000 adapter type or something else? E1000
-
I have the same problem with pfsense 2.1.5 running on KVM on Ubuntu Server 14.04.
See attached screenshot with pfsense running top on the right and the host machine running the VMs on the left.
- What is the ESXi host machine and processor? Thinkserver TS140 / Intel Xeon CPU E3-1225 v3 @ 3.20GHz
- Which version of pfSense and whether 32 or 64-bit? pfsense 2.1.5 - 32-bit
- How many vCPUs have you allocated to the VM? 4
- How much memory have you allocated to the VM? 2 GB
- Have you installed the pfSense packaged VM tools or the VMware-supplied tools? No
- Are you using the e1000 adapter type or something else? E1000
-
I have the same issue running pfsense 2.1.5 under Proxmox (KVM) virtualization.
- What is the ESXi host machine and processor? PC Engines APU (AMD G-T40E, 2 x 1 GHz, 4 GB RAM), running Proxmox 3.2 on Debian 7
- Which version of pfSense and whether 32 or 64-bit? pfsense 2.1.5 - 64-bit
- How many vCPUs have you allocated to the VM? 2
- How much memory have you allocated to the VM? 2 GB
- Are you using the e1000 adapter type or something else? Tested all kinds of virtual NICs, including virtio
pfsense idle: while pfsense reports less than 10% on both CPUs, the host sees about 50-60% on both cores.
pfsense busy: while pfsense reports about 30% on both CPUs, the host sees about 70-80% on both cores. Throughput is limited to approx. 80 Mbit/s.
Other guests, such as a Debian installation, consume only 1-2% of host CPU when idle.
I also tried the latest 2.2 snapshot. The CPU consumption decreased to 20-30%, which is still too much but much better than 2.1.5. However, throughput was limited to ~40 Mbit/s, so this is not an option since my internet connection is 100 Mbit/s.
Another issue is that I have to emulate the CPU as a qemu64 CPU, because using "host" causes pfsense to crash during bootup (other guests are fine with the "host" option). I also had to turn off all kinds of checksum offloading to reach these throughputs. With checksum offloading enabled, the throughput is less than 1 Mbit/s.
I have no packages installed.
-
Just to help.
- What is the ESXi host machine and processor? Supermicro H8DCL / AMD 4386 / ESX 5.5.0 2068190
- Which version of pfSense and whether 32 or 64-bit? 32-bit - 2.1.5 (no tweak)
- How many vCPUs have you allocated to the VM? 1
- How much memory have you allocated to the VM? 1 GB
- Have you installed the pfSense packaged VM tools or the VMware-supplied tools? Open-VM-Tools
- Are you using the e1000 adapter type or something else? E1000
Idle: 371 MHz / 10% in the VMware Performance tab / 0% in the pfSense dashboard
High load (download at full speed): 5857 MHz / 100% in the VMware Performance tab / 98% in the pfSense dashboard
-
More to help:
- I have the same issue on several VMs on our ESXi 5.5 cluster.
- 100% CPU and only 3-5 Mbit/s of traffic.
- I also have a problem with OpenVPN: under heavy load (>200 Mbit/s) the IP stack crashes and I get DUP! ICMP pings.
- I am now installing a new pfSense 2.1.5 with a clean config and E1000 NICs.
- I will switch over the VM this evening.
- All VMs have 2 GB RAM and 1 vCPU.
- The new one is our border router with BGP, which should route 1000 Mbit/s.
- I will report the results next week.
- We have ESXi 5.5 U1.
Regards, Alexander
-
So here are the results of my test.
The current pfSense 2.1.5 definitely has a bug under VMware 5.5 U1.
In one test the failure occurred 2 minutes after the restart; I think the reason was the high traffic load (400 Mbit/s).
After some time something crashes and I get DUP! replies when I ping.
Throughput is then capped at about 100 Mbit/s on each interface.
I tested with 8 cores, then with 4 cores. My VM has 4 NICs, all Intel E1000.
Here is the ping:
PING 193.84.xxx.xxx (193.84.178.161) 56(84) bytes of data.
64 bytes from 193.84.xxx.xxx: icmp_seq=1 ttl=64 time=8.10 ms
64 bytes from 193.84.xxx.xxx: icmp_seq=1 ttl=64 time=8.10 ms (DUP!)
64 bytes from 193.84.xxx.xxx: icmp_seq=2 ttl=64 time=8.38 ms
64 bytes from 193.84.xxx.xxx: icmp_seq=2 ttl=64 time=8.38 ms (DUP!)
After a reboot of the pfSense VM everything is OK.
Now, after one day of tests, I have news.
I tested several variants with 8 cores, 6 cores and 4 cores, but only 2 cores are stable.
With 1 socket and 2 cores no error has occurred for 6 hours.
The performance is poor, but there are no errors. The CPU is constantly at 70%.
last pid: 12575; load averages: 0.10, 0.17, 0.19 up 0+06:28:28 16:14:20
98 processes: 3 running, 78 sleeping, 17 waiting
Mem: 64M Active, 25M Inact, 111M Wired, 408K Cache, 24M Buf, 7698M Free
Swap: 2048M Total, 2048M Free
PID USERNAME PRI NICE SIZE RES STATE C TIME WCPU COMMAND
11 root 171 ki31 0K 32K CPU0 0 351:45 82.96% [idle{idle: cpu0}]
11 root 171 ki31 0K 32K RUN 1 343:38 73.00% [idle{idle: cpu1}]
0 root -68 0 0K 224K - 1 40:29 13.96% [kernel{em0 taskq}]
0 root -68 0 0K 224K - 0 30:51 10.99% [kernel{em4 taskq}]
12 root -32 - 0K 272K WAIT 0 0:02 1.95% [intr{swi4: clock}]
12 root -32 - 0K 272K WAIT 0 0:58 0.00% [intr{swi4: clock}]
14 root -16 - 0K 16K - 1 0:37 0.00% [yarrow]
0 root -16 0 0K 224K sched 1 0:36 0.00% [kernel{swapper}]
0 root -68 0 0K 224K - 1 0:24 0.00% [kernel{em3 taskq}]
256 root 76 20 6908K 1380K kqread 1 0:17 0.00% /usr/local/sbin/check_reload_status
79750 root 44 0 59596K 6756K select 1 0:06 0.00% /usr/local/bin/vmtoolsd -c /usr/local/share
20203 root 44 0 24232K 5420K kqread 0 0:06 0.00% /usr/local/sbin/lighttpd -f /var/etc/lighty
15152 root 44 0 5784K 1464K select 0 0:05 0.00% /usr/local/sbin/apinger -c /var/etc/apinger
0 root -68 0 0K 224K - 0 0:05 0.00% [kernel{em1 taskq}]
86756 root 52 0 150M 38940K piperd 1 0:05 0.00% /usr/local/bin/php
79831 root 44 0 146M 33480K accept 1 0:04 0.00% /usr/local/bin/php
16 root -16 - 0K 16K pftm 1 0:02 0.00% [pfpurge]
53737 root 44 0 6960K 1652K select 1 0:02 0.00% /usr/sbin/syslogd -s -c -c -l /var/dhcpd/va
Really good shit. If I find no solution, I will have to say bye bye to pfSense :(
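For anyone who wants to compare the guest's own view against the hypervisor graph, here is a rough Python sketch that totals the WCPU column from the rows pasted above. The function name summarize_wcpu and the regular expression are my own, not part of pfSense or top, and adding up per-thread WCPU values is only an approximate indicator of load.

```python
import re

# Rows copied from the top output above; only the rows with non-zero WCPU.
SAMPLE = """\
11 root 171 ki31 0K 32K CPU0 0 351:45 82.96% [idle{idle: cpu0}]
11 root 171 ki31 0K 32K RUN 1 343:38 73.00% [idle{idle: cpu1}]
0 root -68 0 0K 224K - 1 40:29 13.96% [kernel{em0 taskq}]
0 root -68 0 0K 224K - 0 30:51 10.99% [kernel{em4 taskq}]
12 root -32 - 0K 272K WAIT 0 0:02 1.95% [intr{swi4: clock}]
"""

# Hypothetical pattern: the WCPU value is the number followed by '%',
# and everything after it is treated as the COMMAND column.
ROW = re.compile(r"(\d+(?:\.\d+)?)%\s+(.*)$")


def summarize_wcpu(text):
    """Return (idle_total, busy_total) summed over all parsed rows."""
    idle = 0.0
    busy = 0.0
    for line in text.splitlines():
        match = ROW.search(line)
        if not match:
            continue  # skip headers and anything without a WCPU value
        pct = float(match.group(1))
        command = match.group(2)
        if command.startswith("[idle"):
            idle += pct
        else:
            busy += pct
    return idle, busy


idle_total, busy_total = summarize_wcpu(SAMPLE)
print(f"idle threads: {idle_total:.2f}%  busy threads: {busy_total:.2f}%")
```

In this sample almost all of the non-idle time sits in the em0 and em4 taskq kernel threads, i.e. in NIC processing.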
-
Has one of the minority who's seeing this actually contacted VMware support? It only happens to a tiny fraction. The VM isn't using that much CPU, if the hypervisor is…guess whose fault that is? More than likely not ours or FreeBSD's.
-
@cmb:
Has one of the minority who's seeing this actually contacted VMware support? It only happens to a tiny fraction. The VM isn't using that much CPU, if the hypervisor is…guess whose fault that is? More than likely not ours or FreeBSD's.
As somebody already noted in one of previous posts - it is likely that the majority is hitting this issue, but are simply unaware of it. Under low or modest load you never notice the high CPU usage in VMware unless you are explicitly monitoring the VM from the VMware side (and many users only monitor load inside pfsense).
If there are any users who are not seeing this issue (make sure you actually look at the VMware virtual machine CPU usage under modest load), please post your configuration or a description of your VMware environment.
-
Just today I'm being hit with 90+Mbps of external junk traffic. I started getting VMware alarms telling me that my pfSense CPU usage was excessive. When I check via pfSense dashboard, CPU is at ~45%. When I check VMware performance, CPU is at 90+%. See image.
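To put a number on that kind of discrepancy, here is a minimal Python sketch. The helper name cpu_gap is made up for illustration, and the inputs are the approximate readings from this post (the VMware figure was "90+%", so 90 is a lower bound).

```python
def cpu_gap(guest_pct, host_pct):
    """Return (difference in percentage points, host/guest ratio)."""
    if guest_pct <= 0:
        raise ValueError("guest_pct must be positive to form a ratio")
    return host_pct - guest_pct, host_pct / guest_pct


# Approximate readings from the post above: pfSense dashboard vs. VMware.
points, ratio = cpu_gap(45.0, 90.0)
print(f"gap: {points:.0f} points, host/guest ratio: {ratio:.1f}x")
```

Logging both values at the same moments under different loads would show whether the gap is a fixed overhead or scales with traffic.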
-
As somebody already noted in one of previous posts - it is likely that the majority is hitting this issue, but are simply unaware of it.
And as I've already noted, that's not true. I've reviewed this with many customers with a wide range of config specifics, including our own high-load production systems and our dev/test systems.
-
And as I've already noted, that's not true
The problem here is that we trust you and pfSense, but we also trust VMware, and we're getting conflicting information. Right now, this very moment, pfSense says 2% CPU and VMware says 7%. The Resource Allocation page says I'm using 316 MHz out of 4520 MHz, which is ~7%. I don't have the budget for a VMware service contract, so calling them is not an option for me. They are a much larger organization with a metric buttload of production installations, and you would think that such a CPU usage discrepancy would have been caught by them long ago. Kind of a stalemate.
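The MHz-to-percent arithmetic above can be checked with a short Python sketch. The function name host_cpu_percent is hypothetical, and the two inputs are the figures quoted in this post.

```python
def host_cpu_percent(used_mhz, capacity_mhz):
    """Convert a hypervisor MHz reading into a percentage of capacity."""
    if capacity_mhz <= 0:
        raise ValueError("capacity_mhz must be positive")
    return 100.0 * used_mhz / capacity_mhz


# Figures from the post above: 316 MHz used out of 4520 MHz available.
print(round(host_cpu_percent(316, 4520), 1))  # 7.0
```

So the 7% in VMware is consistent with its own MHz figure; the open question is why the guest reports only 2%.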
-
I can give a developer access to our cloud platform. We can also open a VMware ticket.
But I need help to solve this problem.
The problem only occurs when high load is present.
I have 50 pfSense instances online and one has the problem. It is the border router with the BGP connection to the internet.
The error occurs when the load needs more than one CPU. This is the result of my latest tests.
I now route my heavy traffic between the datacenters over a Windows 2012 R2 machine with SoftEther.
For 500 Mbit/s of load I need 3 CPUs at 2.6 GHz, with 2 L3 routing instances and encryption.
-
I can give a developer access to our cloud platform. We can also open a VMware ticket.
But I need help to solve this problem.
I could pursue it sometime in January if you're willing to do it as a professional services engagement. I'd give you a break on the hours as I'm curious what's going on here, though we're still talking probably a $1000 USD project to evaluate the issue and determine its source. Couldn't promise any fix, especially since the problem is likely VMware's, but that'd be enough to assemble a proper problem report to escalate to VMware if that's the source of the issue, or elsewhere if it's elsewhere. Email professional.services <at> pfsense <dot> org if that might be of interest and we can discuss further and put together a scope and statement of work.
Otherwise, at some point post-2.2 release I might take you up on it. Professional services is the only way I'll be able to look at it in the next couple months.
-
We have the exact same issue and are running 2.1 on ESXi 5.1.
But it isn't only VMware environments that are having the issue. There are reports above of it happening on KVM too. Therefore, doesn't it seem more like something in pfsense/FreeBSD itself?
I am also happy to open a case with VMware if you think there is a chance they will look at it seriously. But I am sure they are more likely to point it back at FreeBSD/pfSense.
-
If you're seeing that, I'd first upgrade to the latest available 2.2 snapshot (which is very close to what will be released), or wait at most a few days for the official release.
The only CPU-related complaints I've seen with KVM are about high CPU usage within the VM, which matches the CPU usage seen outside the VM. That's generally solved by enabling virtio, but for those who found that inadequate, upgrading to 2.2 resolved the issue.
-
It's 2.2.4 with VMware Tools, on VMware Workstation.
In pfSense: 8-9%
Host Task Manager: 30%