Telegraf / InfluxDB - additional config fails after X days
-
Hi All,
Just curious if anyone else has had a similar issue. I run Telegraf to InfluxDB, and after a certain period of time (a few days usually), my metrics from the "additional config" section stop functioning. This primarily seems to affect Pfblocker metrics via the [[inputs.logparser]] entry.
Generally if I click "save" on the Telegraf service it will kick it back on, or if I reboot Pfsense.
I am using: https://github.com/VictorRobellini/pfSense-Dashboard
Any thoughts or assistance is welcome. I considered some sort of cron job to restart telegraf every night, if that would work.
-
hi,
Have an original thread here in the forum, I think ask your question there:
https://forum.netgate.com/topic/152132/grafana-dashboard-using-telegraf-with-additional-plugins/2Maybe you can get answers faster
-
Thank you, I did have a dig to see if someone else had the same problem but it was unique, I meant to come back sooner saying I had resolved.
As it turns out, two issues I had were actually one. I noticed during this that my listen queue was regularly overflowing by a considerable amount. Eventually I discovered that a particular (blocked) domain was being hammered, with multiple pings every second, over and over.
I have since modified hosts files on the client machine to prevent it, and now everything works as intended. The sheer volume of connection attempts was causing DNSBL to fall over, and killing the listen queue.
Thanks again for your reply.
-
Good news, learning is for a lifetime