<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Multi-WAN - One of two WAN in failover drops ~1-2 min. for unknown reason]]></title><description><![CDATA[<p dir="auto">Hi,</p>
<p dir="auto">Note: the log sample below shows newer records at the top, older at the bottom.</p>
<p dir="auto">I get recurring messages regarding one of my gateways going down, intermittently. Sometimes within minutes, there will be a repeat. Other times, hours will go by without any activity:</p>
<pre><code>Sep 23 17:11:37	php-fpm	330	/rc.dyndns.update: MONITOR: WAN2GW is available now, adding to routing group WAN2failtoWAN 8.8.4.4|74.51.222.14|WAN2GW|15.522ms|4.223ms|0.0%|none
Sep 23 17:11:36	check_reload_status		Reloading filter
Sep 23 17:11:36	check_reload_status		Restarting OpenVPN tunnels/interfaces
Sep 23 17:11:36	check_reload_status		Restarting ipsec tunnels
Sep 23 17:11:36	check_reload_status		updating dyndns WAN2GW
Sep 23 17:10:20	php-fpm	328	/rc.openvpn: OpenVPN: One or more OpenVPN tunnel endpoints may have changed its IP. Reloading endpoints that may use WAN2GW.
Sep 23 17:10:20	php-fpm	328	/rc.dyndns.update: MONITOR: WAN2GW is down, omitting from routing group WAN2failtoWAN 8.8.4.4|74.51.222.14|WAN2GW|1341.057ms|2554.332ms|0.0%|down
Sep 23 17:10:19	check_reload_status		Reloading filter
Sep 23 17:10:19	check_reload_status		Restarting OpenVPN tunnels/interfaces
Sep 23 17:10:19	check_reload_status		Restarting ipsec tunnels
Sep 23 17:10:19	check_reload_status		updating dyndns WAN2GW
Sep 23 17:06:32	php-fpm	329	/rc.openvpn: OpenVPN: One or more OpenVPN tunnel endpoints may have changed its IP. Reloading endpoints that may use WAN2GW.
Sep 23 17:06:32	php-fpm	329	/rc.dyndns.update: MONITOR: WAN2GW is available now, adding to routing group WAN2failtoWAN 8.8.4.4|74.51.222.14|WAN2GW|69.166ms|119.861ms|0.0%|none
Sep 23 17:06:31	check_reload_status		Reloading filter
Sep 23 17:06:31	check_reload_status		Restarting OpenVPN tunnels/interfaces
Sep 23 17:06:31	check_reload_status		Restarting ipsec tunnels
Sep 23 17:06:31	check_reload_status		updating dyndns WAN2GW

</code></pre>
<p dir="auto">This only seems to happen when the <strong>check_reload_status</strong> occurs.</p>
<p dir="auto">Any thoughts or comments?</p>
<p dir="auto">Thanks.</p>
]]></description><link>https://forum.netgate.com/topic/120605/multi-wan-one-of-two-wan-in-failover-drops-1-2-min-for-unknown-reason</link><generator>RSS for Node</generator><lastBuildDate>Tue, 09 Jun 2026 08:36:51 GMT</lastBuildDate><atom:link href="https://forum.netgate.com/topic/120605.rss" rel="self" type="application/rss+xml"/><pubDate>Sun, 24 Sep 2017 00:52:15 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to Multi-WAN - One of two WAN in failover drops ~1-2 min. for unknown reason on Sun, 08 Oct 2017 05:22:52 GMT]]></title><description><![CDATA[<p dir="auto">Hi,</p>
<p dir="auto">After reviewing the ping payload size, and also your recommendations, I still have the same issue.<br />
Let me know if any other suggestions come to mind. Thx.</p>
<pre><code>Oct 7 15:31:19	dpinger		WAN2GW 8.8.4.4: duplicate echo reply received
Oct 7 15:31:19	dpinger		WAN2GW 8.8.4.4: duplicate echo reply received
Oct 7 15:29:46	dpinger		WAN2GW 8.8.4.4: Alarm latency 46725667us stddev 0us loss 95%
Oct 7 15:28:14	dpinger		WAN2GW 8.8.4.4: Alarm latency 15032us stddev 3426us loss 25%
Oct 7 15:26:44	dpinger		WAN2GW 8.8.4.4: Clear latency 15014us stddev 2740us loss 0%
</code></pre>
<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/wm408">@<bdi>wm408</bdi></a>:</p>
<blockquote>
<p dir="auto">Hi Derelict,</p>
<p dir="auto">Typically for the Monitor IP, I choose the ISP gateway or one hop past (as observed with traceroute). But lately for at least testing, I've set the problematic gateway's Monitor IP to a google DNS server also as that's been a popular choice throughout the forums.</p>
<p dir="auto">Thanks for your other tips. I will circle back and review each of your points after I look at the results with the topic I mentioned in an earlier post, re: ping payload size.</p>
<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/derelict">@<bdi>Derelict</bdi></a>:</p>
<blockquote>
<p dir="auto">Well, there you go. dpinger is doing its job.</p>
<p dir="auto">If you have gateway monitoring on WAN (the default setting), the system is automatically keeping track of two pings per second in Status &gt; Monitoring.</p>
<p dir="auto">From there select settings, change the left axis to Quality / WANGW (or the local equivalent).</p>
<p dir="auto">A good place to start with Options: 8 hours, Resolution: 1 minute.</p>
<p dir="auto">Another place to check is in Status &gt; System Logs, Gateways. Any events there with "Alarm" in them are times when the ping monitor had excessive loss or latency.</p>
<p dir="auto">A failure will look something like this: Jan 7 15:05:31 dpinger WANGW 8.8.8.8: Alarm latency 0us stddev 0us loss 100%</p>
<p dir="auto">Lines like this are just the dpinger process starting or reloading and are normal:</p>
<p dir="auto">dpinger send_interval 500ms loss_interval 2000ms time_period 60000ms report_interval 0ms data_len 0 alert_interval 1000ms latency_alarm 500ms loss_alarm 20% dest_addr 8.8.4.4 bind_addr 198.51.0.16 identifier "DSLGW "</p>
<p dir="auto">Sometimes it is beneficial to change your monitoring address to something further out. In that example you can see that I am monitoring a google DNS server there. In general, monitoring the ISP gateway is fine if it reliably responds to pings. Changes to the monitor IP address can be made in System &gt; Routing and editing the appropriate gateway.</p>
</blockquote>
</blockquote>
]]></description><link>https://forum.netgate.com/post/725871</link><guid isPermaLink="true">https://forum.netgate.com/post/725871</guid><dc:creator><![CDATA[wm408]]></dc:creator><pubDate>Sun, 08 Oct 2017 05:22:52 GMT</pubDate></item><item><title><![CDATA[Reply to Multi-WAN - One of two WAN in failover drops ~1-2 min. for unknown reason on Sun, 01 Oct 2017 23:33:52 GMT]]></title><description><![CDATA[<p dir="auto">Hi Derelict,</p>
<p dir="auto">Typically for the Monitor IP, I choose the ISP gateway or one hop past (as observed with traceroute). But lately for at least testing, I've set the problematic gateway's Monitor IP to a google DNS server also as that's been a popular choice throughout the forums.</p>
<p dir="auto">Thanks for your other tips. I will circle back and review each of your points after I look at the results with the topic I mentioned in an earlier post, re: ping payload size.</p>
<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/derelict">@<bdi>Derelict</bdi></a>:</p>
<blockquote>
<p dir="auto">Well, there you go. dpinger is doing its job.</p>
<p dir="auto">If you have gateway monitoring on WAN (the default setting), the system is automatically keeping track of two pings per second in Status &gt; Monitoring.</p>
<p dir="auto">From there select settings, change the left axis to Quality / WANGW (or the local equivalent).</p>
<p dir="auto">A good place to start with Options: 8 hours, Resolution: 1 minute.</p>
<p dir="auto">Another place to check is in Status &gt; System Logs, Gateways. Any events there with "Alarm" in them are times when the ping monitor had excessive loss or latency.</p>
<p dir="auto">A failure will look something like this: Jan 7 15:05:31 dpinger WANGW 8.8.8.8: Alarm latency 0us stddev 0us loss 100%</p>
<p dir="auto">Lines like this are just the dpinger process starting or reloading and are normal:</p>
<p dir="auto">dpinger send_interval 500ms loss_interval 2000ms time_period 60000ms report_interval 0ms data_len 0 alert_interval 1000ms latency_alarm 500ms loss_alarm 20% dest_addr 8.8.4.4 bind_addr 198.51.0.16 identifier "DSLGW "</p>
<p dir="auto">Sometimes it is beneficial to change your monitoring address to something further out. In that example you can see that I am monitoring a google DNS server there. In general, monitoring the ISP gateway is fine if it reliably responds to pings. Changes to the monitor IP address can be made in System &gt; Routing and editing the appropriate gateway.</p>
</blockquote>
]]></description><link>https://forum.netgate.com/post/724702</link><guid isPermaLink="true">https://forum.netgate.com/post/724702</guid><dc:creator><![CDATA[wm408]]></dc:creator><pubDate>Sun, 01 Oct 2017 23:33:52 GMT</pubDate></item><item><title><![CDATA[Reply to Multi-WAN - One of two WAN in failover drops ~1-2 min. for unknown reason on Sun, 01 Oct 2017 19:44:07 GMT]]></title><description><![CDATA[<p dir="auto">Well, there you go. dpinger is doing its job.</p>
<p dir="auto">If you have gateway monitoring on WAN (the default setting), the system is automatically keeping track of two pings per second in Status &gt; Monitoring.</p>
<p dir="auto">From there select settings, change the left axis to Quality / WANGW (or the local equivalent).</p>
<p dir="auto">A good place to start with Options: 8 hours, Resolution: 1 minute.</p>
<p dir="auto">Another place to check is in Status &gt; System Logs, Gateways. Any events there with "Alarm" in them are times when the ping monitor had excessive loss or latency.</p>
<p dir="auto">A failure will look something like this: Jan 7 15:05:31 dpinger WANGW 8.8.8.8: Alarm latency 0us stddev 0us loss 100%</p>
<p dir="auto">Lines like this are just the dpinger process starting or reloading and are normal:</p>
<p dir="auto">dpinger send_interval 500ms loss_interval 2000ms time_period 60000ms report_interval 0ms data_len 0 alert_interval 1000ms latency_alarm 500ms loss_alarm 20% dest_addr 8.8.4.4 bind_addr 198.51.0.16 identifier "DSLGW "</p>
<p dir="auto">Sometimes it is beneficial to change your monitoring address to something further out. In that example you can see that I am monitoring a google DNS server there. In general, monitoring the ISP gateway is fine if it reliably responds to pings. Changes to the monitor IP address can be made in System &gt; Routing and editing the appropriate gateway.</p>
]]></description><link>https://forum.netgate.com/post/724681</link><guid isPermaLink="true">https://forum.netgate.com/post/724681</guid><dc:creator><![CDATA[Derelict]]></dc:creator><pubDate>Sun, 01 Oct 2017 19:44:07 GMT</pubDate></item><item><title><![CDATA[Reply to Multi-WAN - One of two WAN in failover drops ~1-2 min. for unknown reason on Sun, 01 Oct 2017 19:27:38 GMT]]></title><description><![CDATA[<p dir="auto">Hi,</p>
<p dir="auto">I am going to test "Set ping payload size" to the problematic gateway. cmb advised this for "…buggy upstream devices...", which I may be experiencing here. Refer to this post: https://forum.pfsense.org/index.php?topic=110043.0</p>
<p dir="auto">This is a sample from my Gateways log:</p>
<p dir="auto">Oct 1 10:45:23 dpinger WAN2GW 8.8.4.4: Clear latency 209224us stddev 412652us loss 0%<br />
Oct 1 10:43:40 dpinger WAN2GW 8.8.4.4: Alarm latency 762289us stddev 1626523us loss 0%<br />
Oct 1 10:29:15 dpinger WAN2GW 8.8.4.4: Clear latency 42259us stddev 123654us loss 0%<br />
Oct 1 10:27:10 dpinger WAN2GW 8.8.4.4: Alarm latency 755434us stddev 2017180us loss 4%<br />
Oct 1 10:14:02 dpinger WAN2GW 8.8.4.4: Clear latency 443760us stddev 978113us loss 0%<br />
Oct 1 10:13:35 dpinger WAN2GW 8.8.4.4: Alarm latency 504314us stddev 971819us loss 0%</p>
<p dir="auto">I didn't know Chris had moved on, till now. I saw his post. Makes sense! thanks for the heads up.</p>
<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/heper">@<bdi>heper</bdi></a>:</p>
<blockquote>
<p dir="auto">CMB has gone to where the grass appears greener.</p>
<p dir="auto">Check your gateway logs. That should provide more insights in the reason why the gateway goes down</p>
</blockquote>
]]></description><link>https://forum.netgate.com/post/724676</link><guid isPermaLink="true">https://forum.netgate.com/post/724676</guid><dc:creator><![CDATA[wm408]]></dc:creator><pubDate>Sun, 01 Oct 2017 19:27:38 GMT</pubDate></item><item><title><![CDATA[Reply to Multi-WAN - One of two WAN in failover drops ~1-2 min. for unknown reason on Thu, 28 Sep 2017 12:54:28 GMT]]></title><description><![CDATA[<p dir="auto">CMB has gone to where the grass appears greener.</p>
<p dir="auto">Check your gateway logs. That should provide more insights in the reason why the gateway goes down</p>
]]></description><link>https://forum.netgate.com/post/724191</link><guid isPermaLink="true">https://forum.netgate.com/post/724191</guid><dc:creator><![CDATA[heper]]></dc:creator><pubDate>Thu, 28 Sep 2017 12:54:28 GMT</pubDate></item><item><title><![CDATA[Reply to Multi-WAN - One of two WAN in failover drops ~1-2 min. for unknown reason on Thu, 28 Sep 2017 01:42:21 GMT]]></title><description><![CDATA[<p dir="auto"><em>Bump</em>.</p>
<p dir="auto">Jimp? or cmb?</p>
<p dir="auto">Any thoughts from you guys?  8)</p>
]]></description><link>https://forum.netgate.com/post/724103</link><guid isPermaLink="true">https://forum.netgate.com/post/724103</guid><dc:creator><![CDATA[wm408]]></dc:creator><pubDate>Thu, 28 Sep 2017 01:42:21 GMT</pubDate></item></channel></rss>