GW_WAN XXX.XXX.XXX.XXX ***down*** openvpn never recovers



  • A client called me today and said they could not connect to the VPN (site-to-site OpenVPN config).

    I connected to the client side, looked at the services, and OpenVPN showed as stopped. I clicked start and they were immediately working again. I am trying to figure out what happened. From my logs, it looks like the box somehow lost its connection to the ISP gateway. FYI, the client could still browse the internet with no issues. The problem is that my OpenVPN connection never recovered and I had to start it again manually. Is this a bug, or is there a setting I need to change? Also, I am connected to a bridged router using PPPoE credentials, and I get "dhclient: FAIL" in my log every minute or so.

    Help would be appreciated,

    Rhett

    SYSTEM LOGS
    apinger: ALARM: GW_WAN(88.xx.xxx.245) *** down ***
    May 2 11:53:44 check_reload_status: reloading filter
    May 2 11:53:58 dhclient: FAIL
    May 2 11:54:26 check_reload_status: Rewriting resolv.conf
    May 2 11:54:26 kernel: ovpnc1: link state changed to DOWN
    May 2 11:54:26 check_reload_status: reloading filter
    May 2 11:54:27 php: : Could not find gateway for interface(wan).
    May 2 11:54:29 dnsmasq[42466]: no servers found in /etc/resolv.conf, will retry
    May 2 11:54:29 dnsmasq[42466]: no servers found in /etc/resolv.conf, will retry
    May 2 11:55:00 dhclient: FAIL
    May 2 11:55:00 check_reload_status: Rewriting resolv.conf
    May 2 11:55:00 apinger: alarm canceled: GW_WAN(88.xx.xxx.245) *** down ***
    May 2 11:55:01 php: : ROUTING: change default route to 88.xx.xxx.245
    May 2 11:55:01 check_reload_status: reloading filter
    May 2 11:55:01 apinger: Exiting on signal 15.
    May 2 11:55:02 check_reload_status: reloading filter
    May 2 11:55:02 apinger: Starting Alarm Pinger, apinger(48345)
    May 2 11:55:08 php: : Resyncing OpenVPN instances for interface WAN.
    May 2 11:55:22 dnsmasq[42466]: reading /etc/resolv.conf
    May 2 11:55:22 dnsmasq[42466]: using nameserver 100.xxx.x.65#53
    May 2 11:55:22 dnsmasq[42466]: using nameserver 100.xxx.x.65#53
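
For anyone comparing their own logs against the ones above, here is a rough Python sketch that measures how long passed between the tunnel interface going down and the OpenVPN resync being triggered. It only reads three lines copied from this log. The helper names (`parse`, `first_time`) are made up for illustration, and the year is an assumption, since syslog lines do not carry one. It says nothing about why the service stayed stopped; it just makes the timeline easier to read.

```python
import re
from datetime import datetime

# Lines copied from the log above (syslog-style, no year in the timestamp).
LOG = """\
May 2 11:54:26 kernel: ovpnc1: link state changed to DOWN
May 2 11:55:00 apinger: alarm canceled: GW_WAN(88.xx.xxx.245) *** down ***
May 2 11:55:08 php: : Resyncing OpenVPN instances for interface WAN.
"""

# "Mon D HH:MM:SS message"; lines without a timestamp will not match.
LINE_RE = re.compile(r"^(\w{3}\s+\d{1,2} \d{2}:\d{2}:\d{2}) (.*)$")


def parse(line, year=2011):
    """Return (timestamp, message) or None. The year is an assumption."""
    m = LINE_RE.match(line.strip())
    if not m:
        return None
    ts = datetime.strptime(f"{year} {m.group(1)}", "%Y %b %d %H:%M:%S")
    return ts, m.group(2)


def first_time(events, needle):
    """Timestamp of the first message containing `needle`, else None."""
    for ts, msg in events:
        if needle in msg:
            return ts
    return None


events = [e for e in map(parse, LOG.splitlines()) if e]
down = first_time(events, "link state changed to DOWN")
resync = first_time(events, "Resyncing OpenVPN instances")
gap_seconds = (resync - down).total_seconds()
print(f"ovpnc1 down -> OpenVPN resync: {gap_seconds:.0f} s")
```

In this log the resync is requested well under a minute after the link drops, so the interesting question is what happened to the OpenVPN client after that resync, not whether the gateway came back.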



  • Here is the build of 2.0-RC1 I am on:

    2.0-RC1 (i386)
    built on Fri Apr 22 14:05:23 EDT 2011



  • Can you try a more recent build? There were some fixes to the dhcp script that may address what you are hitting.



  • I will do that tonight, keep an eye on it, and let you guys know if I still have issues.

