Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    High Avail. Sync broken

    Scheduled Pinned Locked Moved HA/CARP/VIPs
    22 Posts 8 Posters 15.4k Views 3 Watching
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • V Offline
      vigorfac
      last edited by

      yes master (10.88.88.1) can ping slave on 10.88.88.2
      and slave can ping master on 10.88.88.1.

      I have this in log now on master

      Time Process PID Message
      Nov 7 12:40:52 php-fpm 48758 /rc.filter_synchronize: Beginning XMLRPC sync data to https://10.88.88.2:443/xmlrpc.php.
      Nov 7 12:40:52 php-fpm 48758 /rc.filter_synchronize: New alert found: A communications error occurred while attempting to call XMLRPC method host_firmware_version:
      Nov 7 12:40:52 php-fpm 48758 /rc.filter_synchronize: A communications error occurred while attempting to call XMLRPC method host_firmware_version:
      Nov 7 12:40:43 php-fpm 38230 /rc.filter_synchronize: New alert found: A communications error occurred while attempting to call XMLRPC method filter_configure:
      Nov 7 12:40:43 php-fpm 38230 /rc.filter_synchronize: A communications error occurred while attempting to call XMLRPC method filter_configure:
      Nov 7 12:40:38 check_reload_status Reloading filter
      Nov 7 12:40:37 php-fpm 51322 /rc.filter_synchronize: Beginning XMLRPC sync data to https://10.88.88.2:443/xmlrpc.php.
      Nov 7 12:40:36 check_reload_status Syncing firewall
      Nov 7 12:40:18 pfsense1 nginx: 2017/11/07 12:40:18 [error] 12688#100122: send() failed (54: Connection reset by peer)
      Nov 7 12:40:18 php-fpm 51646 /status_logs_settings.php: The command '/usr/local/sbin/unbound -c /var/unbound/unbound.conf' returned exit code '1', the output was '[1510054818] unbound[90624:0] error: bind: address already in use [1510054818] unbound[90624:0] fatal error: could not open ports'
      Nov 7 12:40:15 syslogd kernel boot file is /boot/kernel/kernel

      S 1 Reply Last reply Reply Quote 0
      • DerelictD Offline
        Derelict LAYER 8 Netgate
        last edited by

        Are you passing the traffic on the sync interface on the secondary?

        Are both nodes set to the same webgui settings (http/https/port) and have the same username and password set?

        1 Reply Last reply Reply Quote 1
        • V Offline
          vigorfac
          last edited by

          Sorry fo the response delay,

          Yes both node are on https same port.

          Both node use dedicated ionterface for the sync, no vlan .

          Here are the new log :

          Nov 10 09:49:29 php-fpm 34743 /rc.filter_synchronize: A communications error occurred while attempting to call XMLRPC method host_firmware_version:
          Nov 10 09:49:10 check_reload_status Reloading filter
          Nov 10 09:49:10 php-fpm 57111 /rc.filter_synchronize: The pfSense software configuration version of the other member could not be determined. Skipping synchronization to avoid causing a problem!
          Nov 10 09:49:10 php-fpm 57111 /rc.filter_synchronize: XMLRPC versioncheck: – 17.3
          Nov 10 09:49:10 php-fpm 57111 /rc.filter_synchronize: New alert found: A communications error occurred while attempting to call XMLRPC method host_firmware_version:
          Nov 10 09:49:10 php-fpm 57111 /rc.filter_synchronize: A communications error occurred while attempting to call XMLRPC method host_firmware_version:
          Nov 10 09:48:54 php-fpm 66735 /rc.filter_synchronize: Beginning XMLRPC sync data to https://10.88.88.2:443/xmlrpc.php.
          Nov 10 09:48:54 php-fpm 66735 /rc.filter_synchronize: New alert found: A communications error occurred while attempting to call XMLRPC method host_firmware_version:
          Nov 10 09:48:54 php-fpm 66735 /rc.filter_synchronize: A communications error occurred while attempting to call XMLRPC method host_firmware_version:
          Nov 10 09:48:44 php-fpm 56798 /rc.filter_synchronize: Beginning XMLRPC sync data to https://10.88.88.2:443/xmlrpc.php.

          Master and slave can ping each other.

          Each time i make a change on master it's very long to validate .

          Thanks for your help Derelict.

          1 Reply Last reply Reply Quote 0
          • DerelictD Offline
            Derelict LAYER 8 Netgate
            last edited by

            Can you bring up the webgui on the secondary at the time?

            Do firewall rules pass xmlrpc (webgui) traffic on the sync interface?

            If looks like the primary cannot connect to the secondary there. Need to isolate the reason why that is so.

            1 Reply Last reply Reply Quote 0
            • K Offline
              kikuyu
              last edited by

              Hallo

              I have exactly the same issue.

              Master and slave pfsense same version 2.4.2-p1,

              2.4.2-RELEASE-p1 (amd64)
              built on Tue Dec 12 13:45:26 CST 2017
              FreeBSD 11.1-RELEASE-p6

              same admin, password, same webgui https port
              Master ans slave can ping each other on the HA interfaces

              Here the logs on the master

              Jan 1 17:42:09 check_reload_status Syncing firewall
              Jan 1 17:42:10 php-fpm 13794 /system_hasync.php: waiting for pfsync…
              Jan 1 17:42:10 php-fpm 21804 /rc.filter_synchronize: Beginning XMLRPC sync data to https://192.168.100.2:443/xmlrpc.php.
              Jan 1 17:42:20 php-fpm 21804 /rc.filter_synchronize: A communications error occurred while attempting to call XMLRPC method host_firmware_version: Unable to connect to tls://192.168.100.2:443. Error: Operation timed out
              Jan 1 17:42:20 php-fpm 21804 /rc.filter_synchronize: New alert found: A communications error occurred while attempting to call XMLRPC method host_firmware_version: Unable to connect to tls://192.168.100.2:443. Error: Operation timed out
              Jan 1 17:42:20 php-fpm 21804 /rc.filter_synchronize: Beginning XMLRPC sync data to https://192.168.100.2:443/xmlrpc.php.
              Jan 1 17:42:30 php-fpm 21804 /rc.filter_synchronize: A communications error occurred while attempting to call XMLRPC method host_firmware_version: Unable to connect to tls://192.168.100.2:443. Error: Operation timed out
              Jan 1 17:42:30 php-fpm 21804 /rc.filter_synchronize: New alert found: A communications error occurred while attempting to call XMLRPC method host_firmware_version: Unable to connect to tls://192.168.100.2:443. Error: Operation timed out
              Jan 1 17:42:30 php-fpm 21804 /rc.filter_synchronize: XMLRPC versioncheck: -- 17.3
              Jan 1 17:42:30 php-fpm 21804 /rc.filter_synchronize: The pfSense software configuration version of the other member could not be determined. Skipping synchronization to avoid causing a problem!
              Jan 1 17:42:42 php-fpm 13794 /system_hasync.php: pfsync done in 30 seconds.
              Jan 1 17:42:42 php-fpm 13794 /system_hasync.php: Configuring CARP settings finalize...
              Jan 1 17:47:00 check_reload_status Syncing firewall
              Jan 1 17:47:01 php-fpm 58070 /rc.filter_synchronize: Beginning XMLRPC sync data to https://192.168.100.2:443/xmlrpc.php.
              Jan 1 17:47:03 check_reload_status Reloading filter
              Jan 1 17:47:11 php-fpm 58070 /rc.filter_synchronize: A communications error occurred while attempting to call XMLRPC method host_firmware_version: Unable to connect to tls://192.168.100.2:443. Error: Operation timed out
              Jan 1 17:47:11 php-fpm 58070 /rc.filter_synchronize: New alert found: A communications error occurred while attempting to call XMLRPC method host_firmware_version: Unable to connect to tls://192.168.100.2:443. Error: Operation timed out
              Jan 1 17:47:11 php-fpm 58070 /rc.filter_synchronize: Beginning XMLRPC sync data to https://192.168.100.2:443/xmlrpc.php.
              Jan 1 17:47:21 php-fpm 58070 /rc.filter_synchronize: A communications error occurred while attempting to call XMLRPC method host_firmware_version: Unable to connect to tls://192.168.100.2:443. Error: Operation timed out

              Could you solve already the issue you had?

              Kind Regards

              and first!!!

              Happy NEW YEAR 2018

              1 Reply Last reply Reply Quote 0
              • K Offline
                kikuyu
                last edited by

                Hi,

                After double checking the configuration, all is working fine!!!

                THX

                1 Reply Last reply Reply Quote 0
                • I Offline
                  IKO007
                  last edited by

                  @kikuyu:

                  Hi,

                  After double checking the configuration, all is working fine!!!

                  THX

                  And what was your configuration problem ? may be I am still missing something.

                  1 Reply Last reply Reply Quote 0
                  • K Offline
                    kikuyu
                    last edited by

                    Hallo,

                    I used the HA1 IP 192.168.200.1 and HA2 IP 192.168.200.2. Normally netmask /24 for the network 192.160.200.0/24.
                    But after double checking, on the slave, the netmask was /32. I don't know how it was here?
                    After correction, all is now working fine.

                    Rgds.
                    Kikuyu

                    1 Reply Last reply Reply Quote 0
                    • I Offline
                      IKO007
                      last edited by

                      Hello,
                      Thanks for tip, I had this correctly configured, but I find out fact, that my sync is not working, when I have my DNS resolver Turned ON. May be this will also help somebody.

                      1 Reply Last reply Reply Quote 2
                      • T Offline
                        tgrubbs
                        last edited by

                        I wanted to simply add that I had this same problem and thankfully found your suggestion to turn off DNS Resolver....this is unfortunate that it is required for HA Sync to work. I'll continue to investigate.

                        My error log also stated "The pfSense software configuration version of the other member could not be determined. Skipping synchronization to avoid causing a problem!"

                        2.4.3-RELEASE-p1 (arm)
                        built on Thu May 10 15:59:52 CDT 2018
                        FreeBSD 11.1-RELEASE-p10

                        The system is on the latest version.
                        Version information updated at Sat Jul 14 1:35:11 UTC 2018

                        1 Reply Last reply Reply Quote 0
                        • DerelictD Offline
                          Derelict LAYER 8 Netgate
                          last edited by

                          This post is deleted!
                          1 Reply Last reply Reply Quote 0
                          • S Offline
                            SteveITS Rebel Alliance @vigorfac
                            last edited by

                            @vigorfac said in High Avail. Sync broken:

                            Nov 7 12:40:18 php-fpm 51646 /status_logs_settings.php: The command '/usr/local/sbin/unbound -c /var/unbound/unbound.conf' returned exit code '1', the output was '[1510054818] unbound[90624:0] error: bind: address already in use [1510054818] unbound[90624:0] fatal error: could not open ports'

                            The above error sounds similar to this bug in pfSense, which was since resolved:
                            https://redmine.pfsense.org/issues/7326#note-2 (the code didn't wait long enough for unbound to stop before trying to start it again...in our case the master server was unaffected but the backup router would end up with unbound not running)

                            re: HA sync, we have "DNS Forwarder and DNS Resolver configurations" checked in our setup and have no sync issues. So I don't think that by itself is an issue.

                            Only install packages for your version, or risk breaking it. Select your branch in System/Update/Update Settings.
                            When upgrading, allow 10-15 minutes to reboot, or more depending on packages, and device or disk speed.
                            Upvote 👍 helpful posts!

                            1 Reply Last reply Reply Quote 0
                            • First post
                              Last post
                            Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.