Hi,
since the last server-crash (our utm is a virtual machine) we got this error messages in ha-log and the state is "Syncing" since three days
This thread was automatically locked due to age.
Hi,
since the last server-crash (our utm is a virtual machine) we got this error messages in ha-log and the state is "Syncing" since three days
You'll loose reporting data but:
Step 1: login to master node, su to root
Step 2: open a new ssh window, login to master again, su to root
Step 3: on 2nd window, enter: ha_utils ssh
Step 4: in the 2nd window, login to slave as loginuser, then su to root
Step 5: on both ssh windows, enter: killall repctl
Step 6: on both ssh windows, enter: /etc/init.d/postgresql92 rebuild
Step 7: after database rebuilds, enter on both ssh windows: repctl
You'll loose reporting data but:
Step 1: login to master node, su to root
Step 2: open a new ssh window, login to master again, su to root
Step 3: on 2nd window, enter: ha_utils ssh
Step 4: in the 2nd window, login to slave as loginuser, then su to root
Step 5: on both ssh windows, enter: killall repctl
Step 6: on both ssh windows, enter: /etc/init.d/postgresql92 rebuild
Step 7: after database rebuilds, enter on both ssh windows: repctl
Hello,
as a member of the sophos Support i have to address very important points about this "tutorial":
1. Database rebuilds are only to be done by a certain support level and only by Sophos Support Members.
2. Rebuilding a Database without support recommendation on your own, "voids" your support until you re-image
the machine.
3. There are several unpredictable problems/errors that could occur after rebuilding a database the wrong way
or making mistakes that we can't fix or rather wont fix as per point 2, as this is the central database for
most UTM services!
4. This is not the correct way to rebuild a database and many times not even necessary. For analysis if this is even needed,
you can contact Support or your Partner / Distributor.
Regards,
Cheerok
Danny,
Thank you for this post. Just had to use the information in it - again.
Worth adding that
<M> utm:/home/login # tail -f /var/log/system.log
2019:05:02-15:20:12 utm-2 ulogd[7753]: pg1: connect: could not connect to server: No such file or directory
2019:05:02-15:20:17 utm-2 ulogd[7753]: pg1: connect: could not connect to server: No such file or directory
2019:05:02-15:20:22 utm-2 ulogd[7753]: pg1: connect: could not connect to server: No such file or directory
^C
<M> utm:/home/login # telnet 127.0.0.1 5432
Trying 127.0.0.1...
telnet: connect to address 127.0.0.1: Connection refused
Is a good indication that the database is corrupt.
And to Cheerok's points I think it would be reasonable to add
So as a customer you have to be careful. Be well informed. Contact support where possible. And when not possible - tread carefully.
So it is not in place to prevent customers from using root, only to dissuade people from doing "stupid stuff" like trying to install device drivers using root access - which actually happened back in ASG V4 days and resulted in the root changes may void support rule.
All the best,
Adrien.