FAQ | This is a LIVE service | Changelog

Skip to content
Snippets Groups Projects
  1. Jul 08, 2021
    • Dr Adam Thorn's avatar
      Ensure pg-dump-script includes a dump of roles · 67d141b5
      Dr Adam Thorn authored
      We don't always need the role data, if the presumption is that we'll
      be doing a pg_restore in conjunction with an ansible role which creates
      all required roles. But, having a copy of the role data will never hurt!
      It also gives us a straightforward way of restoring a database to a
      standalone postgres instance without having to have provisioned a
      dedicated VM with the relevant ansible roles.
      0.9-ch84
      67d141b5
    • Dr Adam Thorn's avatar
      Add a script to do a postgres backup via pg_dump · 5b4a8757
      Dr Adam Thorn authored
      At present we use myriad one-off per host scripts to do a pg_dump,
      and they all do (or probably should do) the same thing. In combination
      with setting options in the host's backup config file, I think
      this single script covers all our routine pg backups.
      0.9-ch83
      5b4a8757
    • Dr Adam Thorn's avatar
      Call PRE and POST with same args as zfs-rsync.sh · e01d7ebc
      Dr Adam Thorn authored
      we were just passing the hostname. Adding extra args should
      not impact any existing script, but will let us write better/
      more maintainable/deduplicated PRE scripts
      0.9-ch82
      e01d7ebc
  2. Jun 29, 2021
    • Dr Catherine Pitt's avatar
      Add an outline script for moving a whole zpool · 7ae60f97
      Dr Catherine Pitt authored
      This came about because a disk has failed on nest-backup, which only has
      subdirectory backups of nest-filestore-0 and so move-machine.sh was not
      going to be helpful - it assumes all tasks for a machine are on the same
      zpool which isn't true there. In this case I did the move by hand, but
      have sketched out the steps in the script in the hope that next time we
      have to do this we'll do it by looking at the script and running bits by
      hand, then improve the script a bit, and continue until it's usable.
      7ae60f97
  3. Jun 18, 2021
  4. Jun 08, 2021
  5. May 12, 2021
    • Dr Catherine Pitt's avatar
      Fix a bug in the move-machine script · 75db08dc
      Dr Catherine Pitt authored
      The generation of the command to unexport NFS filesystems could generate
      an invalid command. Leading spaces were not being stripped, and in cases
      where there is more than one backup target for a machine we need to
      unexport every target. Because we also had 'set -e' in operation at this
      point, the script would fail there and never clean up the moved ZFS. I
      don't mind if we fail to unexport; if that's subsequently a problem for
      removing the ZFS then the script will fail at that point.
      
      This change makes the script generate better exportfs -u commands and
      not exit if they fail.
      0.9-ch76
      75db08dc
  6. Apr 30, 2021
    • Dr Catherine Pitt's avatar
      Make database connections short-lived · e40c1a55
      Dr Catherine Pitt authored
      The code used to open a database connection for each thread and leave
      them open for as long as the scheduler ran. This worked reasonably well
      until we moved to PostgreSQL 13 on Focal, although the scheduler would
      fail if the database was restarted because there was no logic to
      reconnect after a connection dropped.
      
      On Focal/PG13 the connection for the 'cron' thread steadily consumes
      memory until it has exhausted everything in the machine. This appears to
      be a Postgres change rather than a Perl DBI change: the problem can be
      reproduced by sitting in psql and running 'select * from backup_queue'
      repeatedly. Once or twice a minute an instance of this query will cause
      the connection to consume another MB of RAM which is not released until
      the database connection is closed. The cron thread runs that query every
      two seconds. My guess is it's something peculiar about the view that
      query selects from - the time interval thing is interesting.
      This needs more investigation.
      
      But in the meantime I'd like to have backup servers that don't endlessly
      gobble RAM, so this change makes the threads connect to the database
      only when they need to, and closes the connection afterwards. This
      should also make things work better over database restarts but that's
      not been carefully tested.
      0.9-ch75
      e40c1a55
  7. Dec 11, 2020
  8. Oct 06, 2020
  9. Apr 07, 2020
  10. Dec 18, 2019
  11. Jul 30, 2019
  12. Jul 23, 2019
  13. Apr 23, 2019
    • Dr Catherine Pitt's avatar
      new-backup-rsnapshot better support for postgres backups · 6f4dfa12
      Dr Catherine Pitt authored
      The new-backup-rsnapshot script understands a 'postgres' argument, but
      this set up a postgres backup in an old style that we no longer use.
      This change updates it to do some of the work of setting up a new style
      postgres backup and tell the user what else they might need to edit to
      make it go; it varies quite a lot depending on server.
      0.9-ch61
      6f4dfa12
  14. Jan 16, 2019
  15. Nov 06, 2018
  16. Oct 22, 2018
  17. Oct 18, 2018
  18. Jun 19, 2018
  19. Jan 11, 2018
  20. Aug 11, 2017
    • Dr Adam Thorn's avatar
      Raise a red dot if a machine has never completed a backup · efad37d6
      Dr Adam Thorn authored
      The script runs some SQL which ultimately determines the time at which the last
      backup with an exit code of zero was. If there has never been a successful
      backup, we ended up reporting a clear dot. Whilst this may lead to a few
      transitory red dots immediately after adding a new host, I think this is
      preferable to not realising for months that a host isn't being backed up!
      See ticket 154527
      efad37d6
  21. May 22, 2017
  22. Mar 29, 2017
  23. Mar 28, 2017
  24. Feb 14, 2017
  25. Jan 12, 2017
  26. Dec 08, 2016
  27. Dec 02, 2016
  28. Sep 07, 2016
Loading