Beware the SST

In Percona XtraDB Cluster (PXC) I often run across users who are fearful of SSTs on their clusters. I’ve always maintained that if you can’t cope with an SST, PXC may not be right for you, but that doesn’t change the fact that SSTs on multiple terabytes of data can be quite costly.

SST, by current definition, is a full backup copied from a Donor to a Joiner.  The most popular method is Percona XtraBackup, so we’re talking about a donor node that must:

  1. Run a full XtraBackup that reads its entire datadir
  2. Keep up with Galera replication as much as possible (though lagging Donors don’t send flow control)
  3. Possibly still serve application traffic if you don’t remove Donors from rotation

So, I’ve been interested in alternative ways to work around state transfers and I want to present one way I’ve found that may be useful to someone out there.

Percona XtraBackup and Incrementals

It is possible to use Percona XtraBackup full and incremental backups to build a datadir that might possibly IST.  First we’ll focus on the mechanics of the backups, preparing them and getting the Galera GTID, and then later discuss when it may be viable for IST.

Suppose I have a fairly recent full XtraBackup and one or more incremental backups that I can apply on top of it to get VERY close to realtime on my cluster (more on that ‘VERY’ later).

In my proof of concept test, I now have a full and two incrementals; the layout is roughly this (the paths here are illustrative stand-ins for my real backup directories):
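    /backups/full    # full XtraBackup
    /backups/inc1    # incremental, based on the full
    /backups/inc2    # incremental, based on inc1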

To recover this data, I follow the normal XtraBackup incremental apply process; with the illustrative paths above, that looks roughly like this:
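    # prepare the base backup, stopping before the rollback phase
    innobackupex --apply-log --redo-only /backups/full
    # merge the first incremental, still redo-only
    innobackupex --apply-log --redo-only /backups/full --incremental-dir=/backups/inc1
    # merge the last incremental (no --redo-only this time)
    innobackupex --apply-log /backups/full --incremental-dir=/backups/inc2
    # final prepare: roll back uncommitted transactions
    innobackupex --apply-log /backups/full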

I can see that as I roll forward through my incrementals, I get a higher and higher GTID. Galera’s GTID is stored in the InnoDB recovery information, so XtraBackup extracts it after every batch it applies to the datadir we’re restoring.
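As a sanity check, if the backups are taken with innobackupex --galera-info, XtraBackup also records the Galera position at backup time in a small text file inside each backup directory (the value below is just a placeholder):

    cat /backups/inc2/xtrabackup_galera_info
    <cluster-state-uuid>:<seqno>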

We now have a datadir that is ready to go; we need to copy it into the datadir of our Joiner node and set up a grastate.dat. Without a grastate, starting the node would force an SST no matter what.
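A sketch of that step, assuming the prepared backup from above and a default /var/lib/mysql datadir (the uuid is the cluster state UUID, e.g. as reported during the prepare or found in xtrabackup_galera_info; the version line may differ with your Galera version):

    # on the Joiner, with mysqld stopped and the datadir emptied
    innobackupex --copy-back /backups/full
    chown -R mysql:mysql /var/lib/mysql

    # then create /var/lib/mysql/grastate.dat (owned by mysql:mysql) containing:
    # GALERA saved state
    version: 2.1
    uuid:    <cluster-state-uuid>
    seqno:   -1
    cert_index: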

If I start the node now, it should see the grastate.dat with the -1 seqno and run --wsrep_recover to extract the GTID from InnoDB (I could have also just put that GTID directly into my grastate.dat).

This will allow the node to start up from the merged XtraBackup incrementals with a known Galera GTID.

But will it IST?

That’s the question.  IST happens when the selected Donor still has, in its gcache, all the transactions the Joiner needs to get fully caught up.  There are several implications of this:

  • A gcache is mmap allocated and does not persist across restarts on the donor.  A restart essentially purges the mmap.
  • You can query the oldest GTID seqno still cached on a Donor by checking the status variable ‘wsrep_local_cached_downto’ (see the example after this list).  This variable is not available on 5.5, so there you are forced to guess whether you can IST or not.
  • Most PXC 5.6 releases will auto-select a Donor based on IST candidacy.  Prior to that (i.e., 5.5), Donor selection was not based on IST candidacy at all, meaning you had to be much more careful and do Donor selection manually.
  • There’s no direct mapping from the earliest GTID in a gcache to a specific time, so knowing at a glance if a given incremental will be enough to IST is difficult.
  • It’s also difficult to know how big to make your gcache (set in MB/GB/etc.) with respect to your backups (which are scheduled by the day/hour/etc.).
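For reference, the Donor-side check mentioned above (PXC 5.6 / Galera 3) is just a status variable; if the seqno recovered on the Joiner is at or above this value, the Donor still has everything needed for IST:

    mysql> SHOW GLOBAL STATUS LIKE 'wsrep_local_cached_downto';
    +---------------------------+---------+
    | Variable_name             | Value   |
    +---------------------------+---------+
    | wsrep_local_cached_downto | <seqno> |
    +---------------------------+---------+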

All that being said, we’re still talking about backups here.  The above method will work only if:

  • You do frequent incremental backups
  • You have a large gcache (hopefully more on this in a future blog post)
  • You can restore a backup faster than it takes for your gcache to overflow
Comments
Reiner

I just worked on nearly the same thing.

One idea we implemented is that you _never_ have to take a full backup again (if all works fine):

1) make a full backup to dir “1”
2) make incrementals to dir “n”
3) if “n” > n-max, stop!
4) make a new full backup by a restore run to an archive (complete roll-forward, step by step…)
(not to mention the pauses between incremental backup runs…)
5) the result is again a “full backup” (fully prepared), stored in dir “1”

This also works fine when using qpress (which you _should_ use because of file size and restore speed).

I scripted something like a robot that runs on all nodes and copies their data to a central fileserver/storage.

I found out there are some things to know:
– if your data turns over about once a day (many updates), the incremental steps can be as big as a full backup, depending on the frequency of the backups (pause time)
– if you have an extremely large database whose data does not change much, incremental backups are the best you can do