Bug 1817036
Summary: | Cleanup race with wsrep_rsync_sst_tunnel may prevent full galera cluster bootstrap | |||
---|---|---|---|---|
Product: | Red Hat Enterprise Linux 8 | Reporter: | Damien Ciabrini <dciabrin> | |
Component: | mariadb | Assignee: | Michal Schorm <mschorm> | |
Status: | CLOSED CURRENTRELEASE | QA Contact: | Lukáš Zachar <lzachar> | |
Severity: | high | Docs Contact: | ||
Priority: | unspecified | |||
Version: | 8.0 | CC: | databases-maint, dciabrin, hhorak, michele, mschorm | |
Target Milestone: | rc | Keywords: | ZStream | |
Target Release: | 8.0 | |||
Hardware: | Unspecified | |||
OS: | Unspecified | |||
Whiteboard: | ||||
Fixed In Version: | Doc Type: | If docs needed, set a value | ||
Doc Text: | Story Points: | --- | ||
Clone Of: | ||||
: | 1899021 1899022 1899024 1899025 (view as bug list) | Environment: | ||
Last Closed: | 2021-01-15 15:41:45 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | ||||
Bug Blocks: | 1856699, 1899021, 1899022, 1899024, 1899025 |
Description
Damien Ciabrini
2020-03-25 13:08:57 UTC
Fixed upstream in https://github.com/dciabrin/wsrep_sst_rsync_tunnel/commit/590d23665dae33940e15ef48c2e28737ec151953 Hi, Here's the test build for you: http://people.redhat.com/mschorm/.rhbz-1817036/ Please let us know if that works for you (both the build itself and the way I gave you access to it) Hey Michal sorry I overlook the answer...
> Please let us know if that works for you (both the build itself and the way
> I gave you access to it)
Thanks for the repo that made my life easier.
The build works just fine thank you. Here's how I validated it:
1. start from an empty galera database, and configure galera to use TLS and the tunnelled SST script wsrep_sst_rsync_tunnel:
wsrep_provider_options = gmcast.listen_addr=tcp://172.17.54.218:4567;socket.ssl_key=/tls/mysql.key;socket.ssl_cert=/tls/mysql.crt;socket.ssl_ca=/tls/all-mysql.crt;
wsrep_sst_method=rsync_tunnel
[sst]
tca=/tls/all-mysql.crt
tcert=/tls/mysql.pem
sockopt="verify=1"
2. confirm that the cluster start and SST runs properly
3. stop the cluster, and tweak the wsrep_sst_rsync_tunnel script to add an additional delay that would cause the issue 100% before the fix:
[...]
echo "done $STATE"
>>> sleep 8
[...]
4. clean grastate and restart the cluster to force SST at bootstrap
rm -rf /var/lib/mysql/grastate.dat
I confirm the cluster restarts just fine, so the problem is fixed.
|