Won't Do
Details
Assignee
UnassignedUnassignedReporter
Pablo HiguerasPablo HiguerasNeeds QA
YesAffects versions
Priority
Medium
Details
Details
Assignee
Unassigned
UnassignedReporter
Pablo Higueras
Pablo HiguerasNeeds QA
Yes
Affects versions
Priority
Smart Checklist
Smart Checklist
Smart Checklist
Created June 11, 2024 at 1:41 PM
Updated June 24, 2024 at 11:28 AM
Resolved June 24, 2024 at 11:28 AM
We have a 3 nodes cluster with 8.0.32 and we suffered a full cluster crush twice in a week for the same reason. I will explain last:
After trying to join a node (c39b) using
wsrep_sst_method=clone
, the new member try to sync himself with the others using IST and then it started a voting process after an ERROR happend for a foreign key constraint fail:All the rest of the nodes got stuck as well after the voting process and those started to abort connections too:
Assuming they all have
gcs.vote_policy=0
,it is assumed that the majority formed by nodes c39a and c39c should have been accepted, and it should not have caused a full cluster crash at all ( ):