Issues
- PXC-4453: Flow control flapping hangs the cluster (Resolved; Kamil Holubicki)
- PXC-4436: Telemetry Ph1 - adjustments needed for PXC (Resolved; Kamil Holubicki)
- PXC-4428: Garbd terminates SST script's children after SST is complete (Resolved; Kamil Holubicki)
- PXC-4408: PXC 8.4.0 (Resolved; Kamil Holubicki)
- PXC-4367: Innodb semaphore wait timeout failure after upgrade from 8.0.34 to 8.0.35 (Resolved; Kamil Holubicki)
- PXC-4362: PXC node evicted when create function by user don't have super privilege and binary loggin is enabled (Resolved; Kamil Holubicki)
- PXC-4348: Cluster state interruption with MDL BF-BF conflict and exec-mode:toi (Resolved; Kamil Holubicki)
- PXC-4313: Statements executed in RSU mode generate local GTID events (Resolved; parveez.baig)
- PXC-4298: ALTER TABLE causes wsrep_cluster_status Disconnected (Resolved)
On PXC 8.0.36, a flapping flow control scenario may hang the cluster in a multi-writer environment. It also affects 5.7.44 and 5.7.25.
InnoDB status from the affected node shows threads in replicating state:
The receive queue does not show write-sets:
And flow control is still active:
Nodes 2 and 3 also show flow control as active:
Killing the threads doesn't fix the issue; the node needs to be restarted to recover the cluster:
How to repeat:
Use the attached my.cnf to create a 3-node PXC 8.0.36 cluster.
Create the following tables:
On node 1, configure an 8M redo log and strict durability settings:
On node 1, run the following command to produce a flow control flapping behavior:
And run the following workload:
On node 2, run the following commands:
Monitor flow control on node 1; you may need to add more inserts if the flapping only happens every several seconds.
Since it’s a race condition, it may take seconds to minutes to trigger the bug.
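For the monitoring step, here is a minimal sketch of one way to watch for flapping from the shell. It is not part of the original report. The helper names (`fc_status`, `count_flaps`, `watch_flaps`) are hypothetical, and it assumes the `mysql` client can log in to node 1 without prompting (for example via `~/.my.cnf`) and that the node exposes the PXC status variable `wsrep_flow_control_status`.

```shell
#!/bin/sh
# Hypothetical helpers for watching flow control on a PXC node.
# Assumption: the mysql client can connect without prompting (e.g. ~/.my.cnf).

# Print the current value of wsrep_flow_control_status (ON or OFF).
fc_status() {
    mysql -N -B -e "SHOW GLOBAL STATUS LIKE 'wsrep_flow_control_status'" \
        | awk '{ print $2 }'
}

# Read one status value per line on stdin and print how many times the
# value changed between consecutive samples (ON->OFF or OFF->ON).
count_flaps() {
    awk 'NR > 1 && $1 != prev { n++ } { prev = $1 } END { print n + 0 }'
}

# Sample the status once per second, N times (default 30), and print
# the number of transitions observed.
watch_flaps() {
    samples=${1:-30}
    i=0
    while [ "$i" -lt "$samples" ]; do
        fc_status
        sleep 1
        i=$((i + 1))
    done | count_flaps
}
```

Running `watch_flaps 60` on node 1 prints the number of ON/OFF transitions seen over roughly a minute. One-second sampling can miss faster flaps, so a count of zero does not prove flow control was idle. Comparing `wsrep_flow_control_sent` before and after the run is a reasonable cross-check.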