Issues

Select view

List view

Detail view

Select search mode

Basic

JQL

2 cluster nodes go in to NON-PRIMARY state MDL BF-BF conflict
PXC-4512
A PXC node receiving write statements will become unresponsive when another node enters or leaves the cluster, and innodb_thread_concurrency is non-zero.
PXC-4400
FLUSH TABLES during writes to table stalls the cluster node
PXC-4399
Flapping node network makes the cluster Non-Primary
PXC-4380
Resolved issue: PXC-4380
Please provide glibc2.28 (el8) tarballs for PXC8
PXC-4374
Resolved issue: PXC-4374
table id problem
PXC-4366
Resolved issue: PXC-4366
Cluster state interruption with MDL BF-BF conflict and exec-mode:toi
PXC-4348
Resolved issue: PXC-4348
Seeming bug related to fulltext indexes on XtraDB Cluster 8.0.33-25-1
PXC-4347
Resolved issue: PXC-4347
Executing prepared statement can abort node after FLUSH TABLES
PXC-4341
Resolved issue: PXC-4341
Nonexisting Definer Creates Inconsistency
PXC-4324
Resolved issue: PXC-4324
long semaphore wait crash due to ha_commit_low does not commit an empty transaction
PXC-4318
Resolved issue: PXC-4318
Nodes "changing identity" can prevents primary groups
PXC-4316
Resolved issue: PXC-4316
No BF-abort but 'MDL conflict ... solved by abort' printed
PXC-4315
Resolved issue: PXC-4315
Statements executed in RSU mode generate local GTID events
PXC-4313
Resolved issue: PXC-4313
DROP EVENT IF EXISTS generates local GTID event
PXC-4312
Resolved issue: PXC-4312
Error logs are not rotated
PXC-4306
Resolved issue: PXC-4306
[DOC] Status variable wsrep_flow_control_requested missing in documentation
PXC-4303
Resolved issue: PXC-4303
GRANT statement may be replicated in a wrong way if partial_revokes=1
PXC-4302
Resolved issue: PXC-4302
ALTER TABLE causes wsrep_cluster_status Disconnected
PXC-4298
Resolved issue: PXC-4298
garbd 8.0.33 reports wrong version
PXC-4296
Resolved issue: PXC-4296
SST fails when cleaning the datadir
PXC-4292
Galera Arbitrator (garbd) uses 100% CPU
PXC-4288
Resolved issue: PXC-4288
mysqld is killed with signal 11 in case of sst error
PXC-4285
Document wsrep_provider_option - pc.wait_restored_prim_timeout
PXC-4202
Resolved issue: PXC-4202
Update wsrep_sst_method documentation (ist_only)
PXC-4201
Resolved issue: PXC-4201

25 of 25

2 cluster nodes go in to NON-PRIMARY state MDL BF-BF conflict

General

Escalation

General

Escalation

Description

We have a 3 node + 1 arbitrator cluster

During online schema change we hit the following errors on 2 of the nodes, all 3 nodes log included below.

001:

002

003

This was running during a load test - we also did some no-op DDL changes with online-schema change (10’s of them) and not every one of them broke.

We have HAProxy in front of them and do read/write splitting at the DNS level. 001 node receives all write traffic where 002 and 003 nodes receives read only traffic.

Environment

None

Details

Assignee

Aaditya Dubey

Reporter

Chris Jack

Needs QA

Yes

Time tracking

20m logged

Sprint

MySQL Sprint March 2025

Affects versions

Priority

High

Smart Checklist

Created September 26, 2024 at 3:37 PM

Updated March 2, 2025 at 8:00 PM

Configure

Activity

Show:

Kamil Holubicki December 13, 2024 at 4:07 PM
Edited

Hi , The issue didn’t progress.

Please follow these instructions to upload.

Scott Hooper December 13, 2024 at 2:45 PM

Does anyone know if this issue goes away with 8.0.39 which was just released?

Scott Hooper December 2, 2024 at 1:18 PM

We have a jenkins job that causes this one, we have sleeps in it to slow it down but it still happens from time to time. We mostly run it off hours to limit impact. If you can get me a secure sftp link like you all do for the memory dumps I can upload the jenkins jobs scriptw

Kamil Holubicki November 28, 2024 at 3:11 PM

In my opinion we have two similar situations:

Case 1:

This should be fixed in 8.0.36, 8.4.0 by

Case 2:

Here we see again a local thread (granted), being aborted by replication thread. It seems the granted thread didn’t release all MDL locks before releasing CommitOrder monitor in galera (before letting the other thread, which is replicaton thread, to go on). So most probably there is yet another execution path similar to the one fixed in .

Reproduction steps would be very helpful.

Aaditya Dubey November 28, 2024 at 2:54 PM
Edited

Hello

I’ve tried to repeat the issue from my end, but unfortunately, it is not repeating; please find my test case below:

Create the following tables:

Add the data to tables:
On session 1, run sysbench:
On session 2, run optimize table:
On session 3, run delete-insert:
On session 4, run rename table:
After some time, restart either node2 or node3.

Please check and let me know if it is the same scenario that is being executed in your environment. If not, please let me know what other processes need to run or if there is any specific order that can be used to repeat the issue.