PITR restore hung with duplicate key error

Done

Description

It seems that HAProxy is exposed while the PITR restore pod is still running. If clients write through HAProxy during that window and their transactions conflict with the ones being replayed, the PITR pod gets stuck on a duplicate key error.
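The conflict described above can be sketched with a small simulation. It uses an in-memory SQLite database as a stand-in for MySQL, and the table `t`, the row values, and the function `simulate_pitr` are all hypothetical names chosen for illustration. It is not the operator's restore logic, only a model of why a write that arrives between the full restore and the binlog replay can collide with a replayed insert.

```python
import sqlite3


def simulate_pitr(proxy_exposed: bool) -> str:
    """Model a full restore followed by point-in-time replay.

    SQLite stands in for MySQL here; names and data are illustrative only.
    """
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE t (id INTEGER PRIMARY KEY, payload TEXT)")

    # Step 1: the full backup is restored (rows that existed at backup time).
    db.executemany(
        "INSERT INTO t (id, payload) VALUES (?, ?)",
        [(1, "a"), (2, "b"), (3, "c")],
    )

    # Inserts recorded after the backup and before the PITR target time.
    recorded_events = [(4, "d"), (5, "e")]

    if proxy_exposed:
        # A client reconnects through the proxy before replay finishes.
        # The database assigns the next free id, which is 4.
        db.execute("INSERT INTO t (payload) VALUES (?)", ("new write",))

    # Step 2: replay the recorded inserts with their original ids.
    try:
        for row in recorded_events:
            db.execute("INSERT INTO t (id, payload) VALUES (?, ?)", row)
    except sqlite3.IntegrityError as exc:
        return f"stuck: {exc}"
    return "restored"


if __name__ == "__main__":
    print(simulate_pitr(proxy_exposed=True))
    print(simulate_pitr(proxy_exposed=False))
```

With the proxy closed during replay, the recorded inserts apply cleanly. With it open, the client's new row takes an id that a recorded insert also needs, which matches the symptom reported here.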

Test:

Create database and table

Create a script that inserts data, and run it

Create a script that shows the latest inserts, and run it

Current Inserts at 3:30PM PHT:

Scheduled Full Backup at 3:30PM PHT:

Current inserts at 3:33PM PHT:

Stop writes and truncate table at 3:35PM PHT

Content of table at 3:35PM PHT

Apply PITR up to 3:34PM PHT

Start the insert script again. At this point it cannot connect.

The restore process has begun. Strangely, however, HAProxy is already allowing external access to MySQL while the PITR restore job is still running, as shown at the very bottom of this output:

From the select script you can see that the full backup has been restored but PITR has not been applied. New data has also been inserted into the table:

From the logs, you can see that the restore pod is still running but cannot apply PITR because of a duplicate key error:

On the Everest side, the restore is still in the “Restoring” state.

Environment

None

Attachments

Details
Assignee: ege.gunes
Reporter: Jaime Sicam
Labels: P3
Needs QA: Yes
Fix versions: 1.16.0
Affects versions: 1.14.0
Priority: Medium

Created July 16, 2024 at 1:23 AM

Updated December 19, 2024 at 8:11 PM

Resolved December 2, 2024 at 8:57 AM

Activity

Eleonora Zinchenko, November 21, 2024 at 10:27 AM

Hi,

Verified. Proxies are scaled down to 0 during PITR restore, and PITR finishes successfully. We agreed to add a check to the tests that proxies are scaled down during PITR restore, so I am moving the task back to In Progress.
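A check of this kind could look roughly like the sketch below. It assumes the proxy is managed by a Kubernetes workload object (for example a StatefulSet) whose JSON has been fetched separately, and it only reads the standard `spec.replicas` and `status.readyReplicas` fields. The function name `proxies_scaled_down` and the sample object are hypothetical, and this is not the project's actual test code.

```python
import json


def proxies_scaled_down(workload_json: str) -> bool:
    """Return True if the workload requests 0 replicas and none are ready.

    `workload_json` is the JSON of a StatefulSet or Deployment, obtained
    by whatever means the test harness uses (an assumption here).
    """
    obj = json.loads(workload_json)
    # spec.replicas defaults to 1 when it is omitted.
    desired = obj.get("spec", {}).get("replicas", 1)
    # status.readyReplicas may be absent when no pod is ready.
    ready = obj.get("status", {}).get("readyReplicas", 0)
    return desired == 0 and ready == 0


if __name__ == "__main__":
    # Hypothetical object shape captured while a restore is in progress.
    during_restore = json.dumps({"spec": {"replicas": 0}, "status": {}})
    print(proxies_scaled_down(during_restore))
```

Checking both the desired and the ready count avoids passing the check while old proxy pods are still terminating.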

Slava Sarzhan, November 8, 2024 at 10:03 AM

Hi, thank you for the task. We have improved the PITR restore behavior: during PITR the operator will not start the proxy pods (they stay scaled down). This was added in 1.16.0. You can use the main image if you want to test it.
