Cascading disk fill-up due to WAL file accumulation

Description

In the Everest GitHub repo, a user reported WAL file accumulation on the repo host that eventually filled up its PVC. The primary and replica pods then filled up their respective PVCs as well, and the cluster crashed.

https://github.com/percona/everest/issues/781

We need to investigate why the WAL files accumulated and assess possible solutions.

Environment

None

Attachments

21 Nov 2024, 02:37 PM

Details
Assignee
Unassigned
Reporter
Diogo Recharte
Needs QA
Yes
Affects versions
2.4.1
Priority
Medium


Created November 21, 2024 at 2:37 PM

Updated November 21, 2024 at 3:04 PM

Activity

Charly Batista, November 21, 2024 at 3:04 PM

Based on the symptoms, the problem seems to be related to a replication slot. My theory is that a replication slot was used by a replica that crashed and was later replaced by the operator, or never replaced, and that this leftover slot prevented the primary from removing old WAL files. Another possibility is that the WAL retention configuration wasn't ideal.
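
One way to test the replication slot theory is to look for inactive slots that still pin a large amount of WAL on the primary. The following is a minimal sketch, not part of the operator or of the original report. The names `Slot`, `suspicious_slots`, and the threshold value are hypothetical. It assumes the rows were collected from the standard `pg_replication_slots` view together with `pg_current_wal_lsn()`, and it relies on the PostgreSQL LSN text format of two hexadecimal numbers separated by a slash.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Query one could run on the primary to collect the inputs.
# pg_replication_slots and pg_current_wal_lsn() are standard PostgreSQL;
# how the rows are fetched (psql, a driver, etc.) is left out on purpose.
SLOT_QUERY = """
SELECT slot_name, active, restart_lsn, pg_current_wal_lsn()
FROM pg_replication_slots
"""


@dataclass
class Slot:
    """One row of pg_replication_slots (hypothetical container for this sketch)."""
    slot_name: str
    active: bool
    restart_lsn: Optional[str]  # NULL when the slot has never reserved WAL


def lsn_to_int(lsn: str) -> int:
    """Convert an LSN such as '0/5000000' to its 64-bit byte position."""
    high, low = lsn.split("/")
    return (int(high, 16) << 32) + int(low, 16)


def suspicious_slots(
    slots: List[Slot], current_lsn: str, threshold_bytes: int
) -> List[Tuple[str, int]]:
    """Return (slot_name, retained_bytes) for inactive slots that hold back
    at least threshold_bytes of WAL, largest first."""
    current = lsn_to_int(current_lsn)
    flagged = []
    for slot in slots:
        if slot.restart_lsn is None:
            continue  # nothing reserved, so this slot retains no WAL
        retained = current - lsn_to_int(slot.restart_lsn)
        if not slot.active and retained >= threshold_bytes:
            flagged.append((slot.slot_name, retained))
    return sorted(flagged, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Illustrative values only, not data from the reported cluster.
    rows = [
        Slot("replica_a", True, "0/5000000"),
        Slot("replica_old", False, "0/1000000"),
        Slot("unused", False, None),
    ]
    for name, retained in suspicious_slots(rows, "0/5000000", 16 * 1024 * 1024):
        print(f"{name}: {retained} bytes of WAL retained")
```

An inactive slot that appears in this output would be consistent with the theory above. An empty result would point more toward the WAL retention or archiving configuration.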
