Issues

Select view

List view

Detail view

Select search mode

Basic

JQL

Unable to delete failed backup jobs
PBM-927
Resolved issue: PBM-927
Fix replaying oplog on system collections during the restore
PBM-871
Resolved issue: PBM-871
by stopping the pbm-agent with the unit systemd, the agent status remains ok
PBM-783
Resolved issue: PBM-783
Restore failures are not reported
PBM-745
Resolved issue: PBM-745
Review read/write concerns for pbm* collections
PBM-741
Resolved issue: PBM-741
pbm status fails when pbm user's password uses special symbols
PBM-736
Resolved issue: PBM-736
Add proper error message when PBM agents aren't available
PBM-731
PBM backup erroring out with (CursorNotFound) cursor id not found / Mux ending but selectCases still open
PBM-730
Resolved issue: PBM-730
make docker image universal
PBM-726
Resolved issue: PBM-726
restore fails with applyOps: (Location10065) invalid parameter: expected an object ()
PBM-725
Resolved issue: PBM-725
Unique name for restore operation
PBM-723
Resolved issue: PBM-723
Fix pbm-agent crash during the delete-pitr request execution if there is nothing to delete
PBM-722
Resolved issue: PBM-722
PBM: retry upload if it fails in S3
PBM-721
Resolved issue: PBM-721
"pbm delete-pitr" doesn't remove pitr slices
PBM-717
Resolved issue: PBM-717
Fix backup and PITR routines alignment algorithm to avoid backup failure
PBM-714
Resolved issue: PBM-714
Paging for CLI pbm logs
PBM-713
Avoid writing 'read/write on closed pipe' error in logs on expected connection closure
PBM-705
Resolved issue: PBM-705
PITR restore fails due to error "Failed to apply operation due to missing collection config.transactions"
PBM-703
Resolved issue: PBM-703
Prevent restore to time which isn't covered by PITR chunks
PBM-701
Resolved issue: PBM-701
Add support of MongoDB 5.0 TS collections
PBM-697
Resolved issue: PBM-697

20 of 20

Unable to delete failed backup jobs

Incomplete

General

Escalation

General

Escalation

Description

This issue occurred in a production environment using 1.6.0. I have not managed to reproduce the initial race condition in 1.8.1, but I have confirmed the resulting inability to delete jobs in a bad state still applies which is what I consider to be the bug.

Steps to reproduce:

PBM is configured to take PTIR backups every 10 minutes
A full backup is triggered
The full backup fails due to starting ac the exact point a PITR backup was being taken. This caused the full backup to fail.
There is no way to delete the failed job record, except by manual deletion from Mongo or forcing a resync. There may be circumstances where forcing a resync doesn't work, as job results have been written to disk.

You will note from the timestamps this occurred a while ago. The impact of the failure has only recently come to light, as cleanup of old backups failed.

Logs from the initial backup failure:

The above failure created the following record in pbmBackups:

Which pbm status 1.6.0 reported as:

pbm status 1.8.1 reports the same status as

From my perspective, the bug isn't that the backup failed – failures can happen for lots of reasons. The bug is that PBM profiles no way to delete the failed job without manual intervention:

It should be possible to force PBM to cleanup any traces of an incomplete backup, possibly with an extra flag?

Environment

None

Smart Checklist

Details
Assignee
Unassigned
Reporter
Daniel Oliver
Affects versions
1.6.0
1.8.1
Priority
Medium

Smart Checklist

Created August 19, 2022 at 11:06 AM

Updated December 10, 2023 at 8:36 AM

Resolved December 10, 2023 at 8:36 AM

Configure

Activity

Show:

Aaditya DubeyDecember 10, 2023 at 8:36 AM

Hi ,

Closing the report, no activity for a long!

Aaditya DubeyJanuary 27, 2023 at 2:19 PM

Hi ,

Thank you for the report.
Please let me know if issue is still persists.

Issues

Unable to delete failed backup jobs

Description

Environment

Smart Checklist

DetailsAssigneeUnassignedUnassignedReporterDaniel OliverDaniel OliverAffects versions1.6.01.8.1PriorityMedium

Details

Assignee

Reporter

Affects versions

Priority

Smart ChecklistOpen Smart Checklist

Smart Checklist

Activity

Aaditya DubeyDecember 10, 2023 at 8:36 AM

Aaditya DubeyJanuary 27, 2023 at 2:19 PM

Details
Assignee
Unassigned
Reporter
Daniel Oliver
Affects versions
1.6.0
1.8.1
Priority
Medium

Smart Checklist