Physical restore fails with "there was an index build in progress"

Description

In some cases, a physical restore can fail with an error stating that "there was an index build in progress". This happens after PBM starts a node in standalone mode as part of the restore operation and then tries to shut it down to continue the restore.

The relevant part of the code is

We should perform that shutdown with force: true to make sure it completes. Otherwise, the cluster is left in a broken state.


Refinement comment:

  • Using “force” is not necessarily the best solution; we might investigate a more graceful one.

  • Maybe we should retry a few times before falling back to the forced shutdown.

  • Can we reproduce it? It will be complex anyway…
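
The "retry, then force" idea from the refinement comment could look roughly like the sketch below. It is not PBM's code: run_command is a hypothetical callable that sends an admin command to the standalone node, and it is assumed to return normally when the shutdown is accepted and to raise when the server refuses it (for example because of an index build in progress). The retry count and delay are arbitrary placeholders. The only MongoDB-specific parts are the shutdown command and its force option.

```python
import time
from typing import Any, Callable, Dict


def shutdown_command(force: bool) -> Dict[str, Any]:
    """Build the MongoDB `shutdown` admin command document."""
    cmd: Dict[str, Any] = {"shutdown": 1}
    if force:
        cmd["force"] = True
    return cmd


def shutdown_with_fallback(
    run_command: Callable[[Dict[str, Any]], Any],  # hypothetical sender
    retries: int = 3,          # assumed value, not taken from PBM
    delay_secs: float = 1.0,   # assumed value, not taken from PBM
) -> bool:
    """Try a graceful shutdown up to `retries` times, then force it.

    Returns True if the forced shutdown was needed, False otherwise.
    """
    for attempt in range(retries):
        try:
            run_command(shutdown_command(force=False))
            return False
        except Exception:
            # Broad catch for the sketch only; real code should match
            # the specific error returned by the server.
            if attempt < retries - 1:
                time.sleep(delay_secs)
    # All graceful attempts were refused: fall back to force: true.
    run_command(shutdown_command(force=True))
    return True
```

With a real driver, a successful shutdown usually surfaces as a dropped connection rather than a normal reply, so the callable would have to translate that case into a normal return before this logic is usable.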

Environment

None

Activity

Jan Mynar June 27, 2024 at 9:52 AM

Please, can we get the mongo logs, filelist.pbm (list of files), the backup metadata, and possibly some “steps to reproduce”?

Done

Details

Needs QA

Yes

Created June 25, 2024 at 1:39 PM
Updated last week
Resolved August 19, 2024 at 7:50 AM