[FIXED] Make sure to process extended purge operations correctly when being replayed. #4212

derekcollison · 2023-06-04T00:58:26Z

This is an extension to the excellent work by @MauriceVanVeen and his original PR #4197, to fully resolve the issue for all use cases.

Signed-off-by: Derek Collison <derek@nats.io>

Resolves #4196

wallyqs

LGTM

MauriceVanVeen · 2023-06-04T10:09:33Z

Awesome! 🎉

I believe there might be a bug when both sequence and keep are set, though. I have opened a PR with a potential fix and an explanation of the issue.

PR #4212 fixed the issue I reported in #4196. However, I believe there might be a bug when both `sequence` and `keep` are set during recovery.

In the `PurgeEx` the following check is done (for both `filestore.go` and `memstore.go`):

```go
if sequence > 1 && keep > 0 {
	return 0, ErrPurgeArgMismatch
}
```

The `TestJetStreamClusterPurgeExReplayAfterRestart` also triggers this case, meaning that during the test this error is returned but it succeeds because the purge was already performed. Is this intended behaviour?

To elaborate a bit more, I believe the following happens:

- when running the purge normally it will properly run the `keep` (since it's not combined with `sequence` yet)
- when replaying the purge though, the `sequence` is added to the `keep`, which errors out in the above if

Which means that during normal operation all will be well, but purges with `keep` will be ignored upon replaying.

I'm proposing to remove the `sequence > 1 && keep > 0` check and subsequent error. Which, for reference, was introduced in #3121. Hoping this ensures that during recovery, purges that haven't executed yet will still be executed.

An alternative approach, which wouldn't remove the error: not allow combining `sequence` and `keep` normally and only allowing it during recovery. Which would preserve the current behaviour, and correctly apply `sequence+keep` during recovery still. However, not sure if it's possible to know if we're in "recovery mode" from within `PurgeEx`.

Resolves #4196

Make sure to process extended purge operations correctly when being r…

dee5324

…eplayed on a restart. Signed-off-by: Derek Collison <derek@nats.io>

derekcollison requested a review from a team as a code owner June 4, 2023 00:58

wallyqs approved these changes Jun 4, 2023

derekcollison merged commit e1f8064 into main Jun 4, 2023
2 checks passed

derekcollison deleted the fix-4196 branch June 4, 2023 01:12

derekcollison mentioned this pull request Jun 4, 2023

fix: clustered stream recovery, ensure PurgeEx is executed instead of full purge #4197

Closed

7 tasks

MauriceVanVeen mentioned this pull request Jun 4, 2023

Fix PurgeEx replay with sequence & keep succeeds #4213

Merged

MauriceVanVeen referenced this pull request Jul 11, 2023

allow purging up to the maxSequence

c40a6a3
