Introduce artificial random storage service failures in storage-service
for simulations.
#3304
+127
−7
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Motivation
We had some problem with
ScyllaDb
running on the storage. Being able to reproduce problem is useful for finding unexpected schemes of failure.Proposal
A scheme is introduced for adding random read and/or write errors in the running of the storage service client. There is a different probability for read and for writes. This is a deterministic error introduction scheme in order to be as deterministic as possible.
Test Plan
The test did show some problems. Whether it is a true problem or some other errors remains to be determined.
The commands to get one problem is the following:
Running the
storage-service
:cargo run --release -p linera-storage-service -- memory --endpoint $LINERA_STORAGE_SERVICE
Running the validators:
For which we get the error
Release Plan
Follow the normal release plan.
Links
None