Fix blob storage #3502

ma2bd · 2025-03-08T05:15:04Z

Motivation

As noted by @MathieuDutSik in "Add the cache_key_absence to the LruCachingConfig" (#3496), LRU caching is incorrect outside of views.
However, journaling outside of views is also incorrect.

Proposal

On top of #3501, we simply deactivate the features that require exclusive access to an object in storage.

Journaling is not authorized.
LRU cache should never insert None but instead forget about the deleted key. (Not that we delete blobs but who knows in the future.)

We may want to rename connect and clone_with_root_keys later in another PR.
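
A minimal sketch of the caching rule described above, assuming a simplified cache with insertion-order eviction. `ReadCache`, `forget`, `insert` and `query` are hypothetical names used for illustration; this is not the code in lru_caching.rs. Only the `has_exclusive_access` flag is taken from the PR.

```rust
use std::collections::{HashMap, VecDeque};

/// Hypothetical, simplified read cache (not the real `lru_caching.rs` type).
/// A cached `None` means "this key is known to be absent".
struct ReadCache {
    map: HashMap<Vec<u8>, Option<Vec<u8>>>,
    /// Keys in insertion order, oldest first; used for eviction.
    queue: VecDeque<Vec<u8>>,
    max_entries: usize,
    /// Whether we have exclusive R/W access to the keys under the root key.
    has_exclusive_access: bool,
}

impl ReadCache {
    fn new(max_entries: usize, has_exclusive_access: bool) -> Self {
        Self {
            map: HashMap::new(),
            queue: VecDeque::new(),
            max_entries,
            has_exclusive_access,
        }
    }

    /// Drops a key from both the map and the eviction queue.
    fn forget(&mut self, key: &[u8]) {
        if self.map.remove(key).is_some() {
            self.queue.retain(|k| k.as_slice() != key);
        }
    }

    /// Records the outcome of a read or a write for `key`.
    fn insert(&mut self, key: Vec<u8>, value: Option<Vec<u8>>) {
        self.forget(&key);
        if value.is_none() && !self.has_exclusive_access {
            // Another client may create the key later, so absence must not
            // be cached: the key is simply forgotten.
            return;
        }
        self.map.insert(key.clone(), value);
        self.queue.push_back(key);
        if self.queue.len() > self.max_entries {
            if let Some(oldest) = self.queue.pop_front() {
                self.map.remove(&oldest);
            }
        }
    }

    /// Returns `None` on a cache miss and `Some(None)` for a cached absence.
    fn query(&self, key: &[u8]) -> Option<&Option<Vec<u8>>> {
        self.map.get(key)
    }
}
```

With `has_exclusive_access` set to false, a deleted or missing key always results in a cache miss, so the next read goes to storage. With exclusive access, the absence itself can be served from the cache.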

Test Plan

CI
Verified that this solves the issue with "Write committees as blobs to storage" (#3453).

Release Plan

In principle, this should be backported to the latest testnet branch.

afck

Looks good to me!

The assumption that after clone_with_root_key we have exclusive access is not something that storage guarantees, is it? It's just that in our use case, all cloned stores are for root views, and those are always only accessed by a single worker?

linera-views/src/backends/journaling.rs

rust-toolchain.toml

linera-views/src/backends/lru_caching.rs

MathieuDutSik · 2025-03-08T11:59:21Z

linera-views/src/backends/lru_caching.rs

+    /// Whether we have exclusive R/W access to the keys under the root key of the store.
+    has_exclusive_access: bool,


The name has_exclusive_access describes the field adequately.
But the behavior in the !has_exclusive_access case makes sense only in a system where keys are written once and never modified or erased. That needs to be said. But yes, it is better than cache_key_absence in PR #3496.

The problem with this design is that it forces an API. Conceivably, we could call clone_with_root_key and not have exclusive access to the result. Conversely, we could connect to a store and have exclusive access to it. Therefore, I think it would make sense to make this an input parameter of LruCachingConfig.

We need to rename clone_with_root_key. In fact, the types returned by connect and clone_with_root_key should probably be different.
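
One way to picture the "different types" idea is a marker type parameter that records the access mode. The sketch below is purely illustrative: `SketchStore`, `SharedAccess`, `ExclusiveAccess` and `can_journal` are hypothetical names, and the simplified `connect` and `clone_with_root_key` signatures are assumptions that do not match the real linera-views API.

```rust
use std::marker::PhantomData;

/// Marker: other clients may read and write the same keys.
struct SharedAccess;
/// Marker: the caller promises to be the only reader and writer under the root key.
struct ExclusiveAccess;

/// Hypothetical store handle; `Access` records the access mode in the type.
struct SketchStore<Access> {
    root_key: Vec<u8>,
    _access: PhantomData<Access>,
}

impl SketchStore<SharedAccess> {
    /// Stand-in for `connect`: the handle is shared.
    fn connect() -> Self {
        SketchStore { root_key: Vec::new(), _access: PhantomData }
    }

    /// Stand-in for `clone_with_root_key`: the result has a different type.
    /// Exclusivity is a promise made by the caller; storage does not enforce it.
    fn clone_with_root_key(&self, root_key: &[u8]) -> SketchStore<ExclusiveAccess> {
        SketchStore { root_key: root_key.to_vec(), _access: PhantomData }
    }
}

impl<Access> SketchStore<Access> {
    /// Available in both access modes.
    fn root_key(&self) -> &[u8] {
        &self.root_key
    }
}

impl SketchStore<ExclusiveAccess> {
    /// Features that need exclusivity exist only on exclusive handles,
    /// so misuse on a shared handle fails at compile time.
    fn can_journal(&self) -> bool {
        true
    }
}
```

In this sketch, calling `can_journal` on the value returned by `connect` does not compile. The trade-off raised in the comment above still applies: tying exclusivity to one constructor forces an API, whereas a configuration flag leaves the choice to the caller.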

MathieuDutSik · 2025-03-08T13:10:42Z

linera-views/src/backends/journaling.rs

+            if !self.has_exclusive_access {
+                return Err(JournalConsistencyError::JournalRequiresExclusiveAccess.into());
+            }


This check is indeed necessary for the system to work correctly.
However, I would say that it defeats the purpose of journaling.

The problem is that what is stored as blobs is large data, such as compiled Wasm code. Given the size limits of the fast path, this could be a problem. Still, I have not been able to trigger the problem.

Right now, I do not see how to address this issue and have both journaling and non-exclusive access.
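
To make the concern concrete, here is a minimal sketch of how such a guard interacts with a size-limited fast path. `JournalingSketch`, `WritePath`, `JournalError`, `plan_write` and `max_fast_path_bytes` are hypothetical names chosen for illustration; the actual logic in journaling.rs is more involved.

```rust
/// Hypothetical error type, standing in for `JournalConsistencyError`.
#[derive(Debug, PartialEq)]
enum JournalError {
    RequiresExclusiveAccess,
}

/// How a batch would be written in this sketch.
#[derive(Debug, PartialEq)]
enum WritePath {
    /// The batch fits within the fast-path limit and is written directly.
    Direct,
    /// The batch is split into journal blocks that are replayed afterwards.
    Journaled { blocks: usize },
}

/// Hypothetical, simplified view of a journaling store's configuration.
struct JournalingSketch {
    /// Assumed size limit of a single direct write, in bytes.
    max_fast_path_bytes: usize,
    /// Whether we have exclusive R/W access to the keys under the root key.
    has_exclusive_access: bool,
}

impl JournalingSketch {
    fn plan_write(&self, batch_bytes: usize) -> Result<WritePath, JournalError> {
        if batch_bytes <= self.max_fast_path_bytes {
            return Ok(WritePath::Direct);
        }
        // The journal lives under the store's own keys, so concurrent writers
        // could corrupt it: refuse to journal without exclusive access.
        if !self.has_exclusive_access {
            return Err(JournalError::RequiresExclusiveAccess);
        }
        // Round up; `max(1)` only guards against a zero limit.
        let limit = self.max_fast_path_bytes.max(1);
        let blocks = (batch_bytes + limit - 1) / limit;
        Ok(WritePath::Journaled { blocks })
    }
}
```

Under these assumptions, a large blob written through a non-exclusive store returns an error instead of being journaled, which is the situation described above.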

Co-authored-by: Andreas Fackler <[email protected]> Signed-off-by: Mathieu Baudet <[email protected]>

MathieuDutSik · 2025-03-08T15:09:03Z

linera-views/tests/store_tests.rs

    let store = linera_views::dynamo_db::DynamoDbStore::new_test_store()
        .await
        .unwrap();
+    let store = store.clone_with_root_key(&[]).unwrap();


I do not think that is needed as the run_big_write_read is very much not cached.

This seems to be required:
https://github.com/linera-io/linera-protocol/actions/runs/13734903398/job/38417381624

It's more about unsafe journaling actually

MathieuDutSik · 2025-03-08T15:25:16Z

linera-views/tests/store_tests.rs

+    use linera_views::store::AdminKeyValueStore as _;
+
+    for scenario in get_random_test_scenarios() {
+        let store = linera_views::scylla_db::ScyllaDbStore::new_test_store()
+            .await
+            .unwrap();
+        let store = store.clone_with_root_key(&[]).unwrap();
+        run_reads(store, scenario).await;
+    }
+}
+
+#[cfg(with_scylladb)]
+#[tokio::test]
+async fn test_reads_scylla_db_no_journaling() {


Running those tests locally, I cannot see any difference in timing, and running both tests takes the same time as running a single one.

Also, I object to the no_journaling terminology:

Journaling is still present in the store. If we do not have exclusive access, then a journaled operation fails.

The difference is potentially about caching, not journaling. And there is still caching: values are cached, just not the absence of values.

But actually there is no difference. The only difference between exclusive and non-exclusive access is in the handling of absent keys: without exclusive access, absence is not cached.

ok fair. let me rename to test_reads_scylla_db_no_root_key

MathieuDutSik · 2025-03-08T15:29:25Z

linera-views/src/backends/lru_caching.rs

+        match &self.lru_read_values {
+            None => (),
+            Some(lru_read_values) => {
+                let mut lru_read_values = lru_read_values.lock().unwrap();
+                lru_read_values.has_exclusive_access = true;
+            }
+        }
+    }


This can be shortened to:

        if let Some(lru_read_values) = &self.lru_read_values {
            let mut lru_read_values = lru_read_values.lock().unwrap();
            lru_read_values.has_exclusive_access = true;
        }

This could also be inlined in the clone_with_root_key.

MathieuDutSik

Thanks for addressing the issue.

ma2bd · 2025-03-08T16:56:38Z

Looks good to me!

The assumption that after clone_with_root_key we have exclusive access is not something that storage guarantees, is it? It's just that in our use case, all cloned stores are for root views, and those are always only accessed by a single worker?

Yes that's correct

ma2bd requested review from MathieuDutSik and afck March 8, 2025 05:27

ma2bd force-pushed the fix_blob_storage branch from 4d8b7da to 69e1f12 March 8, 2025 06:15

ma2bd mentioned this pull request Mar 8, 2025

Verify that journaling is never triggered outside of views #3503

Open

ma2bd force-pushed the fix_blob_storage branch from 69e1f12 to bae97ee March 8, 2025 06:26

afck reviewed Mar 8, 2025

MathieuDutSik reviewed Mar 8, 2025

ma2bd added 6 commits March 8, 2025 09:26

refuse to use journaling outside of views

380d175

do not cache absence of keys outside of views

a0fdb0b

nits

a9c380d

delete entries from the queue as well

f709185

fix test_dynamo_db_big_write_read and test_reads_dynamo_db

8f64109

more fixes in the DB tests

61e44b0

ma2bd force-pushed the fix_blob_storage branch from 5e9d76d to 61e44b0 March 8, 2025 14:26

ma2bd and others added 2 commits March 8, 2025 09:27

Update linera-views/src/backends/journaling.rs

bd7cbf7

Co-authored-by: Andreas Fackler <[email protected]> Signed-off-by: Mathieu Baudet <[email protected]>

Update linera-views/src/backends/lru_caching.rs

bb16c44

Co-authored-by: Andreas Fackler <[email protected]> Signed-off-by: Mathieu Baudet <[email protected]>

ma2bd mentioned this pull request Mar 8, 2025

Add the cache_key_absence to the LruCachingConfig. #3496

Closed

MathieuDutSik reviewed Mar 8, 2025

MathieuDutSik approved these changes Mar 8, 2025

address reviewer's comments

fcc5b49

ma2bd merged commit 7cae5f9 into linera-io:main Mar 8, 2025
25 checks passed

ma2bd deleted the fix_blob_storage branch March 8, 2025 16:55

MathieuDutSik mentioned this pull request Mar 8, 2025

Add the test for the behavior of the cache in the shared scenario. #3511

Open
