Allow write through to parquet cache #25887

hiltontj · 2025-01-21T16:50:28Z

Problem statement

The parquet cache currently only has one way to populate, which is via a GET request to the object store. See:

influxdb/influxdb3_cache/src/parquet_cache/mod.rs

Lines 43 to 51 in d1fd155

    
           /// A request to fetch an item at the given `path` from an object store 
        
           /// 
        
           /// Contains a notifier to notify the caller that registers the cache request when the item 
        
           /// has been cached successfully (or if the cache request failed in some way) 
        
           #[derive(Debug)] 
        
           pub struct CacheRequest { 
        
               path: Path, 
        
               notifier: oneshot::Sender<()>, 
        
           }

This fetch-based method of populating the cache is still needed, but seems like an inefficient option where it is used from the write buffer.

Currently, during the snapshot process, once a parquet file is persisted, we submit a cache request, which will fetch the parquet data that was just written to object store. We should be able to cache the written bytes directly, vs. having to do this additional request to the object store.

Proposed solution

Expand the CacheRequest type into an enum with variants to support:

Fetch-based cache request (what it does currently)
Write-through cache request

The latter will accept bytes, somehow, and write them into the cache for a given object store path.

The text was updated successfully, but these errors were encountered:

closes: #25887

part of: #25887

hiltontj added the v3 label Jan 21, 2025

praveen-influx added a commit that referenced this issue Jan 23, 2025

feat: first stab at locally updating parquet cache

d9a48d7

closes: #25887

praveen-influx linked a pull request Jan 23, 2025 that will close this issue

feat: first stab at locally updating parquet cache #25904

Draft

praveen-influx added a commit that referenced this issue Jan 23, 2025

refactor: use enums to separate out the modes

e66107d

part of: #25887

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow write through to parquet cache #25887

Allow write through to parquet cache #25887

hiltontj commented Jan 21, 2025

Allow write through to parquet cache #25887

Allow write through to parquet cache #25887

Comments

hiltontj commented Jan 21, 2025

Problem statement

Proposed solution