Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.springtail.io/llms.txt

Use this file to discover all available pages before exploring further.

Shared Memory Cache Subsystem

Overview

The Shared Memory Cache Subsystem enables efficient inter-process communication (IPC) between the XID Manager daemon, the PgXidSubscriberMgr daemon, and PostgreSQL Foreign Data Wrapper (FDW) processes. This subsystem allows multiple FDW processes to share system metadata and transaction mutation data without repeated RPC calls to backend services.

Key Benefits

  • Zero-copy data sharing between processes using Boost Interprocess shared memory
  • Reduced RPC overhead by caching frequently accessed metadata locally
  • Transaction-aware caching with XID (transaction ID) history tracking
  • Automatic staleness detection and cache refresh mechanisms
  • LRU-based memory management to prevent unbounded growth

Architecture

High-Level Data Flow

┌─────────────────────────────────────┐
│   XID Manager (xid_mgr_daemon)      │
│   - Manages transaction commitments │
└────────────────┬────────────────────┘
                 │ gRPC: XidPushResponse

┌─────────────────────────────────────────────────────────────┐
│ PgXidSubscriberMgr (daemon, separate thread)               │
│ - Receives XID notifications                               │
│ - Creates 5 shared memory caches                           │
│ - Populates caches on transaction commit                   │
│ - Worker threads fetch metadata proactively                │
└───┬─────────────────────────────────────────────────────────┘
    │ Shared Memory IPC
    ├─ SHM_CACHE_ROOTS         (table metadata)
    ├─ SHM_CACHE_SCHEMAS       (table schemas)
    ├─ SHM_CACHE_USERTYPES     (user-defined types)
    ├─ SHM_CACHE_TABLE_IDS     (pending mutations)
    └─ SHM_CACHE_EXTENTS       (mutation records)

    ↓ (Opens and reads)
┌──────────────────────────────┐
│   PgFdwMgr (FDW process)     │
│   - Runs in PostgreSQL       │
│   - Opens existing caches    │
│   - Queries pending XIDs     │
│   - Applies mutations        │
└──────────────────────────────┘

Components

ComponentRoleProcess Type
PgXidSubscriberMgrCache producer - creates and maintains cachesDaemon thread
PgFdwMgrCache consumer - opens and uses cachesPostgreSQL FDW process
ShmCacheShared memory abstraction using Boost InterprocessLibrary class
MsgCacheMulti-index container with LRU evictionTemplate class
XidMgrClientStreams XID commit notificationsgRPC client

Cache Types

The subsystem manages 5 distinct shared memory caches, each serving a specific purpose:

1. Roots Cache (SHM_CACHE_ROOTS)

Purpose: Stores table root/metadata information Key Properties:
  • XID history: Enabled
  • Tracks schema changes across transactions
  • Contains table root objects with metadata pointers
Data Structure:
Key: (DbId, TableId)
Value: vector<proto::Root> sorted by XID
Location:
  • Created: pg_xid_subscriber_mgr.cc:53
  • Used by: sys_tbl_mgr::Client::get_roots()

2. Schema Cache (SHM_CACHE_SCHEMAS)

Purpose: Stores table schema definitions Key Properties:
  • XID history: Disabled
  • Contains column definitions, types, constraints
  • Updated on DDL operations
Data Structure:
Key: (DbId, TableId)
Value: vector<proto::Schema>
Location:
  • Created: pg_xid_subscriber_mgr.cc:56
  • Used by: sys_tbl_mgr::Client::get_schema()

3. UserType Cache (SHM_CACHE_USERTYPES)

Purpose: Stores user-defined type information Key Properties:
  • XID history: Disabled
  • Contains custom type definitions
  • Required for schema deserialization
Data Structure:
Key: (DbId, TypeId)
Value: vector<proto::UserType>
Location:
  • Created: pg_xid_subscriber_mgr.cc:61
  • Used by: sys_tbl_mgr::Client::get_usertype()

4. Table IDs Cache (SHM_CACHE_TABLE_IDS)

Purpose: Maps (DbId, XID) to list of TableIds modified in each transaction Key Properties:
  • XID history: Enabled
  • Tracks pending mutations per transaction
  • Critical for read-after-write consistency
Data Structure:
Key: (DbId, Xid)
Value: serialized proto::XidPushResponse
Fields: {db_id, xid, has_schema_changes, real_commit, table_ids[]}
Location:
  • Created: pg_xid_subscriber_mgr.cc:64
  • Used by: PgFdwMgr::_get_table():645-654
Usage Pattern:
// Query pending XIDs
_table_ids_shm_cache->get_pending_xids(db_id)

// Fetch mutation metadata
_table_ids_shm_cache->find(db_id, xid)

5. Extents Cache (SHM_CACHE_EXTENTS)

Purpose: Stores mutation records (extents) from WriteCacheClient Key Properties:
  • XID history: Disabled
  • Contains actual data mutations
  • Populated before applying to base tables
Data Structure:
Key: (DbId, ExtentId)
Value: vector<proto::Extent>
Location:
  • Created: pg_xid_subscriber_mgr.cc:68
  • Used by: WriteCacheClient for mutation staging

Detailed Component Documentation

PgXidSubscriberMgr

Header: include/pg_fdw/pg_xid_subscriber_mgr.hh:22-105 Implementation: src/pg_fdw/pg_xid_subscriber_mgr.cc

Responsibilities

  1. Cache Creation (init() method):
    • Creates all 5 shared memory caches with configured sizes
    • Removes stale caches from previous runs
    • Initializes XID history cleaners for enabled caches
  2. XID Push Notification Handling (task() method):
    • Subscribes to XidMgrClient push stream
    • Receives proto::XidPushResponse messages
    • Dispatches based on real_commit flag
  3. Asynchronous Population (worker threads):
    • Configurable thread pool size
    • Fetches metadata on transaction commit
    • Calls Client::get_roots(), get_schema(), get_usertype()
    • Automatically populates caches via registered ShmCache instances

PgFdwMgr

Header: include/pg_fdw/pg_fdw_mgr.hh:133-468 Implementation: src/pg_fdw/pg_fdw_mgr.cc

Responsibilities

  1. Cache Opening (_try_create_cache() method):
    • Opens existing caches created by PgXidSubscriberMgr
    • Verifies cache liveness with is_alive() check
    • Registers caches with Client and WriteCacheClient singletons
  2. Mutation Application (_get_table() method):
    • Queries pending XIDs since last committed
    • Fetches mutation metadata from table_ids_cache
    • Retrieves extent data from WriteCache
    • Applies mutations to base table as ChangeSet
  3. Lifecycle Management:
    • Lazy initialization on first query
    • Automatic cache reopening if stale
    • Background thread for XID updates
Key Points:
  • Opens in read mode (size=0 parameter)
  • Handles case where PgXidSubscriberMgr hasn’t started yet
  • Checks is_alive() to detect stale caches
  • Registers with appropriate singleton clients
Performance Considerations:
  • Only processes XIDs <= snapshot_xid (transaction isolation)
  • Limits to MAX_WRITE_CACHE_EXTENTS (10) per query
  • Resets pending XIDs if too many extents found
  • Avoids redundant extent fetches

ShmCache Class

Header: include/sys_tbl_mgr/shm_cache.hh:73-360 Implementation: src/sys_tbl_mgr/shm_cache.cc

Purpose

Provides a shared memory abstraction using Boost Interprocess for inter-process data sharing.

Key Features

  1. Dual-mode construction:
    • Create mode: ShmCache(type, size, xid_history) with size > 0
    • Open mode: ShmCache(type, 0, xid_history) opens existing cache
  2. XID tracking (if enabled):
    • Tracks committed XID per database
    • Maintains list of pending XIDs
    • Records XID history for schema changes
  3. Keep-alive mechanism:
    • Caches must be kept alive every 60ms
    • is_alive() checks if cache is stale
    • Automatic reconnection if stale detected
Usage:
  • PgXidSubscriberMgr calls keep_alive() every 60ms
  • PgFdwMgr checks is_alive() before using cache
  • If stale, PgFdwMgr reopens cache

MsgCache Template

Header: include/sys_tbl_mgr/msg_cache.hh:31-323

Purpose

Multi-index container with LRU eviction, providing both fast lookup and insertion-order tracking. Configuration:
  • Free memory limit: 30% (trigger eviction)
  • Free memory watermark: 50% (stop eviction)
  • Retention: Top 10% frequently accessed items protected (msg_cache.hh:236)

Synchronization and Concurrency

Thread Safety Strategy

Mutex Hierarchy

// PgFdwMgr (pg_fdw_mgr.hh:298)
std::shared_mutex _shm_cache_mutex;  // Protects cache pointers

// ShmCache (shm_cache.hh:96)
boost::interprocess::named_sharable_mutex* _mutex;  // Per-cache lock

// PgXidSubscriberMgr (pg_xid_subscriber_mgr.hh:93)
std::mutex _mutex;  // Protects populate job queue
std::condition_variable _cv;  // Worker coordination

Configuration

Configuration Properties

File: Properties singleton (commonly loaded from YAML/JSON config) Schema:
{
  "sys_tbl_mgr": {
    "roots_shm_cache_size": 1073741824,
    "schema_shm_cache_size": 536870912,
    "usertype_shm_cache_size": 268435456,
    "table_ids_shm_cache_size": 536870912,
    "extents_shm_cache_size": 1073741824,
    "rpc_config": {
      "server_worker_threads": 4
    }
  }
}
Parameters:
ParameterTypeDefaultDescription
roots_shm_cache_sizesize_t1 GBSize of roots cache in bytes
schema_shm_cache_sizesize_t512 MBSize of schema cache in bytes
usertype_shm_cache_sizesize_t256 MBSize of usertype cache in bytes
table_ids_shm_cache_sizesize_t512 MBSize of table IDs cache in bytes
extents_shm_cache_sizesize_t1 GBSize of extents cache in bytes
server_worker_threadssize_t4Number of worker threads for async population

Usage Examples

Example 1: Querying Table with Pending Mutations

Scenario: PostgreSQL FDW process executes SELECT * FROM table WHERE id = 123 with snapshot XID 1000. Steps:
// 1. PgFdwMgr::_get_table() is called
auto table = _get_table(db_id, table_id, snapshot_xid=1000);

// 2. Fetch base table from SysTableMgr
auto base_table = SysTableMgr::get_instance()->get_table(db_id, table_id, 1000);

// 3. Query pending XIDs from table_ids_cache
auto pending_xids = _table_ids_shm_cache->get_pending_xids(db_id);
// Result: [950, 975, 1050]

// 4. Filter XIDs <= snapshot_xid
// Process: 950, 975 (skip 1050)

// 5. For XID 950:
auto msg = _table_ids_shm_cache->find(db_id, 950);
proto::XidPushResponse response;
response.ParseFromString(msg);
// response.table_ids() = [10, 25, 30]

// 6. Check if table_id matches
if (table_id == 25) {
    // 7. Fetch extents from WriteCache
    auto extents = WriteCacheClient::get_instance()->get_extents(db_id, table_id, 950);

    // 8. Apply extents as ChangeSet
    ChangeSet cs;
    for (auto& extent : extents) {
        cs.add_extent(extent);
    }
    base_table->apply_changeset(cs, 950);
}

// 9. Repeat for XID 975
// ... similar logic ...

// 10. Return merged table
return base_table;
Result: Query sees all committed data plus uncommitted mutations from XIDs 950 and 975, ensuring read-after-write consistency.

Example 3: Cache Staleness Detection and Recovery

Scenario: PgXidSubscriberMgr crashes and restarts while FDW processes are running. Steps:
// 1. PgFdwMgr detects stale cache
bool ShmCache::is_alive() const {
    auto now = std::chrono::steady_clock::now();
    auto elapsed = now - _last_keep_alive;
    return elapsed < std::chrono::milliseconds(60);
}

// 2. PgFdwMgr::_try_create_cache() checks liveness
if (!_roots_shm_cache || !_roots_shm_cache->is_alive()) {
    // 3. Reopen cache
    _roots_shm_cache = std::make_shared<sys_tbl_mgr::ShmCache>(
        sys_tbl_mgr::SHM_CACHE_ROOTS, 0, true);  // Open mode

    // 4. Re-register with Client
    sys_tbl_mgr::Client::get_instance()->use_roots_cache(_roots_shm_cache);
}

// 5. PgXidSubscriberMgr restarts
PgXidSubscriberMgr::start();

// 6. New caches created
// Old shared memory segments removed
// Fresh XIDs streamed from XID Manager

// 7. FDW processes reopen caches on next query
// Seamless recovery with minimal downtime

4. LRU Eviction Strategy

Problem: Unbounded cache growth causes OOM Solution: Multi-index container with sequenced index for LRU Benefits:
  • Automatic eviction at 30% free memory
  • Stops eviction at 50% free memory watermark

Debugging Commands

# List shared memory segments
ipcs -m

# Remove stale shared memory
ipcrm -m <shmid>

# Check cache sizes
ls -lh /dev/shm/

# Monitor cache usage
watch -n 1 'ipcs -m -u'

# Verify PgXidSubscriberMgr is running
ps aux | grep pg_xid_subscriber_mgr

# Check XID Manager connection
netstat -an | grep <xid_manager_port>
## Design Patterns

1. Producer-Consumer with Lazy Initialization

Pattern:
  • Producer: PgXidSubscriberMgr creates and maintains caches
  • Consumer: PgFdwMgr opens and uses caches
  • Trigger: Query execution in PostgreSQL FDW process
Benefits:
  • Loose coupling between producer and consumer
  • Graceful handling of startup ordering
  • Automatic recovery on producer restart

References

Source Files

FileDescription
include/pg_fdw/pg_xid_subscriber_mgr.hhPgXidSubscriberMgr class definition
src/pg_fdw/pg_xid_subscriber_mgr.ccPgXidSubscriberMgr implementation
include/pg_fdw/pg_fdw_mgr.hhPgFdwMgr class definition
src/pg_fdw/pg_fdw_mgr.ccPgFdwMgr implementation
include/sys_tbl_mgr/shm_cache.hhShmCache class definition
src/sys_tbl_mgr/shm_cache.ccShmCache implementation
include/sys_tbl_mgr/msg_cache.hhMsgCache template definition
src/proto/xid_manager.protoXidPushResponse protobuf definition
include/write_cache/write_cache_client.hhWriteCacheClient class definition

External Dependencies

LibraryPurposeDocumentation
Boost InterprocessShared memory IPCboost.org/doc/libs/interprocess
Boost Multi-IndexMulti-index containersboost.org/doc/libs/multi_index
Protocol BuffersSerializationdevelopers.google.com/protocol-buffers
gRPCRPC frameworkgrpc.io
  • PostgreSQL Foreign Data Wrapper API
  • Springtail Transaction Management
  • XID Manager Architecture
  • WriteCache Design