Chapter 3: Memory Management and Pools

The Problem with Traditional Memory Management

In traditional C programming, memory management is manual and error-prone:

char *buffer = malloc(1024);
if (!buffer) return ERROR;

process_data(buffer);

// Oops! Forgot to free on this error path
if (some_error) {
    return ERROR;  // Memory leak!
}

free(buffer);
return OK;

Web servers make this especially dangerous because:

Thousands of requests per second, each allocating many small objects
Complex code paths with multiple return points create many opportunities to miss a free()
Long-running processes amplify even tiny leaks into eventual OOM kills
Multi-threaded access makes double-free and use-after-free bugs timing-dependent and hard to reproduce

Apache’s Solution: Memory Pools

Apache uses hierarchical memory pools (sometimes called “arenas”). The concept is simple:

Create a pool
Allocate from the pool (no individual frees needed)
Destroy the pool (everything allocated from it is freed at once)

apr_pool_t *pool;
apr_pool_create(&pool, parent_pool);

char *buffer = apr_palloc(pool, 1024);
char *name = apr_pstrdup(pool, username);
char *msg = apr_psprintf(pool, "Hello, %s", name);

// All error paths are safe - just return
if (some_error) {
    return ERROR;  // No leak! Pool cleanup handles it
}

// When done, one call frees everything
apr_pool_destroy(pool);

The key insight: you never call free() on individual allocations. Instead, you tie allocations to a pool with a well-defined lifetime, and the pool frees everything when it’s destroyed. This eliminates entire categories of bugs: memory leaks (the pool always cleans up), double-free (there’s no free() to call twice), and dangling pointers (as long as you don’t use pool memory after the pool is destroyed).

Pool Hierarchy in Apache

Pools form a tree structure. When a parent pool is destroyed, all child pools are automatically destroyed too. Apache’s pool hierarchy mirrors its request-processing architecture:

        graph TD
    GP["Global Pool (pconf)<br />Lives for server lifetime"]
    GP --> VH1["Child Pool<br />(vhost 1)"]
    GP --> VH2["Child Pool<br />(vhost 2)"]
    GP --> PT["ptemp<br />(temporary, cleared<br />after config parsing)"]
    VH1 --> CP1["Connection Pool<br />(c->pool)<br />Lives for TCP connection"]
    VH1 --> CP2["Connection Pool<br />(c->pool)"]
    CP1 --> RP1["Request Pool<br />(r->pool)<br />Lives for single HTTP request"]
    CP1 --> RP2["Request Pool<br />(r->pool)"]

    style GP fill:#e74c3c,stroke:#c0392b,color:#000
    style VH1 fill:#e67e22,stroke:#d35400,color:#000
    style VH2 fill:#e67e22,stroke:#d35400,color:#000
    style PT fill:#95a5a6,stroke:#7f8c8d,color:#000
    style CP1 fill:#3498db,stroke:#2980b9,color:#000
    style CP2 fill:#3498db,stroke:#2980b9,color:#000
    style RP1 fill:#2ecc71,stroke:#27ae60,color:#000
    style RP2 fill:#2ecc71,stroke:#27ae60,color:#000

Each level in the hierarchy corresponds to a different scope in Apache’s request processing:

Red (Global): Server-level pools survive the entire process lifetime
Orange (Virtual Host): Created per-virtual-host during configuration
Blue (Connection): Created when a TCP connection is accepted, destroyed when it closes (may span multiple keep-alive requests)
Green (Request): Created for each HTTP request, destroyed after the response is sent. This is by far the most frequently created/destroyed pool and is what most module code allocates from

Apache’s Standard Pools

`pconf` - Configuration Pool

Created at startup, destroyed on shutdown
Used for: server configuration, loaded modules, directive strings
Lifetime: Entire server process

`plog` - Logging Pool

Used for log file handles
Lifetime: Until log rotation

`ptemp` - Temporary Pool

Destroyed after configuration parsing completes
Used for: temporary allocations during config (expanding wildcard includes, building intermediate arrays)
Lifetime: Configuration phase only

Connection Pool (`c->pool`)

Created when a connection is accepted
Destroyed when the connection closes
Lifetime: TCP connection (may span multiple requests with keep-alive)

Request Pool (`r->pool`)

Created for each HTTP request
Destroyed after the response is sent and logging is complete
Lifetime: Single request/response cycle
This is the pool you’ll use most in module code

The pool lifetime determines when memory is freed, which is why choosing the right pool matters:

        sequenceDiagram
    participant S as Server Start
    participant C as Connection Accept
    participant R1 as Request 1
    participant R2 as Request 2
    participant D as Connection Close

    Note over S: pconf pool created
    S->>C: Accept TCP connection
    Note over C: c->pool created
    C->>R1: Read HTTP request
    Note over R1: r->pool created
    R1->>R1: Process request
    Note over R1: r->pool destroyed
    R1->>R2: Keep-alive: next request
    Note over R2: new r->pool created
    R2->>R2: Process request
    Note over R2: r->pool destroyed
    R2->>D: Connection closes
    Note over D: c->pool destroyed

Pool API

Real-World Code Patterns

Subpools for Temporary Work

When you need to do work that generates many temporary allocations inside a loop, allocating from the request pool would cause memory to grow unboundedly until the request finishes. The solution is to create a subpool and clear it each iteration:

static int process_large_data(request_rec *r, apr_array_header_t *items)
{
    // Create a subpool for temporary work
    apr_pool_t *tmp_pool;
    apr_pool_create(&tmp_pool, r->pool);

    for (int i = 0; i < items->nelts; i++) {
        // Heavy allocations in subpool
        char *expanded = expand_item(tmp_pool, items[i]);
        process_item(r, expanded);

        // Clear subpool each iteration to prevent buildup
        apr_pool_clear(tmp_pool);
    }

    apr_pool_destroy(tmp_pool);
    return OK;
}

Without the subpool, 10,000 iterations of apr_psprintf would leave 10,000 temporary strings allocated in the request pool. With the subpool, only one iteration’s worth of memory is live at any time.

Pool Debugging and Fuzzing

APR has a built-in debug mode that fundamentally changes how pools allocate memory. This is critically important for fuzzing.

The short version: normally, apr_palloc carves sub-allocations out of a large slab (typically 8KB). ASan only tracks the slab boundaries, not the sub-allocation boundaries, so small overflows between sub-allocations are invisible. When you configure with --enable-pool-debug=yes, every apr_palloc becomes a direct malloc(), and every apr_pool_destroy becomes a direct free(). ASan can then see every allocation boundary.

// Tag pools for debugging - helps identify them in debug output
apr_pool_tag(pool, "my_module_request_pool");

Common pool-related bugs and how pools prevent them:

Traditional Bug	With Pools
Memory leak (forgot free)	Very unlikely - pool handles it
Double free	Very unlikely - no individual free
Use after free	Rare - usually obvious lifetime
Fragmentation	Minimized - pools allocate in chunks

How Pool Allocation Actually Works

Understanding the internal allocation strategy helps explain why ASan needs special configuration. Pools use a bump-pointer allocator within fixed-size memory blocks:

        graph TD
    subgraph "apr_pool_t"
        PP["parent pointer"]
        CL["child list head"]
        SB["sibling pointers"]
        CU["cleanup list"]
        AB["active block pointer"]
    end

    AB --> B1

    subgraph B1["Memory Block 1 (8KB)"]
        A1["allocation 1 (16 bytes)"]
        A2["allocation 2 (64 bytes)"]
        A3["allocation 3 (128 bytes)"]
        FREE1["[free space]"]
    end

    B1 -->|"when block fills up"| B2

    subgraph B2["Memory Block 2"]
        A4["allocation 4"]
        A5["allocation 5"]
        FREE2["[free space]"]
    end

    style A1 fill:#3498db,stroke:#2980b9,color:#000
    style A2 fill:#3498db,stroke:#2980b9,color:#000
    style A3 fill:#3498db,stroke:#2980b9,color:#000
    style A4 fill:#3498db,stroke:#2980b9,color:#000
    style A5 fill:#3498db,stroke:#2980b9,color:#000
    style FREE1 fill:#95a5a6,stroke:#7f8c8d,color:#000
    style FREE2 fill:#95a5a6,stroke:#7f8c8d,color:#000

Each allocation just increments a pointer within the current block - O(1) and extremely fast, much cheaper than malloc(). When a block fills, a new one is allocated. On pool destroy/clear, all blocks are freed at once. There’s no per-allocation metadata overhead, no free-list management, and no fragmentation within a pool.

Best Practices

Summary

Memory pools are fundamental to Apache:

No memory leaks: Pool destruction frees everything
Simple code: No tracking individual allocations
Fast: Bump-pointer allocation is O(1)
Hierarchical: Child pools auto-destroyed with parent
Cleanups: Handle non-memory resources

Key points:

Use request_rec::pool for request-scoped allocations
Use conn_rec::pool for connection-scoped allocations
Create subpools for temporary/loop work
Register cleanups for external resources
Never call free() on pool-allocated memory
For fuzzing with ASan, use --enable-pool-debug=yes to make sub-allocation boundaries visible

This pool system is what makes Apache’s modular architecture practical - modules don’t need to carefully track memory because the framework handles it through pool lifetimes.

The next chapter covers Apache’s configuration system - how httpd.conf directives are parsed, stored (in pool-allocated memory), and used by modules. good luck! :^)