Chapter 2: APR - Apache Portable Runtime

What is APR?

APR (Apache Portable Runtime) is a C library that provides a consistent, cross-platform interface to underlying OS functionality. Think of it as Apache’s “standard library” that abstracts away differences between Linux, Windows, BSD, and other operating systems.

Note

If you’re reading Apache code and see a function starting with apr_, it’s an APR function. You’ll almost never see raw POSIX or Win32 calls in Apache modules.

Why APR Exists

Consider the problem of writing portable C code:

// Linux/POSIX:
#include <unistd.h>
#include <sys/socket.h>
int fd = socket(AF_INET, SOCK_STREAM, 0);

// Windows:
#include <winsock2.h>
SOCKET s = socket(AF_INET, SOCK_STREAM, 0);
// Plus: WSAStartup(), different error handling, etc.

With APR:

#include "apr_network_io.h"
apr_socket_t *sock;
apr_socket_create(&sock, APR_INET, SOCK_STREAM, APR_PROTO_TCP, pool);
// Works identically on all platforms

APR doesn’t just wrap system calls - it normalizes error codes, resource lifecycle (everything ties into pools), and calling conventions across platforms. Nearly every APR function that creates or allocates a resource takes a pool parameter, so the resource is cleaned up automatically when that pool is destroyed. Functions that only operate on an existing object, such as apr_socket_connect() or apr_table_get(), do not take one. This is a fundamental design choice that pervades all of Apache.

APR vs APR-util

APR is split into two libraries. APR-core provides the low-level OS abstractions, and APR-util adds higher-level data structures and services on top:

        %%{init: {"gantt": {"displayMode": "compact", "barHeight": 30, "leftPadding": 85}}}%%
gantt
    title APR Library Stack
    tickInterval 10day
    dateFormat YYYY-MM-DD
    axisFormat " "
    section Apache HTTPD
    httpd core + modules                       : 2024-01-01, 10d
    section APR-util
    DBD, Buckets, Crypto, XML, URI             : 2024-01-01, 10d
    section APR (core)
    Pools, File I/O, Network, Threads, Tables  : 2024-01-01, 10d
    section Operating System
    Linux / Windows / BSD / macOS              : 2024-01-01, 10d

APR (core)

Memory pools (the foundation - see Chapter 3)
File I/O
Network I/O
Process/thread management
Atomic operations
Time functions
Environment variables

APR-util

Database abstraction (DBD)
Bucket brigades (the I/O abstraction for Apache filters - see Chapter 7)
Cryptographic functions (used by mod_session_crypto)
URI/URL handling
XML parsing
Queue/reslist (resource pools)
Memcache client

In the source tree:

srclib/
├── apr/          # Core APR
│   ├── include/  # apr_*.h headers
│   └── ...
└── apr-util/     # APR utilities
    ├── include/  # apr_*.h headers (plus a few apu*.h build/version headers)
    └── ...

Note

Fuzzing note: When building Apache for fuzzing, both libraries are compiled from source by passing --with-included-apr to configure. This ensures APR is instrumented with the same compiler flags (sanitizers, coverage) as Apache itself. Using a system-installed APR would leave the APR code uninstrumented, hiding bugs that occur inside APR functions.

APR Naming Conventions

APR follows consistent naming patterns that make Apache code readable once you know the system:

// Types end with _t
apr_pool_t      // Memory pool
apr_socket_t    // Network socket
apr_file_t      // File handle
apr_thread_t    // Thread handle
apr_table_t     // Key-value table

// Functions are apr_<module>_<action>
apr_pool_create()
apr_socket_create()
apr_file_open()
apr_thread_create()
apr_table_get()

// Return status
apr_status_t    // Return type for most functions
APR_SUCCESS     // Success constant (defined as 0)
APR_EOF         // End of file
APR_EAGAIN      // Try again (non-blocking)

This pattern extends to Apache’s own API layer, which uses ap_ for server functions and AP_ for constants:

ap_hook_handler()        // Register a handler hook
ap_run_handler()         // Run all registered handlers
ap_get_module_config()   // Get module config from a vector
AP_INIT_TAKE1            // Directive that takes one argument

Essential APR Types and Functions

APR in Apache Context

The following diagram shows how a typical module handler interacts with APR subsystems. Every arrow represents an APR function call, and every resource is allocated from the request pool:

        flowchart LR
    Handler["Request<br />Handler"]
    Handler -->|"apr_psprintf<br />apr_pstrdup"| Pools["APR Pools"]
    Handler -->|"apr_table_set<br />apr_table_get"| Tables["APR Tables"]
    Handler -->|"apr_file_open<br />apr_file_read"| FileIO["APR File I/O"]
    Handler -->|"apr_socket_*"| NetIO["APR Network I/O"]
    Pools --> OS["Operating System"]
    Tables --> OS
    FileIO --> OS
    NetIO --> OS

In Apache code, you’ll see APR used everywhere:

static int example_handler(request_rec *r)
{
    // String operations use request pool
    char *greeting = apr_psprintf(r->pool, "Hello, %s!",
                                  r->useragent_ip);

    // Headers are apr_table_t
    apr_table_set(r->headers_out, "X-Custom-Header", "value");

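    // File operations: check the returned status before using fp
    apr_file_t *fp;
    apr_status_t rv = apr_file_open(&fp, r->filename, APR_FOPEN_READ,
                                    APR_OS_DEFAULT, r->pool);
    if (rv != APR_SUCCESS) {
        return HTTP_NOT_FOUND;
    }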

    return OK;
}

Notice that the allocating calls take r->pool (r->headers_out was itself allocated from it). This is the request pool - it’s created when the request starts and destroyed once request processing finishes. Everything allocated from it (the greeting string, the file handle) is released automatically at that point. The handler doesn’t need a single free() call, and an early return on an error path cannot leak these resources.

Common APR Usage Patterns

Pattern 1: Error Handling

apr_status_t rv;
char errbuf[256];

// sock, addr, and s (the server_rec * used for logging) are set up earlier
rv = apr_socket_connect(sock, addr);
if (rv != APR_SUCCESS) {
    ap_log_error(APLOG_MARK, APLOG_ERR, rv, s,
                 "Failed to connect: %s",
                 apr_strerror(rv, errbuf, sizeof(errbuf)));
    return HTTP_SERVICE_UNAVAILABLE;
}

Pattern 2: Pool-based Resource Management

// Create a subpool for temporary allocations
apr_pool_t *subpool;
apr_pool_create(&subpool, r->pool);

// Do work with subpool
char *temp = apr_palloc(subpool, 10000);
process_data(temp);

// Clean up when done - frees everything allocated from subpool
apr_pool_destroy(subpool);

Pattern 3: Iteration with APR

// Iterate over table entries
const apr_array_header_t *tarr = apr_table_elts(table);
const apr_table_entry_t *telts = (const apr_table_entry_t*)tarr->elts;

for (int i = 0; i < tarr->nelts; i++) {
    printf("%s: %s\n", telts[i].key, telts[i].val);
}

Finding APR Documentation

APR headers have useful inline comments. See:

srclib/apr/include/apr_*.h - Core APR
srclib/apr-util/include/apr_*.h - APR-util

Each header has comments explaining every function, its parameters, return values, and edge cases. When in doubt about an APR function’s behavior, read the header :D

Summary

APR is Apache’s foundation library providing:

Portability: Same code works on Linux, Windows, BSD, etc.
Consistency: Uniform error handling, naming conventions
Memory management: Pool-based allocation ties cleanup to pool lifetime, which prevents most leaks
Rich functionality: Covers files, network, threads, data structures

Before writing any Apache code, become comfortable with:

apr_pool_t and memory pools (next chapter)
apr_table_t for headers
apr_status_t for error handling
String functions: apr_pstrdup, apr_psprintf, apr_pstrcat

The next chapter dives deeper into APR’s most important feature: memory pools.