Chapter 8: Request Processing Pipeline

The Big Picture

When an HTTP request arrives, Apache processes it through a carefully orchestrated pipeline of hooks and filters. Each phase has a specific responsibility – URI translation, access control, authentication, content generation – and modules register callbacks at precisely the phases where they need to act.

Note

Understanding this pipeline is essential for both module development and fuzzing. For fuzzing, it tells you which code paths your input will exercise: a malformed request line will be caught in phase 3 (request parsing), while a crafted session cookie will flow all the way to the handler phase and into mod_session_crypto’s decryption logic.

┌─────────────────────────────────────────────────────────────────────┐
│                        REQUEST LIFECYCLE                            │
├─────────────────────────────────────────────────────────────────────┤
│                                                                     │
│  1. Connection Accepted (MPM)                                       │
│          │                                                          │
│          ▼                                                          │
│  2. Connection Setup (pre_connection hooks)                         │
│          │                                                          │
│          ▼                                                          │
│  3. Read Request Line & Headers                                     │
│          │                                                          │
│          ▼                                                          │
│  4. Request Processing Phases (hooks)                               │
│     ┌─────────────────────────────────────────┐                     │
│     │  post_read_request                      │                     │
│     │  translate_name                         │                     │
│     │  map_to_storage                         │                     │
│     │  header_parser                          │                     │
│     │  access_checker                         │                     │
│     │  check_user_id (authn)                  │                     │
│     │  auth_checker (authz)                   │                     │
│     │  type_checker                           │                     │
│     │  fixups                                 │                     │
│     │  handler                                │                     │
│     └─────────────────────────────────────────┘                     │
│          │                                                          │
│          ▼                                                          │
│  5. Send Response (output filters)                                  │
│          │                                                          │
│          ▼                                                          │
│  6. Log Transaction                                                 │
│          │                                                          │
│          ▼                                                          │
│  7. Cleanup (pool destruction)                                      │
│                                                                     │
└─────────────────────────────────────────────────────────────────────┘

Internal Redirects

Apache can redirect internally without a new HTTP round-trip. This creates a new request_rec that re-runs the pipeline from phase 4, but reuses the same connection and avoids sending a 3xx response to the client. ErrorDocument directives use this mechanism – a 404 error on /missing-page internally redirects to /error/404.html:

// In a handler or hook:
ap_internal_redirect("/new/path", r);

// Or with modified request:
request_rec *new_r = ap_sub_req_lookup_uri("/new/path", r, NULL);
ap_run_sub_req(new_r);
ap_destroy_sub_req(new_r);

Internal redirects create a new request_rec but reuse the connection.

Subrequests

Subrequests are “virtual” requests that run the pipeline for a different URI within the context of the current request. Unlike internal redirects (which replace the current request), subrequests run alongside it. The subrequest gets its own request_rec with a pool that’s a child of the parent request’s pool:

// Lookup what would handle a URI
request_rec *sub = ap_sub_req_lookup_uri("/includes/header.html",
                                          r, r->output_filters);
if (sub->status == HTTP_OK) {
    // Run the subrequest
    ap_run_sub_req(sub);
}
ap_destroy_sub_req(sub);

Used by:

mod_include (SSI)
mod_negotiation
mod_dir

Error Handling

When an error occurs:

// Return HTTP error from any hook/handler
return HTTP_FORBIDDEN;  // 403

// Or set r->status and return OK
r->status = HTTP_NOT_FOUND;
ap_send_error_response(r, 0);
return OK;

Apache then:

Sets error status
Looks for ErrorDocument
Generates error response
Runs log hooks

Summary

The request pipeline is Apache’s orchestration of:

Connection setup - MPM accepts, hooks initialize
Request parsing - HTTP line and headers
URI processing - Translate and map to handler
Security checks - Access, authentication, authorization
Content generation - Handler produces response
Response delivery - Filters transform and send
Logging - Record the transaction
Cleanup - Free resources

Key insights for fuzzing:

Entry point: The harness calls ap_process_connection() directly, bypassing the MPM’s accept loop. This enters the pipeline at phase 2 (connection setup)
Input source: The core input filter is replaced with one that reads from the fuzzer’s memory buffer instead of a socket
Output sink: The core output filter is replaced with one that discards data (or writes to /dev/null)
All phases are hook-driven: Every module callback registered via ap_hook_*() runs exactly as it would in production
Pool-scoped allocations: After each request, apr_pool_destroy frees everything, which is when ASan (with --enable-pool-debug=yes) checks for memory errors
Internal redirects and subrequests can be triggered by fuzzer input (e.g., a request to a path with an ErrorDocument directive), exercising additional code paths beyond the initial request