Request Matchers

Request matchers can be used to filter (or classify) requests by specific criteria.

Menu

Syntax

In the Caddyfile, a matcher token immediately following the directive can limit that directive’s scope. The matcher token can be one of these forms:

  1. * to match all requests (wildcard; default).
  2. /path start with a forward slash to match a request path.
  3. @name to specify a named matcher.

Matcher tokens are usually optional. If a matcher token is omitted, it is the same as a wildcard matcher (*).

Examples

This directive applies to all HTTP requests:

  1. reverse_proxy localhost:9000

And this is the same:

  1. reverse_proxy * localhost:9000

But this directive applies only to requests having a path starting with /api/:

  1. reverse_proxy /api/* localhost:9000

To match on anything other than a path, define a named matcher and refer to it using @name:

  1. @postfoo {
  2. method POST
  3. path /foo/*
  4. }
  5. reverse_proxy @postfoo localhost:9000

Wildcard matchers

The wildcard (or “catch-all”) matcher * matches all requests, and is only needed if a matcher token is required. For example, if the first argument you want to give a directive also happens to be a path, it would look exactly like a path matcher! So you can use a wildcard matcher to disambiguate, for example:

  1. root * /home/www/mysite

Otherwise, this matcher is not often used. It is convenient to omit it when possible; just a matter of preference.

Path matchers

Because matching by path is so common, a single path matcher can be inlined, like so:

  1. redir /old.html /new.html

Path matcher tokens must start with a forward slash /.

Path matching is an exact match by default; you must append a * for a fast prefix match. Note that /foo* will match /foo and /foo/ as well as /foobar; you might actually want /foo/* instead.

Named matchers

All matchers that are not comprised of a single path matcher or a wildcard matcher must be named matchers. This is a matcher that is defined outside of any particular directive, and can be reused.

Defining a matcher with a unique name gives you more flexibility, allowing you to combine any available matchers into a set:

  1. @name {
  2. ...
  3. }

or, if there is only one matcher in the set:

  1. @name ...

Then you can use the matcher like so: @name

For example:

  1. @websockets {
  2. header Connection *Upgrade*
  3. header Upgrade websocket
  4. }
  5. reverse_proxy @websockets localhost:6001

This proxies only the requests that have a header field named “Connection” containing the word “Upgrade”, and another field named “Upgrade” with a value of “websocket”.

If the matcher set consists of only one matcher, a one-liner syntax also works:

  1. @post method POST
  2. reverse_proxy @post localhost:6001

Like directives, named matcher definitions must go inside the site blocks that use them.

A named matcher definition constitutes a matcher set. Matchers in a set are AND’ed together; i.e. all must match. For example, if you have both a header and path matcher in the set, both must match.

For most matchers that accept multiple values, those values are OR’ed; i.e. one must match in order for the matcher to match.

Standard matchers

Full matcher documentation can be found in each respective matcher module’s docs.

expression

⚠️ This module is still experimental and, as such, may experience breaking changes.

  1. expression <cel...>

By any CEL (Common Expression Language) expression that returns true or false.

As a special case, Caddy placeholders (or Caddyfile shorthands) may be used in these CEL expressions, as they are preprocessed and converted to regular CEL function calls before being interpreted by the CEL environment.

Examples:

Match requests whose methods start with P, e.g. PUT or POST.

  1. expression {method}.startsWith("P")

Match requests where handler returned error status code 404, would be used in conjunction with the handle_errors directive.

  1. expression {http.error.status_code} == 404

file

  1. file {
  2. root <paths>
  3. try_files <files...>
  4. try_policy first_exist|smallest_size|largest_size|most_recent_modified
  5. split_path <delims...>
  6. }

By files.

  • root defines the directory in which to look for files. Default is the current working directory, or the root variable ({http.vars.root}) if set (can be set via the root directive).
  • try_files checks files in its list that match the try_policy.
  • try_policy specifies how to choose a file. Default is first_exist.
    • first_exist checks for file existence. The first file that exists is selected.
    • smallest_size chooses the file with the smallest size.
    • largest_size chooses the file with the largest size.
    • most_recent_modified chooses the file that was most recently modified.
  • split_path will cause the path to be split at the first delimiter in the list that is found in each filepath to try. For each split value, the left-hand side of the split including the delimiter itself will be the filepath that is tried. For example, /remote.php/dav/ using a delimiter of .php would try the file /remote.php. Each delimiter must appear at the end of a URI path component in order to be used as a split delimiter. This is a niche setting and is mostly used when serving PHP sites.

Because try_files with a policy of first_exist is so common, there is a one-line shortcut for that:

  1. file <files...>

An empty file matcher (one with no files listed after it) will see if the requested file—verbatim from the URI, relative to the site root—exists.

Since rewriting based on the existence of a file on disk is so common, there is also a try_files directive which is a shortcut of the file matcher and a rewrite handler.

Examples:

Match requests where the path is a file that exists.

  1. file

Match requests where the path followed by .html is a file that exists, or if not, where the path is a file that exists.

  1. file {
  2. try_files {path}.html {path}
  3. }

header

  1. header <field> <value>

By request header fields.

  • <field> is the name of the HTTP header field to check.
  • <value> is the value the field must have to match.
    • If prefixed with *, it performs a fast suffix match.
    • If suffixed with *, it performs a fast prefix match.
    • If enclosed by *, it performs a fast substring match.
    • Otherwise, it is a fast exact match.

Example:

Match requests with the Connection header containing Upgrade.

  1. header Connection *Upgrade*

header_regexp

  1. header_regexp [<name>] <field> <regexp>

Like header, but supports regular expressions. Capture groups can be accessed via placeholder like {http.regexp.name.capture_group} where name is the name of the regular expression (optional, but recommended) and capture_group is either the name or number of the capture group in the expression. Capture group 0 is the full regexp match, 1 is the first capture group, 2 is the second capture group, and so on.

Example:

Match requests where the Cookie header contains login_ followed by a hex string, with a capture group that can be accessed with {http.regexp.login.1}.

  1. header_regexp login Cookie login_([a-f0-9]+)

host

  1. host <hosts...>

Matches request by the Host header field of the request. It is not common to use this in the Caddyfile, since most site blocks already indicate hosts in the address of the site. This matcher is mostly used in site blocks that don’t define specific hostnames.

Example:

  1. host sub.example.com

method

  1. method <verbs...>

By the method (verb) of the HTTP request. Verbs should be uppercase, like POST. Can match one or many methods.

Examples:

Match requests with the GET method.

  1. method GET

Match requests with the PUT or DELETE methods.

  1. method PUT DELETE

not

  1. not <any other matcher>

or, to negate multiple matchers which get AND’ed, open a block:

  1. not {
  2. <any other matchers...>
  3. }

The results of the enclosed matchers will be negated.

Examples:

Match requests with paths that do NOT begin with /css/ OR /js/.

  1. not path /css/* /js/*

Match requests WITH NEITHER:

  • an /api/ path prefix, NOR
  • the POST request method

i.e. must have none of these to match:

  1. not path /api/*
  2. not method POST

Match requests WITHOUT BOTH:

  • an /api/ path prefix, AND
  • the POST request method

i.e. must have neither or either of these to match:

  1. not {
  2. path /api/*
  3. method POST
  4. }

path

  1. path <paths...>

By request path, meaning the path component of the request’s URI. Path matches are exact, but wildcards * may be used:

  • At the end, for a prefix match (/prefix/*)
  • At the beginning, for a suffix match (*.suffix)
  • On both sides, for a substring match (*/contains/*)
  • In the middle, for a globular match (/accounts/*/info)

path_regexp

  1. path_regexp [<name>] <regexp>

Like path, but supports regular expressions. Capture groups can be accessed via placeholder like {http.regexp.name.capture_group} where name is the name of the regular expression (optional, but recommended) and capture_group is either the name or number of the capture group in the expression. Capture group 0 is the full regexp match, 1 is the first capture group, 2 is the second capture group, and so on.

Example:

Match requests where the path ends a 6 character hex string followed by .css or .js as the file extension, with capture groups that can be accessed with {http.regexp.static.1} and {http.regexp.static.2} for each part enclosed in ( ), respectively.

  1. path_regexp static \.([a-f0-9]{6})\.(css|js)$

protocol

  1. protocol http|https|grpc

By request protocol.


query

  1. query <key>=<val>...

By query string parameters. Should be a sequence of key=value pairs. Keys are matched exactly, case-sensitively. Values are matched exactly, but also support * to match any value.

Example:

Match requests with a sort query parameter with the value asc

  1. query sort=asc

remote_ip

  1. remote_ip <ranges...>

By remote (client) IP address. Accepts exact IPs or CIDR ranges.

Example:

Match requests from private IPv4 addresses.

  1. remote_ip 192.168.0.0/16 172.16.0.0/12 10.0.0.0/8