Envoy Integration

Consul Connect has first class support for using Envoy as a proxy. Consul configures Envoy by optionally exposing a gRPC service on the local agent that serves Envoy’s xDS configuration API.

Consul can configure Envoy sidecars to proxy http/1.1, http2 or gRPC traffic at L7 or any other tcp-based protocol at L4. Prior to Consul 1.5.0 Envoy proxies could only proxy tcp at L4.

Configuration of some L7 features is possible via configuration entries. If you wish to use an Envoy feature not currently exposed through these config entries as an interim solution, you can add custom Envoy configuration in the proxy service definition allowing you to use the more powerful features of Envoy.

Note: When using Envoy with Consul and not using the consul connect envoy command Envoy must be run with the --max-obj-name-len option set to 256 or greater.

Supported Versions

Consul’s Envoy support was added in version 1.3.0. The following table shows compatible Envoy versions.

Consul VersionCompatible Envoy Versions
1.8.x1.14.5, 1.13.6, 1.12.7, 1.11.2
1.7.x1.13.1, 1.12.3, 1.11.2, 1.10.0*
1.5.2, 1.5.3, 1.6.x1.11.1, 1.10.0, 1.9.1, 1.8.0†
1.5.0, 1.5.11.9.1, 1.8.0†
1.3.x, 1.4.x1.9.1, 1.8.0†, 1.7.0†

† Envoy versions lower than 1.9.1 are vulnerable to CVE-2019-9900 and CVE-2019-9901. Both are related to HTTP request parsing and so only affect Consul Connect users if they have configured HTTP routing rules via the “escape hatch”. Still, we recommend that you use the most recent supported Envoy for your Consul version where possible.

* Envoy 1.10.0 requires setting -envoy-version in the consul connect envoy command. This was introduced in Consul 1.7.0.

Getting Started

To get started with Envoy and see a working example you can follow the Using Envoy with Connect tutorial.

Configuration

Envoy proxies require two types of configuration: an initial bootstrap configuration and dynamic configuration that is discovered from a “management server”, in this case Consul.

The bootstrap configuration at a minimum needs to configure the proxy with an identity (node id) and the location of it’s local Consul agent from which it discovers all of it’s dynamic configuration. See Bootstrap Configuration for more details.

The dynamic configuration Consul Connect provides to each Envoy instance includes:

  • TLS certificates and keys to enable mutual authentication and keep certificates rotating.
  • Service-discovery results for upstreams to enable each sidecar proxy to load-balance outgoing connections.
  • L7 configuration including timeouts and protocol-specific options.
  • Configuration to expose specific HTTP paths.

For more information on the parts of the Envoy proxy runtime configuration that are currently controllable via Consul Connect see Dynamic Configuration.

We plan to enable more and more of Envoy’s features through Connect’s first-class configuration over time, however some advanced users will need additional control to configure Envoy in specific ways. To enable this, we provide several “escape hatch” options that allow users to provide low-level raw Envoy config syntax for some sub-components in each Envoy instance. This allows operators to have full control over and responsibility for correctly configuring Envoy and ensuring version support etc.

Bootstrap Configuration

Envoy requires an initial bootstrap configuration file. The easiest way to create this is using the consul connect envoy command. The command can either output the bootstrap configuration directly to stdout or can generate it and then exec the Envoy binary as a convenience wrapper.

Because some Envoy configuration options like metrics and tracing sinks can only be specified via the bootstrap configuration, Connect as of Consul 1.5.0 adds the ability to control some parts of the bootstrap config via proxy configuration options.

Users can add the following configuration items to the global proxy-defaults configuration entry or override them directly in the proxy.config field of a proxy service definition or sidecar_service block.

  • envoy_statsd_url - A URL in the form udp://ip:port identifying a UDP StatsD listener that Envoy should deliver metrics to. For example, this may be udp://127.0.0.1:8125 if every host has a local StatsD listener. In this case users can configure this property once in the global proxy-defaults configuration entry for convenience. Currently, TCP is not supported.

    Note: currently the url must use an ip address not a dns name due to the way Envoy is setup for StatsD.

    Users can also specify the whole parameter in the form $ENV_VAR_NAME, which will cause the consul connect envoy command to resolve the actual URL from the named environment variable when it runs. This, for example, allows each pod in a Kubernetes cluster to learn of a pod-specific IP address for StatsD when the Envoy instance is bootstrapped while still allowing global configuration of all proxies to use StatsD in the global proxy-defaults configuration entry. The env variable must contain a full valid URL value as specified above and nothing else. It is not currently possible to use environment variables as only part of the URL.

  • envoy_dogstatsd_url - The same as envoy_statsd_url with the following differences in behavior:

    • Envoy will use dogstatsd tags instead of statsd dot-separated metric names.
    • As well as udp://, a unix:// URL may be specified if your agent can listen on a unix socket (e.g. the dogstatsd agent).
  • envoy_prometheus_bind_addr - Specifies that the proxy should expose a Prometheus metrics endpoint to the public network. It must be supplied in the form ip:port and port and the ip/port combination must be free within the network namespace the proxy runs. Typically the IP would be 0.0.0.0 to bind to all available interfaces or a pod IP address.

    Note: Envoy versions prior to 1.10 do not export timing histograms using the internal Prometheus endpoint.

  • envoy_stats_bind_addr - Specifies that the proxy should expose the /stats prefix to the public network. It must be supplied in the form ip:port and the ip/port combination must be free within the network namespace the proxy runs. Typically the IP would be 0.0.0.0 to bind to all available interfaces or a pod IP address.

  • envoy_stats_tags - Specifies one or more static tags that will be added to all metrics produced by the proxy.

  • envoy_stats_flush_interval - Configures Envoy’s stats_flush_interval.

There are more possibilities available in the Advanced Configuration section that allow incremental or complete control over the bootstrap configuration generated.

Dynamic Configuration

Consul automatically generates Envoy’s dynamic configuration based on its knowledge of the cluster. Users may specify default configuration options for each service such as which protocol they speak. Consul will use this information to configure appropriate proxy settings for that service’s proxies and also for the upstream listeners of any downstream service.

One example is how users can define a service’s protocol in a service-defaults configuration entry. Agents with enable_central_service_config set to true will automatically discover the protocol when configuring a proxy for a service. The proxy will discover the main protocol of the service it represents and use this to configure its main public listener. It will also discover the protocols defined for any of its upstream services and automatically configure its upstream listeners appropriately too as below.

This automated discovery results in Consul auto-populating the proxy.config and proxy.upstreams[*].config fields of the proxy service definition that is actually registered.

To learn about other options that can be configured centrally see the Configuration Entries docs.

Proxy Config Options

These fields may also be overridden explicitly in the proxy service definition, or defined in the global proxy-defaults configuration entry to act as defaults that are inherited by all services.

  • protocol - The protocol the service speaks. Connect’s Envoy integration currently supports the following protocol values:

    • tcp - Unless otherwise specified this is the default, which causes Envoy to proxy at L4. This provides all the security benefits of Connect’s mTLS and works for any TCP-based protocol. Load-balancing and metrics are available at the connection level.

    • http - This specifies that the service speaks HTTP/1.x. Envoy will setup an http_connection_manager and will be able to load-balance requests individually to available upstream services. Envoy will also emit L7 metrics such as request rates broken down by HTTP response code family (2xx, 4xx, 5xx, etc).

    • http2 - This specifies that the service speaks http2 (specifically h2c since Envoy will still only connect to the local service instance via plain TCP not TLS). This behaves much like http with L7 load-balancing and metrics but has additional settings that correctly enable end-to-end http2.

    • grpc - gRPC is a common RPC protocol based on http2. In addition to the http2 support above, Envoy listeners will be configured with a gRPC bridge filter that translates HTTP/1.1 calls into gRPC, and instruments metrics with gRPC-status trailer codes.

      Note: The protocol of a service should ideally be configured via the protocol field of a service-defaults config entry for the service. Configuring it in a proxy config will not fully enable some L7 features. It is supported here for backwards compatibility with Consul versions prior to 1.6.0.

  • bind_address - Override the address Envoy’s public listener binds to. By default Envoy will bind to the service address or 0.0.0.0 if there is not explicit address on the service registration.

  • bind_port - Override the port Envoy’s public listener binds to. By default Envoy will bind to the service port.

  • local_connect_timeout_ms - The number of milliseconds allowed to make connections to the local application instance before timing out. Defaults to 5000 (5 seconds).

Proxy Upstream Config Options

The following configuration items may be overridden directly in the proxy.upstreams[].config field of a proxy service definition or sidecar_service block.

  • protocol - Same as above in main config but affects the listener setup for the upstream.

    Note: The protocol of a service should ideally be configured via the protocol field of a service-defaults config entry for the upstream destination service. Configuring it in a proxy upstream config will not fully enable some L7 features. It is supported here for backwards compatibility with Consul versions prior to 1.6.0.

  • connect_timeout_ms - The number of milliseconds to allow when making upstream connections before timing out. Defaults to 5000 (5 seconds).

    Note: The connection timeout for a service should ideally be configured via the connect_timeout field of a service-resolver config entry for the upstream destination service. Configuring it in a proxy upstream config will override any values defined in config entries. It is supported here for backwards compatibility with Consul versions prior to 1.6.0.

  • limits - A set of limits to apply when connecting to the upstream service. These limits are applied on a per-service-instance basis. The following limits are respected:

    • max_connections - The maximum number of connections a service instance will be allowed to establish against the given upstream. Use this to limit HTTP/1.1 traffic, since HTTP/1.1 has a request per connection.
    • max_pending_requests - The maximum number of requests that will be queued while waiting for a connection to be established. For this configuration to be respected, a L7 protocol must be defined in the protocol field.
    • max_concurrent_requests - The maximum number of concurrent requests that will be allowed at a single point in time. Use this to limit HTTP/2 traffic, since HTTP/2 has many requests per connection. For this configuration to be respected, a L7 protocol must be defined in the protocol field.
  • passive_health_check - Passive health checks are used to remove hosts from the upstream cluster which are unreachable or are returning errors.

    • interval - The time between checks. Each check will cause hosts which have exceeded max_failures to be removed from the load balancer, and any hosts which have passed their ejection time to be returned to the load balancer.
    • max_failures - The number of consecutive failures which cause a host to be removed from the load balancer.

Gateway Options

These fields may also be overridden explicitly in the proxy service definition, or defined in the global proxy-defaults configuration entry to act as defaults that are inherited by all services.

Prior to 1.8.0 these settings were specific to Mesh Gateways. The deprecated names such as envoy_mesh_gateway_bind_addresses and envoy_mesh_gateway_no_default_bind will continue to be supported.

  • connect_timeout_ms - The number of milliseconds to allow when making upstream connections before timing out. Defaults to 5000 (5 seconds). If the upstream service has the configuration option connect_timeout_ms set for the service-resolver, that timeout value will take precedence over this gateway option.

  • envoy_gateway_bind_tagged_addresses - Indicates that the gateway services tagged addresses should be bound to listeners in addition to the default listener address.

  • envoy_gateway_bind_addresses - A map of additional addresses to be bound. This map’s keys are the name of the listeners to be created and the values are a map with two keys, address and port, that combined make the address to bind the listener to. These are bound in addition to the default address.

  • envoy_gateway_no_default_bind - Prevents binding to the default address of the gateway service. This should be used with one of the other options to configure the gateway’s bind addresses.

  • envoy_dns_discovery_type - Determines how Envoy will resolve hostnames. Defaults to LOGICAL_DNS. Must be one of STRICT_DNS or LOGICAL_DNS. Details for each type are available in the Envoy documentation. This option applies to terminating gateways that route to services addressed by a hostname, such as a managed databased. It also applies to mesh gateways, such as when gateways in other Consul datacenters are behind a load balancer that is addressed by a hostname.

Advanced Configuration

To support more flexibility when configuring Envoy, several “lower-level” options exist that require knowledge of Envoy’s configuration format. Many options allow configuring a subsection of either the bootstrap or dynamic configuration using your own custom protobuf config.

We separate these into two sets, Advanced Bootstrap Options and Escape Hatch Overrides. Both require writing Envoy config in the protobuf JSON encoding. Advanced options cover smaller chunks that might commonly need to be set for tasks like configuring tracing. In contrast, escape hatches give almost complete control over the proxy setup, but require operators to manually code the entire configuration in protobuf JSON.

Advanced Topic! This section covers options that allow users to take almost complete control of Envoy’s configuration. We provide these options so users can experiment or take advantage of features not yet fully supported in Consul Connect. We plan to retain this ability in the future, but it should still be considered experimental because it requires in-depth knowledge of Envoy’s configuration format. Users should consider Envoy version compatibility when using these features because they can configure Envoy in ways that are outside of Consul’s control. Incorrect configuration could prevent all proxies in your mesh from functioning correctly, or bypass the security guarantees Connect is designed to enforce.

Configuration Formatting

All configurations are specified as strings containing the serialized proto3 JSON encoding of the specified Envoy configuration type. They are full JSON types except where noted.

The JSON supplied may describe a protobuf types.Any message with an @type field set to the appropriate type (for example type.googleapis.com/envoy.api.v2.Listener), or it may be the direct encoding with no @type field.

For example, given a tracing config:

  1. "tracing": {
  2. "http": {
  3. "name": "envoy.zipkin",
  4. "config": {
  5. "collector_cluster": "zipkin",
  6. "collector_endpoint": "/api/v1/spans",
  7. "shared_span_context": false
  8. }
  9. }
  10. }

JSON escape the value of tracing into a string, for example using https://codebeautify.org/json-escape-unescape, and then use that as the value for envoy_tracing_json:

  1. {
  2. "kind": "proxy-defaults",
  3. "name": "global",
  4. "config": {
  5. "envoy_tracing_json": "{\"http\":{\"name\":\"envoy.zipkin\",\"config\":{\"collector_cluster\":\"zipkin\",\"collector_endpoint\":\"/api/v1/spans\",\"shared_span_context\":false}}}"
  6. }
  7. }

If using HCL, this escaping is done automatically:

  1. Kind = "proxy-defaults"
  2. Name = "global"
  3. Config {
  4. envoy_tracing_json = <<EOF
  5. {
  6. "http": {
  7. "name": "envoy.zipkin",
  8. "config": {
  9. "collector_cluster": "zipkin",
  10. "collector_endpoint": "/api/v1/spans",
  11. "shared_span_context": false
  12. }
  13. }
  14. }
  15. EOF
  16. }

Advanced Bootstrap Options

Users may add the following configuration items to the global proxy-defaults configuration entry or override them directly in the proxy.config field of a proxy service definition or sidecar_service block.

Escape-Hatch Overrides

Users may add the following configuration items to the global proxy-defaults configuration entry or override them directly in the proxy.config field of a proxy service definition or sidecar_service block.

  • envoy_bootstrap_json_tpl - Specifies a template in Go template syntax that is used in place of the default template when generating bootstrap via consul connect envoy command. The variables that are available to be interpolated are documented here. This offers complete control of the proxy’s bootstrap although major deviations from the default template may break Consul’s ability to correctly manage the proxy or enforce it’s security model.
  • envoy_public_listener_json - Specifies a complete Listener to be delivered in place of the main public listener that the proxy used to accept inbound connections. This will be used verbatim with the following exceptions:
    • Every FilterChain added to the listener will have its TlsContext overridden by the Connect TLS certificates and validation context. This means there is no way to override Connect’s mutual TLS for the public listener.
    • Every FilterChain will have the envoy.ext_authz filter prepended to the filters array to ensure that all inbound connections are authorized by Connect.
  • envoy_local_cluster_json - Specifies a complete Envoy cluster to be delivered in place of the local application cluster. This allows customization of timeouts, rate limits, load balancing strategy etc.

The following configuration items may be overridden directly in the proxy.upstreams[].config field of a proxy service definition or sidecar_service block.

Note: - When a service-router, service-splitter, or service-resolver config entry exists for a service the below escape hatches are ignored and will log a warning.

  • envoy_listener_json - Specifies a complete Listener to be delivered in place of the upstream listener that the proxy exposes to the application for outbound connections. This will be used verbatim with the following exceptions:
    • Every FilterChain added to the listener will have its TlsContext overridden by the Connect TLS certificates and validation context. This means there is no way to override Connect’s mutual TLS for the public listener.
  • envoy_cluster_json - Specifies a complete Envoy cluster to be delivered in place of the discovered upstream cluster. This allows customization of timeouts, circuit breaking, rate limits, load balancing strategy etc.