WAN Federation Between VMs and Kubernetes Through Mesh Gateways

1.8.0+: This feature is available in Consul versions 1.8.0 and higher

This topic requires familiarity with Mesh Gateways and WAN Federation Via Mesh Gateways.

Consul datacenters running on non-Kubernetes platforms like VMs or bare metal can be federated with Kubernetes datacenters. Just like with Kubernetes, one datacenter must be the primary.

Kubernetes as the Primary

If your primary datacenter is running on Kubernetes, use the Helm config from the Primary Datacenter section to install Consul.

Once installed on Kubernetes, and with the ProxyDefaults resource created, you’ll need to export the following information from the primary Kubernetes cluster:

  • Certificate authority cert and key (in order to create TLS certs for VMs)
  • External addresses of Kubernetes mesh gateways
  • Replication ACL token
  • Gossip encryption key

The following sections detail how to export this data.

Certificates

  1. Retrieve the certificate authority cert:

    kubectl get secrets/consul-ca-cert --namespace consul --template='{{index .data "tls.crt" | base64decode }}' > consul-agent-ca.pem
  2. And the certificate authority signing key:

    kubectl get secrets/consul-ca-key --namespace consul --template='{{index .data "tls.key" | base64decode }}' > consul-agent-ca-key.pem
  3. With the consul-agent-ca.pem and consul-agent-ca-key.pem files you can create certificates for your servers and clients running on VMs that share the same certificate authority as your Kubernetes servers.

    You can use the consul tls commands to generate those certificates:

    # NOTE: consul-agent-ca.pem and consul-agent-ca-key.pem must be in the current
    # directory.
    $ consul tls cert create -server -dc=vm-dc -node <node_name>
    ==> WARNING: Server Certificates grants authority to become a
        server and access all state in the cluster including root keys
        and all ACL tokens. Do not distribute them to production hosts
        that are not server nodes. Store them as securely as CA keys.
    ==> Using consul-agent-ca.pem and consul-agent-ca-key.pem
    ==> Saved vm-dc-server-consul-0.pem
    ==> Saved vm-dc-server-consul-0-key.pem

    Note the -node option in the above command. Its value should be the same as the node name of the Consul agent; this is a requirement for Consul federation to work. Alternatively, if you plan to use the same certificate and key pair on all your Consul server nodes, or you don’t know the node name in advance, use -node "*" instead. If this requirement is not satisfied, the following error appears in the Consul server logs: [ERROR] agent.server.rpc: TLS handshake failed: conn=from= error="remote error: tls: bad certificate"

    Run consul tls cert create -h to see more options for generating server certificates.

  4. These certificates can be used in your server config file:

    server.hcl

    tls {
      defaults {
        cert_file = "vm-dc-server-consul-0.pem"
        key_file = "vm-dc-server-consul-0-key.pem"
        ca_file = "consul-agent-ca.pem"
      }
    }
  5. For clients, you can generate TLS certs with:

    $ consul tls cert create -client
    ==> Using consul-agent-ca.pem and consul-agent-ca-key.pem
    ==> Saved dc1-client-consul-0.pem
    ==> Saved dc1-client-consul-0-key.pem

    Or use the auto_encrypt feature.
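
Before distributing the generated files, it can help to confirm that each certificate actually chains to the CA exported from Kubernetes. The following is a minimal sketch, assuming the openssl CLI is installed on the machine; verify_against_ca is a hypothetical helper name, not part of Consul.

```shell
# Hypothetical helper: succeeds when <cert-file> chains to <ca-file>.
# Usage: verify_against_ca <ca-file> <cert-file>
verify_against_ca() {
  ca_file=$1
  cert_file=$2
  # openssl prints "<cert-file>: OK" on success; match on that line rather
  # than relying only on the exit status.
  openssl verify -CAfile "$ca_file" "$cert_file" 2>/dev/null | grep -q ': OK$'
}

# Example (file names taken from the steps above):
#   verify_against_ca consul-agent-ca.pem vm-dc-server-consul-0.pem && echo "chain OK"
```

A failing check here usually means the certificate was generated against a different CA file than the one retrieved from the cluster.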

Mesh Gateway Addresses

Retrieve the WAN addresses of the mesh gateways:

  $ kubectl exec statefulset/consul-server --namespace consul -- sh -c \
    'curl --silent --insecure https://localhost:8501/v1/catalog/service/mesh-gateway | jq ".[].ServiceTaggedAddresses.wan"'
  {
    "Address": "1.2.3.4",
    "Port": 443
  }
  {
    "Address": "1.2.3.4",
    "Port": 443
  }

In this example, the addresses are the same because both mesh gateway pods are fronted by the same Kubernetes load balancer.

These addresses will be used in the server config for the primary_gateways setting:

  primary_gateways = ["1.2.3.4:443"]
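
When several gateway entries come back, the address list can be deduplicated and formatted in one step. This is a sketch only, assuming jq is installed locally and that the input is the unfiltered JSON array returned by the catalog endpoint; format_primary_gateways is a hypothetical helper name.

```shell
# Hypothetical helper: reads the /v1/catalog/service/mesh-gateway JSON array
# on stdin and prints a primary_gateways line with unique "address:port" entries.
format_primary_gateways() {
  jq -r '[.[].ServiceTaggedAddresses.wan | "\(.Address):\(.Port)"]
         | unique
         | "primary_gateways = \(tojson)"'
}

# Example: pipe the raw catalog response (without the jq filter used above) in:
#   kubectl exec statefulset/consul-server --namespace consul -- sh -c \
#     'curl --silent --insecure https://localhost:8501/v1/catalog/service/mesh-gateway' \
#     | format_primary_gateways
```

With the two identical entries from the example output, the helper emits a single "1.2.3.4:443" element, matching the setting shown above.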

Replication ACL Token

If ACLs are enabled, you’ll also need the replication ACL token:

  $ kubectl get secrets/consul-acl-replication-acl-token --namespace consul --template='{{.data.token | base64decode}}'
  e7924dd1-dc3f-f644-da54-81a73ba0a178

This token will be used in the server config for the replication token:

  acl {
    tokens {
      replication = "e7924dd1-dc3f-f644-da54-81a73ba0a178"
    }
  }

NOTE: You’ll also need to set up additional ACL tokens as needed by the ACL system. See the tutorial Secure Consul with Access Control Lists (ACLs) for more information.

Gossip Encryption Key

If gossip encryption is enabled, you’ll need the key as well. The command to retrieve the key will depend on which Kubernetes secret you’ve stored it in.

This key will be used in server and client configs for the encrypt setting:

  encrypt = "uF+GsbI66cuWU21kiXLze5JLEX5j4iDFlDTb0ZWNpDI="
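
A key copied out of a Kubernetes secret can pick up stray whitespace or be decoded one time too many, so a quick sanity check before pasting it into the config can save a debugging round. The sketch below assumes the key has the 32-byte length that consul keygen produces; is_valid_gossip_key is a hypothetical helper name.

```shell
# Hypothetical helper: succeeds when the argument is base64 that decodes
# to exactly 32 bytes (the length assumed here for a gossip key).
is_valid_gossip_key() {
  key=$1
  # Decode, count the resulting bytes, and strip the padding wc may add.
  decoded_len=$(printf '%s' "$key" | base64 --decode 2>/dev/null | wc -c | tr -d '[:space:]')
  [ "$decoded_len" -eq 32 ]
}

# Example:
#   is_valid_gossip_key "uF+GsbI66cuWU21kiXLze5JLEX5j4iDFlDTb0ZWNpDI=" && echo "key looks valid"
```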

Final Configuration

A final example server config file might look like:

  # From above
  tls {
    defaults {
      cert_file = "vm-dc-server-consul-0.pem"
      key_file = "vm-dc-server-consul-0-key.pem"
      ca_file = "consul-agent-ca.pem"
    }
    internal_rpc {
      verify_incoming = true
      verify_outgoing = true
      verify_server_hostname = true
    }
  }
  primary_gateways = ["1.2.3.4:443"]
  acl {
    enabled = true
    default_policy = "deny"
    down_policy = "extend-cache"
    tokens {
      agent = "e7924dd1-dc3f-f644-da54-81a73ba0a178"
      replication = "e7924dd1-dc3f-f644-da54-81a73ba0a178"
    }
  }
  encrypt = "uF+GsbI66cuWU21kiXLze5JLEX5j4iDFlDTb0ZWNpDI="

  # Other server settings
  server = true
  datacenter = "vm-dc"
  data_dir = "/opt/consul"
  enable_central_service_config = true
  primary_datacenter = "dc1"
  connect {
    enabled = true
    enable_mesh_gateway_wan_federation = true
  }
  ports {
    https = 8501
    http = -1
    grpc = 8502
  }

Kubernetes as the Secondary

If you’re running your primary datacenter on VMs then you’ll need to manually construct the Federation Secret in order to federate Kubernetes clusters as secondaries.

Your VM cluster must be running mesh gateways, and have mesh gateway WAN federation enabled. See WAN Federation via Mesh Gateways.

You’ll need:

  1. The root certificate authority cert placed in consul-agent-ca.pem.

  2. The root certificate authority key placed in consul-agent-ca-key.pem.

  3. The IP addresses of the mesh gateways running in your VM datacenter. These must be routable from the Kubernetes cluster.

  4. If ACLs are enabled you must create an ACL replication token with the following rules:

    acl = "write"
    operator = "write"
    agent_prefix "" {
      policy = "read"
    }
    node_prefix "" {
      policy = "write"
    }
    service_prefix "" {
      policy = "read"
      intentions = "read"
    }

    This token is used for ACL replication and for automatic ACL management in Kubernetes.

    If you’re running Consul Enterprise you’ll need the rules:

    operator = "write"
    agent_prefix "" {
      policy = "read"
    }
    node_prefix "" {
      policy = "write"
    }
    namespace_prefix "" {
      acl = "write"
      service_prefix "" {
        policy = "read"
        intentions = "read"
      }
    }
  5. If ACLs are enabled you’ll also need to modify the anonymous token policy to have the following permissions:

    node_prefix "" {
      policy = "read"
    }
    service_prefix "" {
      policy = "read"
    }

    With Consul Enterprise, use:

    partition_prefix "" {
      namespace_prefix "" {
        node_prefix "" {
          policy = "read"
        }
        service_prefix "" {
          policy = "read"
        }
      }
    }

    These permissions are needed to allow cross-datacenter requests. To make a cross-DC request, the sidecar proxy in the originating DC needs to know about the services running in the remote DC. To do so, it needs an ACL token that allows it to look up the services in the remote DC. Because of the way tokens are created in Kubernetes, the sidecar proxies have local ACL tokens, that is, tokens that are only valid in the local DC. When a request goes from one DC to another, a local token is stripped from the request because the remote DC won’t be able to validate it. When the request lands in the other DC, it has no ACL token and so is subject to the anonymous token policy. This is why the anonymous token policy must be configured to allow read access to all services. When the Kubernetes DC is the primary, this is handled automatically, but when the primary DC is on VMs, this must be configured manually.

    To configure the anonymous token policy, first create a policy with the above rules, then attach it to the anonymous token. For example, using the CLI:

    echo 'node_prefix "" {
      policy = "read"
    }
    service_prefix "" {
      policy = "read"
    }' | consul acl policy create -name anonymous -rules -

    consul acl token update -id 00000000-0000-0000-0000-000000000002 -policy-name anonymous
  6. If gossip encryption is enabled, you’ll need the key.

With that data ready, you can create the Kubernetes federation secret:

  # If ACLs are enabled, also append:
  #   --from-literal=replicationToken="<your acl replication token>"
  # If using gossip encryption, also append:
  #   --from-literal=gossipEncryptionKey="<your gossip encryption key>"
  kubectl create secret generic consul-federation \
      --from-literal=caCert="$(cat consul-agent-ca.pem)" \
      --from-literal=caKey="$(cat consul-agent-ca-key.pem)"
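
Because the optional flags depend on which features are enabled, the command can also be assembled conditionally instead of edited by hand. The following is a sketch under stated assumptions: REPLICATION_TOKEN, GOSSIP_KEY, and KUBECTL are hypothetical environment variable names, create_federation_secret is a hypothetical helper, and the CA files are passed with kubectl's --from-file=key=path form rather than --from-literal.

```shell
# Hypothetical helper: creates the consul-federation secret, adding the
# optional keys only when the matching variable is set and non-empty.
# KUBECTL (assumed name) lets you substitute another binary, e.g. for a dry run.
create_federation_secret() {
  set -- create secret generic consul-federation \
    --from-file=caCert=consul-agent-ca.pem \
    --from-file=caKey=consul-agent-ca-key.pem
  if [ -n "${REPLICATION_TOKEN:-}" ]; then
    set -- "$@" --from-literal=replicationToken="$REPLICATION_TOKEN"
  fi
  if [ -n "${GOSSIP_KEY:-}" ]; then
    set -- "$@" --from-literal=gossipEncryptionKey="$GOSSIP_KEY"
  fi
  "${KUBECTL:-kubectl}" "$@"
}

# Example: print the command arguments without contacting the cluster.
#   KUBECTL=echo REPLICATION_TOKEN="<your acl replication token>" create_federation_secret
```

Reading the PEM files with --from-file avoids expanding their multi-line contents on the command line; the secret keys (caCert, caKey, replicationToken, gossipEncryptionKey) are the same ones the Helm config refers to.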

If ACLs are enabled, you must next determine the Kubernetes API URL for the secondary cluster. The API URL of the secondary cluster must be specified in the config files for all secondary clusters because secondary clusters need to create global Consul ACL tokens (tokens that are valid in all datacenters), and these tokens can only be created by the primary datacenter. By setting the API URL, the secondary cluster will configure a Consul auth method in the primary cluster so that components in the secondary cluster can use their Kubernetes ServiceAccount tokens to retrieve global Consul ACL tokens from the primary.

To determine the Kubernetes API URL, first get the cluster name in your kubeconfig:

  $ export CLUSTER=$(kubectl config view -o jsonpath="{.contexts[?(@.name == \"$(kubectl config current-context)\")].context.cluster}")

Then get the API URL:

  $ kubectl config view -o jsonpath="{.clusters[?(@.name == \"$CLUSTER\")].cluster.server}"
  https://<some-url>

You’ll use this URL when setting global.federation.k8sAuthMethodHost.

Then use the following Helm config file:

  global:
    name: consul
    datacenter: dc2
    tls:
      enabled: true
      caCert:
        secretName: consul-federation
        secretKey: caCert
      caKey:
        secretName: consul-federation
        secretKey: caKey

    # Delete this acls section if ACLs are disabled.
    acls:
      manageSystemACLs: true
      replicationToken:
        secretName: consul-federation
        secretKey: replicationToken

    federation:
      enabled: true
      k8sAuthMethodHost: <kubernetes-api-url>
      primaryDatacenter: dc1

    # Delete this gossipEncryption section if gossip encryption is disabled.
    gossipEncryption:
      secretName: consul-federation
      secretKey: gossipEncryptionKey

  connectInject:
    enabled: true
  meshGateway:
    enabled: true
  server:
    extraConfig: |
      {
        "primary_gateways": ["<ip of your VM mesh gateway>", "<other ip>", ...]
      }

Notes:

  1. You must fill out the server.extraConfig section with the IPs of your mesh gateways running on VMs.
  2. Set global.federation.k8sAuthMethodHost to the Kubernetes API URL of this cluster (including https://).
  3. global.federation.primaryDatacenter should be set to the name of your primary datacenter.

With your config file ready to go, follow our Installation Guide to install Consul on your secondary cluster(s).

After installation, if you’re using consul-helm 0.30.0+, create the ProxyDefaults resource to allow traffic between datacenters.

Next Steps

In both cases (Kubernetes as primary or secondary), after installation, follow the Verifying Federation section to verify that federation is working as expected.