Ceph NFS Gateway CRD

Overview

Rook allows exporting NFS shares of the filesystem or object store through the CephNFS custom resource definition. This will spin up a cluster of NFS Ganesha servers that coordinate with one another via shared RADOS objects. The servers will be configured for NFSv4.1+ access, as serving earlier protocols can inhibit responsiveness after a server restart.

Samples

The following sample will create a two-node active-active cluster of NFS Ganesha gateways. A CephFS named myfs is used, and the recovery objects are stored in a RADOS pool named myfs-data0 with a RADOS namespace of nfs-ns.

This example requires the filesystem to first be configured by the Filesystem.

NOTE: For an RGW object store, a data pool of my-store.rgw.buckets.data can be used after configuring the Object Store.

  1. apiVersion: ceph.rook.io/v1
  2. kind: CephNFS
  3. metadata:
  4. name: my-nfs
  5. namespace: rook-ceph
  6. spec:
  7. rados:
  8. # RADOS pool where NFS client recovery data is stored.
  9. pool: myfs-data0
  10. # RADOS namespace where NFS client recovery data is stored in the pool.
  11. namespace: nfs-ns
  12. # Settings for the NFS server
  13. server:
  14. # the number of active NFS servers
  15. active: 2
  16. # A key/value list of annotations
  17. annotations:
  18. # key: value
  19. # where to run the NFS server
  20. placement:
  21. # nodeAffinity:
  22. # requiredDuringSchedulingIgnoredDuringExecution:
  23. # nodeSelectorTerms:
  24. # - matchExpressions:
  25. # - key: role
  26. # operator: In
  27. # values:
  28. # - mds-node
  29. # tolerations:
  30. # - key: mds-node
  31. # operator: Exists
  32. # podAffinity:
  33. # podAntiAffinity:
  34. # topologySpreadConstraints:
  35. # The requests and limits set here allow the ganesha pod(s) to use half of one CPU core and 1 gigabyte of memory
  36. resources:
  37. # limits:
  38. # cpu: "500m"
  39. # memory: "1024Mi"
  40. # requests:
  41. # cpu: "500m"
  42. # memory: "1024Mi"
  43. # the priority class to set to influence the scheduler's pod preemption
  44. priorityClassName:

NFS Settings

RADOS Settings

  • pool: The pool where ganesha recovery backend and supplemental configuration objects will be stored
  • namespace: The namespace in pool where ganesha recovery backend and supplemental configuration objects will be stored

EXPORT Block Configuration

Each daemon will have a stock configuration with no exports defined, and that includes a RADOS object via:

  1. %url rados://<pool>/<namespace>/conf-<nodeid>

The pool and namespace are configured via the spec’s RADOS block. The nodeid is a value automatically assigned internally by rook. Nodeids start with “a” and go through “z”, at which point they become two letters (“aa” to “az”).

When a server is started, it will create the included object if it does not already exist. It is possible to prepopulate the included objects prior to starting the server. The format for these objects is documented in the NFS Ganesha project.

Scaling the active server count

It is possible to scale the size of the cluster up or down by modifying the spec.server.active field. Scaling the cluster size up can be done at will. Once the new server comes up, clients can be assigned to it immediately.

The CRD always eliminates the highest index servers first, in reverse order from how they were started. Scaling down the cluster requires that clients be migrated from servers that will be eliminated to others. That process is currently a manual one and should be performed before reducing the size of the cluster.