Cluster Troubleshooting

  • Cluster frontend is unresponsive
    • Check whether the Coordinator(s) in question have been started.
    • Check whether the Agency is up and a leader has been elected. If not, ensure that all or a majority of Agents are up and running.
    • Check whether all processes have been started with the same JWT_SECRET. If not, ensure that the JWT_SECRET used across the cluster nodes is identical for every process.
    • Check whether all cluster nodes have been started with SSL consistently enabled or consistently disabled. If not, decide which mode of operation you would like to run your cluster in, and stick with it for all Agents, Coordinators and DBServers.
    • Check whether network communication between the cluster nodes allows all processes to access their peers directly. Do not operate proxies between the cluster nodes.
  • Cluster frontend reports errors on any number of nodes
    • This is an indication that the Agency is running but either Coordinators or DBServers are disconnected or shut down. Establish a network connection to the affected nodes, or start them.
    • Make sure that the nodes in question share the same JWT_SECRET and SSL operation mode with the functioning nodes.

Dig deeper into cluster troubleshooting by going through the ArangoDB Cluster Administration Course.
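
Two of the checks above, consistent SSL mode and direct network reachability, can be partly automated. The following is a minimal sketch and not part of ArangoDB's tooling: it assumes you keep a list of your nodes' endpoints written as tcp://host:port or ssl://host:port, and the function names are hypothetical. A successful TCP connect only shows that the port is reachable from the machine running the script; it does not show that the process is healthy or that the JWT_SECRET matches.

```python
import socket
from urllib.parse import urlsplit


def parse_endpoint(endpoint):
    """Split an endpoint such as 'tcp://10.0.0.1:8529' into (scheme, host, port)."""
    parts = urlsplit(endpoint)
    if parts.scheme not in ("tcp", "ssl") or parts.hostname is None or parts.port is None:
        raise ValueError(f"unsupported endpoint: {endpoint!r}")
    return parts.scheme, parts.hostname, parts.port


def ssl_mode_is_consistent(endpoints):
    """Return True if all endpoints use the same scheme (all tcp or all ssl)."""
    schemes = {parse_endpoint(e)[0] for e in endpoints}
    return len(schemes) <= 1


def is_reachable(endpoint, timeout=2.0):
    """Attempt a plain TCP connect to the endpoint's host and port."""
    _, host, port = parse_endpoint(endpoint)
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name resolution failures.
        return False


def unreachable(endpoints, timeout=2.0):
    """Return the endpoints that could not be reached."""
    return [e for e in endpoints if not is_reachable(e, timeout)]
```

Run the script from each cluster node in turn, passing the endpoints of all other nodes to unreachable(); a node that appears in the result from some machines but not others points to a routing or firewall problem between those peers. If ssl_mode_is_consistent() returns False for your endpoint list, some nodes were configured with a different SSL mode than the rest.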