Kubernetes Maintenance Support

Your EMS server-group is configured as a quorum-based StatefulSet.
This has some implications for performing Kubernetes maintenance activities, such as workload rebalancing, node-image upgrades, or security patching.

The basic strategy for minimizing downtime when relocating quorum-based StatefulSets is:

  • Add new replacement nodes.
  • Cordon current nodes - to avoid node-stickiness issues.
  • Delete each follower pod - wait for it to restart and rejoin the quorum.
  • Delete the leader pod last - wait for a new leader to be elected, and the restarted pod to rejoin.

The EMS Details page has a new Redeploy button. This can be used to safely restart all the EMS pods in the correct order to avoid downtime and minimize client reconnects.