Capacity Planning

  • How many nodes?

  • The basic starting point is two nodes with 2 cores and 4GB of memory on each node

  • For fault tolerance perspective three nodes is more appropriate for any cluster

  • What's better more nodes or bigger nodes?

  • More nodes equals IO, Memory, and GC (garbage collector) distributed processing

  • Common pitfall with distributed databases - stressing common storage e.g. SAN (system attached storage)

  • Bigger nodes means more processing can be performed on a node with fast access to in-memory data and faster local IO

  • Resizing node in production is likely more challenging than adding a new node to the cluster

  • How high can it go? Pretty high

  • So which one it is going to be: more smaller nodes or less larger nodes?

Last updated

Was this helpful?