Linear Scalability


With YugabyteDB, you can add nodes to scale your cluster out very efficiently and reliably in order to achieve more read and write IOPS. In this tutorial, we will look at how YugabyteDB scales while a workload is running. We will run a read-write workload using a pre-packaged sample application against a 3-node local cluster with a replication factor of 3, and add nodes to it while the workload is running. We will then observe how the cluster scales out by verifying that read and write IOPS remain evenly distributed across all the nodes at all times.

If you haven’t installed YugabyteDB yet, do so first by following the Quick Start guide.

1. Setup - create universe

If you have a previously running local universe, destroy it using the following.

$ ./bin/yb-ctl destroy

Start a new local cluster. By default, this will create a 3-node universe with a replication factor of 3. We configure the number of shards (aka tablets) per table per tserver to 4 so that we can better observe the load balancing during scale-up and scale-down. Each table will then have 4 tablet-leaders on each tserver, and with a replication factor of 3 there will be 2 tablet-followers for each tablet-leader, distributed across the 2 other tservers. So each tserver will host 12 tablets (4 tablet-leaders plus 8 tablet-followers) per table.

$ ./bin/yb-ctl --num_shards_per_tserver 4 create
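
To make the tablet math above concrete, here is the arithmetic spelled out, along with an optional status check. This is a sketch; it assumes a single table and that your yb-ctl version supports the status command.

# Tablet math for this universe, assuming one table:
#   tablets per table    = 4 shards/tserver x 3 tservers     = 12
#   tablet replicas      = 12 tablets x replication factor 3 = 36
#   replicas per tserver = 36 replicas / 3 tservers          = 12 (4 leaders + 8 followers)
# Optional: confirm all 3 nodes are up before starting the workload.
$ ./bin/yb-ctl status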

2. Run sample key-value app

Run the Cassandra sample key-value app against the local universe by typing the following command.

$ java -jar ./java/yb-sample-apps.jar --workload CassandraKeyValue \
    --nodes 127.0.0.1:9042 \
    --num_threads_write 1 \
    --num_threads_read 4 \
    --value_size 4096
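
The driver discovers the remaining nodes from the first contact point, so a single address is enough. If you prefer to list every local node explicitly, you can pass a comma-separated list to --nodes. This is an optional variation; it assumes the default 127.0.0.1 through 127.0.0.3 addresses used by a local yb-ctl cluster.

$ java -jar ./java/yb-sample-apps.jar --workload CassandraKeyValue \
    --nodes 127.0.0.1:9042,127.0.0.2:9042,127.0.0.3:9042 \
    --num_threads_write 1 \
    --num_threads_read 4 \
    --value_size 4096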

The sample application prints some stats while running, as shown below. You can read more details about the output of the sample applications here.

2018-05-10 09:10:19,538 [INFO|...] Read: 8988.22 ops/sec (0.44 ms/op), 818159 total ops | Write: 1095.77 ops/sec (0.91 ms/op), 97120 total ops | ...
2018-05-10 09:10:24,539 [INFO|...] Read: 9110.92 ops/sec (0.44 ms/op), 863720 total ops | Write: 1034.06 ops/sec (0.97 ms/op), 102291 total ops | ...

3. Observe IOPS per node

You can check many of the per-node stats by browsing to the tablet-servers page. It should look like this. The total read and write IOPS per node are highlighted in the screenshot below. Note that both the reads and the writes are roughly the same across all the nodes, indicating uniform usage.

Read and write IOPS with 3 nodes
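
If you prefer the command line to the browser, the same tablet-servers page can be fetched directly from the yb-master web server. This is a quick sanity check; it assumes the master Admin UI is reachable at 127.0.0.1:7000, the default for a local yb-ctl cluster.

$ curl -s http://127.0.0.1:7000/tablet-servers | head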

4. Add node and observe linear scale out

Add a node to the universe.

$ ./bin/yb-ctl --num_shards_per_tserver 4 add_node

Now we should have 4 nodes. Refresh the tablet-servers page to see the stats update. In a short time, you should see the new node performing a comparable number of reads and writes to the other nodes. The 36 tablets will now get distributed evenly across all 4 nodes, leading to each node having 9 tablets.

The YugabyteDB universe automatically lets the client know to use the newly added node for serving queries. This scaling out of client queries is completely transparent to the application logic, allowing the application to scale linearly for both reads and writes.

Read and write IOPS with 4 nodes - Rebalancing in progress

Read and write IOPS with 4 nodes

5. Remove node and observe linear scale in

Remove the recently added node from the universe.

$ ./bin/yb-ctl remove_node 4

  • Refresh the tablet-servers page to see the stats update. The Time since heartbeat value for that node will keep increasing. Once that number reaches 60s (i.e. 1 minute), YugabyteDB will change the status of that node from ALIVE to DEAD. Note that at this time the universe is running in an under-replicated state for some subset of tablets.

Read and write IOPS with 4th node dead

  • After 300s (i.e. 5 minutes), the remaining YugabyteDB nodes will re-spawn the tablet replicas that were lost along with node 4. Each remaining node's tablet count will increase from 9 to 12, thus getting back to the original state of 36 total tablets.

Read and write IOPS with 4th node removed
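
The 60-second and 5-minute thresholds mentioned above are controlled by server gflags. The flag names below are an assumption based on YugabyteDB's documented defaults and may vary by version; if you want different thresholds, they can be passed when the cluster is created.

# Assumed flags: tserver_unresponsive_timeout_ms (master) marks a tserver DEAD after 60s;
# follower_unavailable_considered_failed_sec (tserver) triggers re-replication after 300s.
$ ./bin/yb-ctl --num_shards_per_tserver 4 \
    --master_flags "tserver_unresponsive_timeout_ms=60000" \
    --tserver_flags "follower_unavailable_considered_failed_sec=300" \
    create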

6. Clean up (optional)

Optionally, you can shut down the local cluster created in Step 1.

$ ./bin/yb-ctl destroy

1. Setup - create universe

If you have a previously running local universe, destroy it using the following.

$ ./yb-docker-ctl destroy

Start a new local cluster. By default, this will create a 3-node universe with a replication factor of 3. We configure the number of shards (aka tablets) per table per tserver to 4 so that we can better observe the load balancing during scale-up and scale-down. Each table will then have 4 tablet-leaders on each tserver, and with a replication factor of 3 there will be 2 tablet-followers for each tablet-leader, distributed across the 2 other tservers. So each tserver will host 12 tablets (4 tablet-leaders plus 8 tablet-followers) per table.

$ ./yb-docker-ctl create --num_shards_per_tserver 4
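
You can verify the containers that yb-docker-ctl started with plain docker ps; the container names (yb-master-n1, yb-tserver-n1, and so on) are the same ones referenced later in this tutorial.

$ docker ps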

2. Run sample key-value app

Run the Cassandra sample key-value app against the local universe by typing the following command.

$ docker cp yb-master-n1:/home/yugabyte/java/yb-sample-apps.jar .

$ java -jar ./yb-sample-apps.jar --workload CassandraKeyValue \
    --nodes localhost:9042 \
    --num_threads_write 1 \
    --num_threads_read 4 \
    --value_size 4096

The sample application prints some stats while running, as shown below. You can read more details about the output of the sample applications here.

2017-11-20 14:02:48,114 [INFO|...] Read: 9893.73 ops/sec (0.40 ms/op), 233458 total ops | Write: 1155.83 ops/sec (0.86 ms/op), 28072 total ops | ...
2017-11-20 14:02:53,118 [INFO|...] Read: 9639.85 ops/sec (0.41 ms/op), 281696 total ops | Write: 1078.74 ops/sec (0.93 ms/op), 33470 total ops | ...

3. Observe IOPS per node

You can check many of the per-node stats by browsing to the tablet-servers page. It should look like this. The total read and write IOPS per node are highlighted in the screenshot below. Note that both the reads and the writes are roughly the same across all the nodes, indicating uniform usage.

Read and write IOPS with 3 nodes

4. Add node and observe linear scale out

Add a node to the universe.

$ ./yb-docker-ctl add_node --num_shards_per_tserver 4

Now we should have 4 nodes. Refresh the tablet-servers page to see the stats update. In a short time, you should see the new node performing a comparable number of reads and writes as the other nodes.

Read and write IOPS with 4 nodes - Rebalancing in progress

Read and write IOPS with 4 nodes - Balanced

5. Remove node and observe linear scale in

Remove the recently added node from the universe.

$ ./yb-docker-ctl remove_node 4

  • Refresh the tablet-servers page to see the stats update. The Time since heartbeat value for that node will keep increasing. Once that number reaches 60s (i.e. 1 minute), YugabyteDB will change the status of that node from ALIVE to DEAD. Note that at this time the universe is running in an under-replicated state for some subset of tablets.

Read and write IOPS with 4th node dead

  • After 300s (i.e. 5 minutes), the remaining YugabyteDB nodes will re-spawn the tablet replicas that were lost along with node 4. Each remaining node's tablet count will increase from 18 to 24.

6. Clean up (optional)

Optionally, you can shut down the local cluster created in Step 1.

$ ./yb-docker-ctl destroy

1. Setup - create universe

If you have a previously running local universe, destroy it using the following.

$ kubectl delete -f yugabyte-statefulset.yaml

Start a new local cluster. By default, this will create a 3-node universe with a replication factor of 3.

$ kubectl apply -f yugabyte-statefulset.yaml

Check the Kubernetes dashboard to see the 3 yb-tserver and 3 yb-master pods representing the 3 nodes of the cluster.

$ minikube dashboard

Kubernetes Dashboard
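
If you would rather stay on the command line than open the dashboard, the same information is available from standard kubectl commands (the pod and service names come from yugabyte-statefulset.yaml):

$ kubectl get pods
$ kubectl get services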

2. Check cluster status with Admin UI

To check the cluster status, access the Admin UI on port 7000, which is exposed by any of the pods in the yb-master service (one of yb-master-0, yb-master-1, or yb-master-2). To do so, find the URL for the yb-master-ui LoadBalancer service.

$ minikube service yb-master-ui --url
http://192.168.99.100:31283

The yb-master-0 Admin UI is now available at the above URL.
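
As with the local clusters above, you can also fetch the tablet-servers page over HTTP instead of using the browser. This sketch assumes the service URL printed by minikube resolves from your shell.

$ curl -s "$(minikube service yb-master-ui --url)/tablet-servers" | head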

3. Add node and observe linear scale out

Add a node to the universe.

$ kubectl scale statefulset yb-tserver --replicas=4

Now we should have 4 nodes. Refresh the tablet-servers page to see the stats update. YugabyteDB automatically updates application clients to use the newly added node for serving queries. This scaling out of client queries is completely transparent to the application logic, allowing the application to scale linearly for both reads and writes.

You can also observe the newly added node using the following command.

$ kubectl get pods

NAME           READY     STATUS    RESTARTS   AGE
yb-master-0    1/1       Running   0          5m
yb-master-1    1/1       Running   0          5m
yb-master-2    1/1       Running   0          5m
yb-tserver-0   1/1       Running   1          5m
yb-tserver-1   1/1       Running   1          5m
yb-tserver-2   1/1       Running   0          5m
yb-tserver-3   1/1       Running   0          4m
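
Another way to confirm the scale-out is to look at the StatefulSet itself; kubectl reports the desired and ready replica counts (yb-tserver is the StatefulSet name used by yugabyte-statefulset.yaml):

$ kubectl get statefulset yb-tserver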

4. Scale back down to 3 nodes

The cluster can now be scaled back to only 3 nodes.

$ kubectl scale statefulset yb-tserver --replicas=3

$ kubectl get pods

NAME           READY     STATUS        RESTARTS   AGE
yb-master-0    1/1       Running       0          6m
yb-master-1    1/1       Running       0          6m
yb-master-2    1/1       Running       0          6m
yb-tserver-0   1/1       Running       1          6m
yb-tserver-1   1/1       Running       1          6m
yb-tserver-2   1/1       Running       0          6m
yb-tserver-3   1/1       Terminating   0          5m
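
If you want to watch the scale-down complete, you can stream pod status until yb-tserver-3 disappears (standard kubectl; press Ctrl-C to stop watching):

$ kubectl get pods -w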

5. Clean up (optional)

Optionally, you can shut down the local cluster created in Step 1.

$ kubectl delete -f yugabyte-statefulset.yaml