Riak KV Client for Yahoo! Cloud System Benchmark (YCSB)
The Riak KV YCSB client is designed to work with the Yahoo! Cloud System Benchmark (YCSB) project (https://github.com/brianfrankcooper/YCSB) to support performance testing for the 2.x.y line of the Riak KV database.
Creating a bucket type to use with YCSB
Perform the following operations on your Riak cluster to configure it for the benchmarks.
Set the default backend for Riak to LevelDB in the riak.conf
file of every node of your cluster. This is required to support secondary indexes, which are used for the scan
transactions. You can do this by modifying the proper line as shown below.
storage_backend = leveldb
Create a bucket type named "ycsb"1 by logging into one of the nodes in your cluster. Then, if you want to use the eventual consistency model (default), type:
riak-admin bucket-type create ycsb '{"props":{"allow_mult":"false"}}'
riak-admin bucket-type activate ycsb
If instead you want to use the strong consistency model2 implemented in Riak, then you have to follow the next two steps.
-
In every
riak.conf
file, search for the##strong_consistency=on
line and uncomment it. It is important that you do this before you start your cluster! -
Run the following riak-admin commands:
riak-admin bucket-type create ycsb '{"props":{"allow_mult":"false","consistent":true}}'
riak-admin bucket-type activate ycsb
Note that you may want to specify the number of replicas to create for each object. To do so, you can add "n_val":N
to the list of properties shown above (by default N
is set to 3).
Riak KV configuration parameters
You can either specify these configuration parameters via command line or set them in the riak.properties
file.
-
riak.hosts
- string list, comma separated list of IPs or FQDNs. Example:riak.hosts=127.0.0.1,127.0.0.2,127.0.0.3
orriak.hosts=riak1.mydomain.com,riak2.mydomain.com,riak3.mydomain.com
. -
riak.port
- int, the port on which every node is listening. It must match the one specified in theriak.conf
file at the linelistener.protobuf.internal
. -
riak.bucket_type
- string, it must match the value of the bucket type created during setup (see section above). -
riak.r_val
- int, the R value represents the number of Riak nodes that must return results for a read before the read is considered successfully completed. -
riak.w_val
- int, the W value represents the number of Riak nodes that must report success before an update is considered complete. -
riak.read_retry_count
- int, the number of times the client will try to read a key from Riak. -
riak.wait_time_before_retry
- int, the time (in milliseconds) before client attempts to perform another read if the previous one failed. -
riak.transaction_time_limit
- int, the time (in seconds) the client waits before aborting the current transaction. -
riak.strong_consistency
- boolean, indicates whether to use strong consistency (true) or eventual consistency (false). -
riak.debug
- boolean, enables debug mode. This displays all the properties (specified or defaults) when a benchmark is started. Moreover, it shows error causes whenever these occur.
Note: For more information on workloads and how to run them please see: https://github.com/brianfrankcooper/YCSB/wiki/Running-a-Workload
1 As specified in the riak.properties
file. See parameters configuration section for further info. ↩️
2 IMPORTANT NOTE: Currently the scan
transactions are NOT SUPPORTED for the benchmarks which use the strong consistency model! However this will not cause the benchmark to fail, but simply it won't perform any scan transaction at all. ↩️