Skip to content
Snippets Groups Projects

Riak KV Client for Yahoo! Cloud System Benchmark (YCSB)

The Riak KV YCSB client is designed to work with the Yahoo! Cloud System Benchmark (YCSB) project (https://github.com/brianfrankcooper/YCSB) to support performance testing for the 2.0.X line of the Riak KV database.

Creating a bucket type to use with YCSB

Perform the following operations on your Riak cluster to configure it for the benchmarks.

Set the default backend for Riak to LevelDB in the riak.conf file of every node of your cluster. This is required to support secondary indexes, which are used for the scan transactions. You can do this by modifying the proper line as shown below.

storage_backend = leveldb

Create a bucket type named "ycsb"1 by logging into one of the nodes in your cluster. Then, if you want to use the strong consistency model (default), you have to follow the next two steps.

  1. In every riak.conf file, search for the ##strong_consistency=on line and uncomment it. It is important that you do this before you start your cluster!

  2. Run the following riak-admin commands:

    riak-admin bucket-type create ycsb '{"props":{"allow_mult":"false","consistent":true}}'
    riak-admin bucket-type activate ycsb

If instead you want to use the eventual consistency model implemented in Riak, then type:

riak-admin bucket-type create ycsb '{"props":{"allow_mult":"false"}}'
riak-admin bucket-type activate ycsb

Note that you may want to specify the number of replicas to create for each object. To do so, you can add "n_val":N to the list of properties shown above (by default N is set to 3).

Riak KV configuration parameters

You can either specify these configuration parameters via command line or set them in the riak.properties file.

  • riak.hosts - string list, comma separated list of IPs or FQDNs. Example: riak.hosts=127.0.0.1,127.0.0.2,127.0.0.3 or riak.hosts=riak1.mydomain.com,riak2.mydomain.com,riak3.mydomain.com.
  • riak.port - int, the port on which every node is listening. It must match the one specified in the riak.conf file at the line listener.protobuf.internal.
  • riak.bucket_type - string, it must match the value of the bucket type created during setup (see section above).
  • riak.r_val - int, the R value represents the number of Riak nodes that must return results for a read before the read is considered successful.
  • riak.w_val - int, the W value represents the number of Riak nodes that must report success before an update is considered complete.
  • riak.read_retry_count - int, the number of times the client will try to read a key from Riak.
  • riak.wait_time_before_retry - int, the time (in milliseconds) before client attempts to perform another read if the previous one failed.
  • riak.transaction_time_limit - int, the time (in seconds) the client waits before aborting the current transaction.
  • riak.strong_consistency - boolean, indicates whether to use strong consistency (true) or eventual consistency (false).
  • riak.debug - boolean, enables debug mode. This displays all the properties (specified or defaults) when a benchmark is started. Moreover, it shows error causes whenever these occur.

Note: For more information on workloads and how to run them please see: https://github.com/brianfrankcooper/YCSB/wiki/Running-a-Workload

1 As specified in the riak.properties file. See parameters configuration section for further info. ↩️