Skip to content
Snippets Groups Projects
user avatar
nygard_89 authored
[riak] R, W, etc. options are ignored for strong-consistent updates. So, it's useless to specify them.
[riak] Updated README.md with new considerations about benchmarks which use strong consistency model.
465fb7ef
Name Last commit Last update
..
src
README.md
pom.xml

Riak KV Client for Yahoo! Cloud System Benchmark (YCSB)

The Riak KV YCSB client is designed to work with the Yahoo! Cloud System Benchmark (YCSB) project (https://github.com/brianfrankcooper/YCSB) to support performance testing for the 2.x.y line of the Riak KV database.

Creating a bucket type to use with YCSB

Perform the following operations on your Riak cluster to configure it for the benchmarks.

Set the default backend for Riak to LevelDB in the riak.conf file of every node of your cluster. This is required to support secondary indexes, which are used for the scan transactions. You can do this by modifying the proper line as shown below.

storage_backend = leveldb

Now, create a bucket type named "ycsb"1 by logging into one of the nodes in your cluster. Then, to use the strong consistency model2 (default), you need to follow the next two steps.

  1. In every riak.conf file, search for the ##strong_consistency=on line and uncomment it. It's important that you do this before you start your cluster!
  2. Run the following riak-admin commands:
riak-admin bucket-type create ycsb '{"props":{"consistent":true}}'
riak-admin bucket-type activate ycsb

Note that when using the strong consistency model, you may have to specify the number of replicas to create for each object. The R and W parameters (see next section) will in fact be ignored. The only information needed by this consistency model is how many nodes the system has to successfully query to consider a transaction completed. To set this parameter, you can add "n_val":N to the list of properties shown above (by default N is set to 3).

If instead you want to use the eventual consistency model implemented in Riak, then type:

riak-admin bucket-type create ycsb '{"props":{"allow_mult":"false"}}'
riak-admin bucket-type activate ycsb

Riak KV configuration parameters

You can either specify these configuration parameters via command line or set them in the riak.properties file.

  • riak.hosts - string list, comma separated list of IPs or FQDNs. For example: riak.hosts=127.0.0.1,127.0.0.2,127.0.0.3 or riak.hosts=riak1.mydomain.com,riak2.mydomain.com,riak3.mydomain.com.
  • riak.port - int, the port on which every node is listening. It must match the one specified in the riak.conf file at the line listener.protobuf.internal.
  • riak.bucket_type - string, it must match the name of the bucket type created during setup (see section above).
  • riak.r_val - int, this value represents the number of Riak nodes that must return results for a read operation before the transaction is considered successfully completed.
  • riak.w_val - int, this value represents the number of Riak nodes that must report success before an insert/update transaction is considered complete.
  • riak.read_retry_count - int, the number of times the client will try to read a key from Riak.
  • riak.wait_time_before_retry - int, the time (in milliseconds) before the client attempts to perform another read if the previous one failed.
  • riak.transaction_time_limit - int, the time (in seconds) the client waits before aborting the current transaction.
  • riak.strong_consistency - boolean, indicates whether to use strong consistency (true) or eventual consistency (false).
  • riak.debug - boolean, enables debug mode. This displays all the properties (specified or defaults) when a benchmark is started. Moreover, it shows error causes whenever these occur.

Note: For more information on workloads and how to run them please see: https://github.com/brianfrankcooper/YCSB/wiki/Running-a-Workload

1 As specified in the riak.properties file. See parameters configuration section for further info. ↩️

2 IMPORTANT NOTE: Currently the scan transactions are NOT SUPPORTED for the benchmarks which use the strong consistency model! However this will not cause the benchmark to fail, it simply won't perform any scan transaction at all. These latter will immediately return with a Status.NOT_IMPLEMENTED code. ↩️