README.md in redis_failover-0.4.0 vs README.md in redis_failover-0.5.0

- old
+ new

@@ -1,41 +1,45 @@ -# Automatic Redis Failover Client/Server +# Automatic Redis Failover Client / Node Manager [![Build Status](https://secure.travis-ci.org/ryanlecompte/redis_failover.png?branch=master)](http://travis-ci.org/ryanlecompte/redis_failover) -Redis Failover attempts to provides a full automatic master/slave failover solution for Ruby. Redis does not provide +redis_failover attempts to provides a full automatic master/slave failover solution for Ruby. Redis does not provide an automatic failover capability when configured for master/slave replication. When the master node dies, a new master must be manually brought online and assigned as the slave's new master. This manual switch-over is not desirable in high traffic sites where Redis is a critical part of the overall architecture. The existing standard Redis client for Ruby also only supports configuration for a single Redis server. When using master/slave replication, it is desirable to have all writes go to the master, and all reads go to one of the N configured slaves. -This gem attempts to address both the server and client problems. A redis failover server runs as a background -daemon and monitors all of your configured master/slave nodes. When the server starts up, it -automatically discovers who is the master and who are the slaves. Watchers are setup for each of +This gem attempts to address these failover scenarios. A redis failover Node Manager daemon runs as a background +process and monitors all of your configured master/slave nodes. When the daemon starts up, it +automatically discovers the current master/slaves. Background watchers are setup for each of the redis nodes. As soon as a node is detected as being offline, it will be moved to an "unavailable" state. If the node that went offline was the master, then one of the slaves will be promoted as the new master. All existing slaves will be automatically reconfigured to point to the new master for replication. All nodes marked as unavailable will be periodically checked to see if they have been brought back online. -If so, the newly available nodes will be configured as slaves and brought back into the list of live -servers. Note that detection of a node going down should be nearly instantaneous, since the mechanism +If so, the newly available nodes will be configured as slaves and brought back into the list of available +nodes. Note that detection of a node going down should be nearly instantaneous, since the mechanism used to keep tabs on a node is via a blocking Redis BLPOP call (no polling). This call fails nearly immediately when the node actually goes offline. To avoid false positives (i.e., intermittent flaky -network interruption), the server will only mark a node as unavailable if it fails to communicate with +network interruption), the Node Manager will only mark a node as unavailable if it fails to communicate with it 3 times (this is configurable via --max-failures, see configuration options below). This gem provides a RedisFailover::Client wrapper that is master/slave aware. The client is configured -with a single host/port pair that points to redis failover server. The client will automatically -connect to the server to find out the current state of the world (i.e., who's the current master and -who are the current slaves). The client also acts as a load balancer in that it will automatically +with a list of ZooKeeper servers. The client will automatically contact the ZooKeeper cluster to find out +the current state of the world (i.e., who is the current master and who are the current slaves). The client +also sets up a ZooKeeper watcher for the set of redis nodes controlled by the Node Manager daemon. When the daemon +promotes a new master or detects a node as going down, ZooKeeper will notify the client near-instantaneously so +that it can rebuild its set of Redis connections. The client also acts as a load balancer in that it will automatically dispatch Redis read operations to one of N slaves, and Redis write operations to the master. -If it fails to communicate with any node, it will go back and ask the server for the current list of -available servers, and then optionally retry the operation. +If it fails to communicate with any node, it will go back and fetch the current list of available servers, and then +optionally retry the operation. ## Installation +redis_failover has an external dependency on ZooKeeper. You must have a running ZooKeeper cluster already available in order to use redis_failover. ZooKeeper provides redis_failover with its high availability and data consistency between Redis::Failover clients and the Node Manager daemon. Please see the requirements section below for more information on installing and setting up ZooKeeper if you don't have it running already. + Add this line to your application's Gemfile: gem 'redis_failover' And then execute: @@ -44,72 +48,71 @@ Or install it yourself as: $ gem install redis_failover -## Server Usage +## Node Manager Daemon Usage -The redis failover server is a simple process that should be run as a background daemon. The server supports the +The Node Manager is a simple process that should be run as a background daemon. The daemon supports the following options: - Usage: redis_failover_server [OPTIONS] - -P, --port port Server port - -p, --password password Redis password - -n, --nodes nodes Comma-separated redis host:port pairs - --max-failures count Max failures before server marks node unavailable (default 3) + Usage: redis_node_manager [OPTIONS] + -p, --password password Redis password (optional) + -n, --nodes redis nodes Comma-separated redis host:port pairs (required) + -z zookeeper servers, Comma-separated zookeeper host:port pairs (required) + --zkservers + --znode-path path Znode path override for storing redis server list (optional) + --max-failures count Max failures before manager marks node unavailable (default 3) -h, --help Display all options -To start the server for a simple master/slave configuration, use the following: +To start the daemon for a simple master/slave configuration, use the following: - redis_failover_server -P 3000 -n localhost:6379,localhost:6380 + redis_node_manager -n localhost:6379,localhost:6380 -z localhost:2181,localhost:2182,localhost:2183 -The server will automatically figure out who is the master and who is the slave upon startup. Note that it is -a good idea to monitor the redis failover server process with a tool like Monit to ensure that it is restarted +The Node Manager will automatically discover the master/slaves upon startup. Note that it is +a good idea to monitor the redis Node Manager daemon process with a tool like Monit to ensure that it is restarted in the case of a failure. ## Client Usage -The redis failover client must be used in conjunction with a running redis failover server. The -client supports various configuration options, however the two mandatory options are the host -and port of the redis failover server: +The redis failover client must be used in conjunction with a running Node Manager daemon. The +client supports various configuration options, however the only mandatory option is the list of +ZooKeeper servers: - client = RedisFailover::Client.new(:host => 'localhost', :port => 3000) + client = RedisFailover::Client.new(:zkservers => 'localhost:2181,localhost:2182,localhost:2183') The client actually employs the common redis and redis-namespace gems underneath, so this should be a drop-in replacement for your existing pure redis client usage. The full set of options that can be passed to RedisFailover::Client are: - :host - redis failover server host (required) - :port - redis failover server port (required) - :password - optional password for redis nodes - :namespace - optional namespace for redis nodes - :logger - optional logger override + :zkservers - comma-separated zookeeper host:port pairs (required) + :znode_path - the Znode path override for redis server list (optional) + :password - password for redis nodes (optional) + :namespace - namespace for redis nodes (optional) + :logger - logger override (optional) :retry_failure - indicate if failures should be retried (default true) :max_retries - max retries for a failure (default 3) ## Requirements -redis_failover is actively tested against MRI 1.9.2/1.9.3. Other rubies may work, although I don't actively test against them. 1.8 is not supported. +- redis_failover is actively tested against MRI 1.9.2/1.9.3. Other rubies may work, although I don't actively test against them. 1.8 is not supported. +- redis_failover requires a ZooKeeper service cluster to ensure reliability and data consistency. ZooKeeper is very simple and easy to get up and running. Please refer to this [Zookeper Guide](http://zookeeper.apache.org/doc/trunk/zookeeperStarted.html) to get up and running quickly if you don't already have ZooKeeper as a part of your environment. ## Considerations -Note that by default the failover server will mark slaves that are currently syncing with their master as "available" based on the configuration value set for "slave-serve-stale-data" in redis.conf. By default this value is set to "yes" in the configuration, which means that slaves still syncing with their master will be available for servicing read requests. If you don't want this behavior, just set "slave-serve-stale-data" to "no" in your redis.conf file. +- Note that by default the Node Manager will mark slaves that are currently syncing with their master as "available" based on the configuration value set for "slave-serve-stale-data" in redis.conf. By default this value is set to "yes" in the configuration, which means that slaves still syncing with their master will be available for servicing read requests. If you don't want this behavior, just set "slave-serve-stale-data" to "no" in your redis.conf file. -## Limitations +- Note that it's still possible for the RedisFailover::Client instances to see a stale list of servers for a very small window. In most cases this will not be the case due to how ZooKeeper handles distributed communication, but you should be aware that in the worst case the client could write to a "stale" master for a small period of time until the next watch event is received by the client via ZooKeeper. -The redis_failover gem currently has limitations. It currently does not gracefully handle network partitions. In cases where -the network splits, it is possible that more than one master could exist until the failover server sees all of the nodes again. -If the failover client gets split from the failover server, it's also possible that it could be talking to a stale master. This would get corrected once the client could successfully reach the failover server again to fetch the latest set of master/slave nodes. This is a limitation that I hope to address in a future release. The gem can not guarantee data consistencies until this is addressed. - ## TODO -- Integrate with ZooKeeper for full reliability / data consistency. -- Rework specs to work against a set of real Redis nodes as opposed to stubs. +- Rework specs to work against a set of real Redis/ZooKeeper nodes as opposed to stubs. ## Resources -To learn more about Redis master/slave replication, see the [Redis documentation](http://redis.io/topics/replication). +- To learn more about Redis master/slave replication, see the [Redis documentation](http://redis.io/topics/replication). +- To learn more about ZooKeeper, see the official [ZooKeeper](http://zookeeper.apache.org/) site. ## License Please see LICENSE for licensing details.