README.md in worker_roulette-0.0.12 vs README.md in worker_roulette-0.1.0

- old
+ new

@@ -2,49 +2,100 @@ WorkerRoulette is designed to allow large numbers of unique devices, processes, users, or whatever communicate over individual channels without messing up the order of their messages. WorkerRoulette was created to solve two otherwise hard problems. First, other messaging solutions (I'm looking at you RabbitMQ) are not designed to handle very large numbers of queues (30,000+); because WorkerRoulette is built on top of Redis, we have successfully tested it running with millions of queues. Second, other messaging systems assume one (or more) of three things: 1. Your message consumers know the routing key of messages they are interested in processing; 2. Your messages can wait so that only one consumer is processed at a time; 3. You love to complicated write code to put your messages back in order. Sometimes, none of these things is true and that is where WorkerRoulette comes in. WorkerRoulette lets you have thousands of competing consumers (distrubted over as many machines as you'd like) processing ordered messages from millions of totally unknown message providers. It does all this and ensures that the messages sent from each message provider are processed in exactly the order it sent them. -## General Usage +## Asynchronous Api (Evented) ```ruby size_of_connection_pool = 100 -redis_config = {host: 'localhost', timeout: 5, db: 1} #Start it up -WorkerRoulette.start(size_of_connection_pool, redis_config) +#the config takes size for the connection pool size, evented to specify which api to use, then the normal redis config +WorkerRoulette.start(size: size_of_connection_pool, evented: false, host: 'localhost', timeout: 5, db: 1) #Enqueue some work sender_id = :shady +foreman = WorkerRoulette.a_foreman(sender_id) + +foreman.enqueue_work_order(['hello', 'foreman']) do |msg| + puts "work enqueued #{msg}" +end + +#Pull it off +tradesman = WorkerRoulette.a_tradesman +tradesman.work_orders! do |work_orders| #drain the queue of the next available sender + work_orders.first # => ['hello', 'foreman'] +end + +#Enqueue some more from someone else +other_sender_id = :the_real_slim_shady +other_foreman = WorkerRoulette.a_foreman(other_sender_id) +other_foreman.enqueue_work_order({'can you get me' => 'the number nine?'}) do |msg| + puts "work enqueued #{msg}" +end + +#Have the same worker pull that off +tradesman.work_orders! do |work_orders| #drain the queue of the next available sender + work_orders.first # => {'can you get me' => 'the number nine?'} +end + +#Have your workers wait for work to come in +on_subscribe_callback = -> do + puts "Huzzah! We're listening!" + foreman.enqueue_work_order('will I see you later?') + foreman.enqueue_work_order('can you give me back my dime?') +end + + +#And they will pull it off as it comes, as long as it comes +#NB: This is NOT a blocking operation, so no worries +tradesman.wait_for_work_orders(on_subscribe_callback) do |work_orders, message, channel| #drain the queue of the next available sender + work_orders # => ['will I see you later', 'can you give me back my dime?'] + message # => 'new_job_ready' + channel # => '' #the name of the channel the message was published on, if one was used -- see below +end +``` + +## Synchronous Api +```ruby +size_of_connection_pool = 100 + +#Start it up +#the config takes size for the connection pool size, evented to specify which api to use, then the normal redis config +WorkerRoulette.start(size: size_of_connection_pool, evented: false, host: 'localhost', timeout: 5, db: 1) + +#Enqueue some work +sender_id = :shady foreman = WorkerRoulette.foreman(sender_id) foreman.enqueue_work_order(['hello', 'foreman']) #Pull it off tradesman = WorkerRoulette.tradesman -messages = tradesman.work_orders! #drain the queue of the next available sender -messages.first # => ['hello', 'foreman'] +work_orders = tradesman.work_orders! #drain the queue of the next available sender +work_orders.first # => ['hello', 'foreman'] #Enqueue some more from someone else other_sender_id = :the_real_slim_shady other_foreman = WorkerRoulette.foreman(other_sender_id) other_foreman.enqueue_work_order({'can you get me' => 'the number nine?'}) #Have the same worker pull that off -messages = tradesman.work_orders! #drain the queue of the next available sender -messages.first # => {'can you get me' => 'the number nine?'} +work_orders = tradesman.work_orders! #drain the queue of the next available sender +work_orders.first # => {'can you get me' => 'the number nine?'} #Have your workers wait for work to come in on_subscribe_callback = -> do puts "Huzzah! We're listening!" foreman.enqueue_work_order('will I see you later?') foreman.enqueue_work_order('can you give me back my dime?') end #And they will pull it off as it comes, as long as it comes -#(This is a blocking operation, so it is best in Threads or EventMachine.next_tick) -tradesman.wait_for_work_orders(on_subscribe_callback) do |messages| #drain the queue of the next available sender - messages # => ['will I see you later', 'can you give me back my dime?'] +#NB: This IS a blocking operation +tradesman.wait_for_work_orders(on_subscribe_callback) do |work_orders| #drain the queue of the next available sender + work_orders # => ['will I see you later', 'can you give me back my dime?'] end ``` ## Channels You can also namespace your work orders over a channel, in case you have several sorts of competing consumers who should not step on each other's toes: @@ -63,14 +114,31 @@ tradesman.wait_for_work_orders(publish) do |work| work.to_s.should match("some old fashion work") #only got the work from the good foreman work.to_s.should_not match("evil") #channels let us ignore the other's evil orders tradesman.unsubscribe end - ``` +## Performance +Running the performance tests on my laptop, the numbers break down like this: +### Async Api + - Manual: ~4200 read-write round-trips / second + - Pubsub: ~5200 read-write round-trips / second + +### Sync Api + - Manual: ~1600 read-write round-trips / second + - Pubsub: ~2000 read-write round-trips / second + +To run the perf tests yourself run `bundle exec spec:perf` + +## Redis Pubsub and Polling +The `wait_for_work_orders` method works using Redis' pubsub mechanism. The advantage to this is that it is very fast and minimizes network traffic. The downside is that Redis' pubsub implementation is 'fire and forget', so any subscribers who are not listening at the moment the message is published will miss it. In order to compensate for this, WorkerRoulette's Async Api creates a backup timer (using EM.add_periodic_timer) that will poll redis every 20-25 seconds for new work. Since the timer is reset every time new work comes in, if you have an active publisher, the timer may never need to fire. It only serves a backup to make sure no work is left waiting in the queues because of network problems. Since there is no one polling mechanism that works for all situations in a synchrounous environment, this feature is only available through the Async Api. + +## Redis Version +WorkerRoulette uses Redis' lua scripting feature to acheive such high throughput and therefore requires a version of Redis that supports lua scripting (>= Redis 2.6) + ##Caveat Emptor -While WorkerRoulette does promise to keep the messages of each consumer processed in order by competing consumers, it does NOT guarantee the order in which the queues themselves will be processed. In general, work is processed in a FIFO order, but for performance reasons this has been left a loose FIFO. For example, if Abdul enqueues some ordered messages ('1', '2', and '3') and then so do Mark and Wanda, Mark's messages may be processed first, then it would likely be Abdul's, and then Wanda's. However, even though Mark jumped the line, Abdul's messages will still be processed the order he enqueued them ('1', '2', then '3'). +While WorkerRoulette does promise to keep the messages of each consumer processed in order by competing consumers, it does NOT guarantee the order in which the queues themselves will be processed. In general, work is processed in a FIFO order, but for performance reasons this has been left a loose FIFO. For example, if Abdul enqueues some ordered messages ('1', '2', and '3') and then so do Mark and Wanda, Mark's messages may be processed first, then it would likely be Abdul's, and then Wanda's. However, even though Mark jumped the line, Abdul's messages will still be processed in the order he enqueued them ('1', '2', then '3'). ## Installation Add this line to your application's Gemfile: