How can you make a cluster run a task only once?

Question

If you had a task that you wanted to run only once on a cluster of servers, at a regular interval what would be the best way of achieving this? The definition of cluster in this case is 2 or more identical servers with distributed sessions sitting behind a load balancer.

Use Case: You have a task that is expensive to run that should only be run once per X hours. This job could for instance iterates over a bunch of records and updates their status.

Worst case scenario is that having the job run twice invalidates your data.
Best case scenario is that the job utilises resources on all your servers.

Requirements Summary:

The job must still run even if one of the nodes are down.
The job must only be run once per schedule.
If multiple jobs are scheduled at the same time or at overlapping times that the number of running jobs is distributed equally between the servers.
The machines must have the same code base and be synchronised via NTP.
The configuration may differ between node and node, by environment variables.
The job has to start on time or within a given interval of the assigned time. (say 5 minutes for example)

Possible solutions

Set one node as the master node, this doesn't work as it violates 1 above.
Make a request that the load balancer balances to kick off the job. Unfortunatly this has the side effect that if you have multiple jobs running at the same time they may all be run by the same machine.

This would have to run in Java, in a servlet container. However it isn't coding the jobs I'm looking for.

Surely this is a solved problem with known best solution.

Related question. https://stackoverflow.com/questions/5949038/schedule-job-executes-twice-on-cluster

This isn't a duplicate as the solution is insufficient as per those 5 requirements given above. The most upvoted solution suffers from a race problem, and the second solution violates requirement 3

score 19 · Accepted Answer · answered May 15 '11 at 07:37

Do you have a shared database? I've done this using a database as the arbiter in the past.

Basically, each "job" is represented as a row in the database. You schedule a job by adding a row to the database with the time you want it to run then each server does:

SELECT TOP 1 *
FROM jobs
WHERE state = 'NotRun'
ORDER BY run_time ASC

That way, they'll all pick the job that is scheduled to run next. They all sleep so that they wake up when the job is actually supposed to run. Then, they all do this:

UPDATE jobs
SET state = 'Running'
WHERE job_id = :id
  AND state = 'NotRun'

Where :id is the identifier of the job you got in the step above. Because the update is atomic, only one of the servers will actually update the row, you can check the database's "number of rows updates" status code to determine whether you were the server that actually updated the row, and therefore whether you are the server that gets to run the job.

If you didn't "win" and you're not running the job, just go back to step 1 immediately. If you did "win", schedule the job to execute in another thread, then wait a couple of seconds before going back to step 1. That way, servers that didn't get the job this time are more likely to pick up a job that's scheduled to run immediately.

What isloation level are you using here? Read committed or serialize? — Maverick Riz, Dec 22 '15 at 04:33
@MaverickRiz it does not really matter, since there's no long running transaction. The `UPDATE` itself determines whether a job is ready to be executed on a given instance or not. — Andy, Nov 23 '20 at 10:03

score 2 · Answer 2 · answered Aug 31 '16 at 14:21

Several app servers have a feature for "cluster wide singleton services".

For example Weblogic has a Singleton Service feature that is configured via the web admin console.

You have to write a class that implements weblogic.cluster.singleton.SingletonService and use it to declare the service in the admin console. The cluster takes care of instantiating the class and notifying you when the service is started or stopped. The SingletonService interface has an activate() and a deactivate() method.

Weblogic calls activate() when it first brings up the service on one of the nodes of the cluster. If the selected node goes down, the admin server "moves" the service on a different server, calling activate() there.

http://docs.oracle.com/cd/E12839_01/apirefs.1111/e13952/taskhelp/clusters/ConfigureSingletonService.html

How can you make a cluster run a task only once?

2 Answers2