Cluster
Don't just let your server crash on an unhandled error, finish everything you were doing first. Multiple techniques used to ensure your clients don't get socket hang-ups. Cluster support and graceful shutdown on SIGTERM too!
Install / Use
/learn @thehelp/ClusterREADME
thehelp-cluster
Don't just let your server crash on an unhandled error, finish everything you were doing first. Multiple techniques used to ensure your clients don't get socket hang-ups. Cluster support and graceful shutdown on SIGTERM too! More information about thehelp.
Node version support
0.10
Not currently testing against this platform, but it should still work.
0.11/0.12
Not testing against this platform. But I do know that because this project uses node-statsd to send statistics to statsd, you'll get crashes in your master process.
iojs (1.x/2.x/3.x)
Currently testing against the most recent version of all three of these in Travis. Go for it!
4.x/5.x/6.x
All green! It's kind of amazing that this project has required so few changes after a year and a half. I had to make some test changes to work with a iojs 1.x breaking change, but that's it!
Features
GracefulExpressclass to:- Install
domain-based capture of unhandled errors for every request, ensuring that the client always gets an error message - Shut down
expressservers gracefully: stop accepting new connections, close keepalive connections, and return 503 if any requests leak through inProcessTestmode forsupertest-based in-process endpoint testing- Tested on
express3.xand4.xand withnode0.10.30-36andiojs1.4.3,1.5.1and1.6.0(note: tests no longer run on node 0.10 or below)
- Install
Masterclass to:- Start up user-provided set of worker processes via
cluster - Detect if worker processes crash too fast, then start replacements up after a delay
- Shut down process and workers gracefully, only killing worker processes if they take too long
- Start up user-provided set of worker processes via
Gracefulclass to:- Act as the process-level hub for shutdown notifications and shutdown readiness
- Listen for
SIGTERMsignal, start graceful shutdown process - Report the shutdown-causing error via
thehelp-last-ditch
Startupclass to:- Easily start up a cluster
- On launch of processes, increment
statsdcounters forprocess.env.THEHELP_APP_NAME + '.launches.' + (master/worker)Why? To catch frequent restarts if OS is killing your process, for example due to low memory) - Install
domainfor top-level errors in master and worker processes
- Logging options:
- This library can participate in your logging system via
thehelp-log-shim - If you're using
winston,setupLogs()will quickly set up per-process log files
- This library can participate in your logging system via
Setup
First, install the project as a dependency:
npm install thehelp-project --save-dev
Usage: Single-process
Even a single process could benefit from graceful shutdown, and it's really easy to set that up:
var cluster = require('thehelp-cluster');
cluster.Graceful.start();
process.on('uncaughtException', function(err) {
// Graceful uses thehelp-last-ditch to capture error information, then shuts down
cluster.Graceful.instance.shutdown(err);
});
var app = require('express')();
// GracefulExpress ensures all outstanding requests complete before shutdown
var gracefulExpress = new cluster.GracefulExpress();
// this wraps all incoming requests in a domain
app.use(gracefulExpress.middleware);
// this makes it easier to get the underlying node.js http server
gracefulExpress.listen(app, 3000, function() {
console.log('Server listening on port 3000');
});
Usage: Clustered
Okay, now say you want to use all of thehelp-cluster. Here's the full treatment - a cluster-based setup, with GracefulExpress installed on your server. First, create your cluster.js file:
var cluster = require('thehelp-cluster');
cluster.setupLogs();
// creates a Graceful instance for the process - Master and GracefulExpress need it
cluster.Graceful.start();
cluster({
worker: function() {
var server = require('./server');
server.start();
}
});
Anything run outside of the master/worker callbacks you pass to cluster will be run in all of your processes. So a Graceful instance is created for each process, and logging is set up too. Since the master callback wasn't provided, a basic default creates a Master instance and calls start().
Now, in the same directory, your server.js file:
var express = require('express');
var app = express();
var cluster = require('thehelp-cluster');
// creates a new Graceful instance if it hasn't been created yet in this process
// so we can run this file without cluster with full graceful shutdown support
cluster.Graceful.start();
var gracefulExpress = new cluster.GracefulExpress();
// ...very little should go before gracefulExpress - probably just logging...
app.use(gracefulExpress.middleware);
// ...register your endpoints and other middleware...
app.get('/', function(req, res) {
res.send('success!');
})
return {
// we expose the app to allow for supertest-based in-process testing
app: app,
// gracefulExpress needs a references to the http server itself; listen() makes that easy
start: function() {
gracefulExpress.listen(app, 3000, function() {
console.log('Worker listening on port 3000');
});
}
};
That's it! You've got a cluster of one worker process that will respond to SIGTERM and shut down gracefully. An unhandled error in middleware, or in an endpoint handler, or even in a callback will shut down your server gracefully after piping the error through the installed express error handler.
If you have winston installed, you'll get a separate log file for each process, like this:
logs/master-2014-10-11T01-04:54.602Z-80524.log
logs/master-2014-10-11T01-04:59.026Z-80528.log
logs/worker-2014-10-11T01-04:54.771Z-80525.log
logs/worker-2014-10-11T01-04:55.908Z-80526.log
logs/worker-2014-10-11T01-04:58.224Z-80527.log
logs/worker-2014-10-11T01-04:59.200Z-80529.log
But we can't forget tests! Here, test/endpoints.js specifies a mocha test and uses supertest to load the app from server.js and call it in-process:
var supertest = require('supertest');
var app = require('../server').app;
describe('endpoint test', function() {
var request;
before(function() {
request = supertest(app);
});
it('/ should return success', function(done) {
request
.get('/')
.expect('success')
.expect(200, done);
});
});
Now try throwing an error in an async callback to start testing out the error-catching capabilities of GracefulExpress!
Note: GracefulExpress behaves differently when run under mocha. By default, its inProcessTest option is set to true if we can detect that mocha is the main module for the current process. Errors will bubble all the way to mocha's top-level exception handler and be reported as standard test failures.
Configuration
You may be wondering about the knobs you can turn in this simple use case. First, two environment variables:
{
"THEHELP_NUMBER_WORKERS": "1",
"THEHELP_LOGS_DIR": "logs directory; defaults to ./logs/"
}
Error reporting
Next, by default both Graceful (for exceptions delivered by a shutdown() call) and Startup (if a Graceful instance cannot be found) use thehelp-last-ditch to save exceptions. Take a look at the documentation for that - you'll likely want to set the THEHELP_CRASH_LOG environment variable.
You can also provide your own customized LastDitch with SMS/email notifications turned on. Or, you can go further and provide a totally custom messenger callback of the form function(err, options, cb).
Logging
You'll also want to look at the documentation for thehelp-log-shim, which is used for logging. Essentially, this library will look for logging libraries your project already has installed, and will use that. If you don't like this, you can turn it off!
Going deeper
In more complex scenarios, say for example you're responding to incoming socket.io messages, you can register for shutdown notifications and delay shutdown like this:
var cluster = require('thehelp-cluster');
// this uses the Graceful instance already in place for this process
var graceful = cluster.Graceful.instance;
graceful.on('shutdown', function() {
// start shutting down all active socket.io connections
});
graceful.addCheck(function() {
// return true if ready to shut down
// called frequently when Graceful wants to shut down
})
Take a look at how Master and GracefulExpress delegate to Graceful for more detail. Graceful has a number of configuration options as well, like how long to wait for not-yet-ready addCheck() functions before shutting down anyway.
Note: For a complete socket.io example, check out test/scenarios/socket.io.js.
Detailed Documentation
Detailed docs be found at this project's GitHub Pages, thanks to groc: http://thehelp.github.io/cluster/src/server/index.html
A note on domains
[Node.js `domain
