A very fast standards-compliant ES2015 Promise library for node.
There are several fast promise implementations out there, like
bluebird
, but they all add a bunch of stuff that's not in the
ES2015 Promise spec. Further, as they've added more and more
features they've grown so complex that they can't be easily audited
against the ES2015 spec (or the official test262
test suite).
The babybird
library is a stripped-down "just ES2015" Promise
implementation, which passes the Promises/A+ and test262
test
suites. It is very competitive with the performance of bluebird
,
without having to give up standards compliance. On the doxbee
benchmark running under node 5.1.0, babybird
is 2.4 times faster
than native Promises, 4 times faster than the Promise implementation
in core-js
, and almost 9 times faster than the Promise
implementation in es6-shim
. The performance improvement is even
greater when running under node 0.10.
Further, babybird
supports subclassing (as all ES2015 Promise
implementations ought). This means that if you'd like additional
bells and whistles on your Promises, you can add then via subclassing
and without stomping on the global Promise. In fact, the prfun
library provides much the same feature set as bluebird
, but with a
clean separation of concerns. Further, prfun
runs on top of any
ES2015-compliant Promise implementation, so if babybird
is someday
supplanted, you can just swap out the core Promise implementation
underneath prfun
without having to update any of your uses of the
core ES2015 or extended prfun
API.
npm install babybird
var Promise = require('babybird');
The babybird
library plays very nicely with prfun
, if you'd like
a few bells and whistles with your library. I recommend creating a
new module, named (say) promise.js
, with these contents:
module.exports = require('prfun/wrap')(require('babybird'));
and then using the wrapped promises this way:
var Promise = require('./promise.js');
These benchmarks are derived from the benchmarks included with
bluebird
. A few of the bluebird test cases have been forked to
add "fair" versions, since bluebird obtained some of its speed by
using a faster promisify
method than that provided to other
promise implementations, and by using a few bluebird
-specific
APIs which appeared to be tuned for the benchmark.
You can reproduce these results using npm run bench
.
results for 20000 parallel executions, 1 ms per I/O op
file time(ms) slowdown memory(MB)
callbacks-baseline.js 819 0.48 32.99
promises-bluebird.js 1383 0.81 50.59
promises-bluebird-fair.js 1715 1.00 39.02
promises-cscott-babybird-noall.js 1870 1.09 41.19
promises-cscott-babybird.js 1925 1.12 61.81
promises-cscott-babybird-prfun.js 1991 1.16 48.60
promises-then-promise-es6.js 3084 1.80 64.54
promises-then-promise.js 3112 1.81 64.39
promises-paulmillr-es6shim.js 12475 7.27 94.20
promises-zloirock-corejs.js 18695 10.90 94.11
Platform info:
Linux 4.2.0-1-amd64 ia32
Node.JS 0.10.40
V8 3.14.5.9
Intel(R) Core(TM) i7 CPU L 640 @ 2.13GHz × 4
results for 20000 parallel executions, 1 ms per I/O op
file time(ms) slowdown memory(MB)
callbacks-baseline.js 826 0.53 29.29
promises-bluebird-generator.js 1105 0.71 25.49
promises-bluebird-generator-fair.js 1113 0.72 28.27
promises-cscott-babybird-prfun-generator.js 1202 0.77 33.68
promises-bluebird-fair.js 1553 1.00 51.38
promises-cscott-babybird-noall.js 1623 1.05 65.25
promises-bluebird.js 1643 1.06 43.66
promises-cscott-babybird-prfun.js 1707 1.10 72.33
promises-cscott-babybird.js 1724 1.11 72.82
promises-then-promise-es6.js 2985 1.92 106.96
promises-then-promise.js 3004 1.93 106.69
promises-ecmascript6-native.js 3770 2.43 176.96
promises-zloirock-corejs.js 6298 4.06 155.89
promises-paulmillr-es6shim.js 13892 8.95 227.61
Platform info:
Linux 4.2.0-1-amd64 ia32
Node.JS 5.1.0
V8 4.6.85.31
Intel(R) Core(TM) i7 CPU L 640 @ 2.13GHz × 4
The babybird
implementation began with a fairly faithful mechanical
translation of the ES6 Promise spec into JavaScript. In this
section I'm going to list the optimizations which were then applied,
roughly ordered so that those with the largest effect on performance
come first. This is only a very rough ordering, however.
- Use the
asap
package and reuse task objects instead of callingsetImmediate
directly. This avoids almost all allocation when dispatching asynchronous handlers. (Commiteb1f4ea7
and3216461f
) - Avoid creating a new
PromiseCapability
inPromise#then
if the return value will be discarded --- for example, in thePromiseResolveThenableJob
(when aPromise
is resolved to anotherPromise
) and in the implementation ofPromise.all
andPromise.race
. Since creating aPromiseCapability
calls a user-suppliedPromise
constructor, and this call is user-observable, we can only perform this optimization when we are certain the constructor does not have side effects. We call this the "then0
optimization". (Commit919d7aaf
,f717a7b1
,3a1bf92a
) - Further optimize
Promise#then
by bypassing the standardPromise
constructor, which requires the creation of three separate closures (theexecutor
function and separateresolve
andreject
functions). When we know that the constructor is safe, we use an internal-only constructor which uses defaultresolve
/reject
implementations without requiring allocation. The nativePromise
implementation in v8 contains a version of this optimization. We call this the "PromiseCapability
optimization". (Commitb6eea14d
) - Use a free list to reuse
PromiseCapability
objects and avoid unnecessary allocation insidePromise#then
. (Commit9f27ecd4
) - Inline the reaction array into the Promise object, with special
attention to the first element. Most Promises only get a single
handler registered, so we can avoid the array allocations entirely
for this common case. We further combined the
state
andreaction length
fields to reduce memory further. We call this the "Promise fields optimization". (Commit6cad108a
anddd0c7e4d
) - Separate the
createResolvingFunctions
implementation for thePromise
constructor andPromise#then
. For the constructor, reuse thestate
field to track thealreadyResolved
state, to avoid allocating an extra boolean for this common case. (Commited96b474
) - Specialize
Promise.all
andPromise.race
if the argument is a true array, avoiding the overhead of usingIterator
objects. - Avoid the use of
try
/catch
, since their presence deoptimizes the entire function containing them. Where necessitated by the ES6 specification (calling user-provided handlers, for example), encapsulate thetry
/catch
in separate functions to contain the scope of deoptimization. (Commit28a151ae
) - Additional micro-optimizations, many informed by IRHydra2: avoid
the use of
Object#toString
on the fast path, split some functions to better allow call site specialization, avoid some polymorphism, avoid use ofarguments
, and improve the implementation of theTypeIsObject
check.
In addition, we made some improvements to prfun
's Promise.async
implementation to take advantage of the then0
optimization (commit
0e7dd8ce
). This allows even greater performance when writing
async code using generators, as it avoids the necessity of creating
an extra Promise
at every yield
.
The performance implications of the standard Promise
constructor
and field layout were raised by the author of bluebird
on
StackExchange.
This explanation is somewhat incomplete: the performance advantage
of bluebird's promisify
is somewhat overstated, and the
"then0
" and "PromiseCapability
" optimizations are not mentioned---but
these prove to be very significant. It appears that the best
way to optimize a Promise
is avoid creating it entirely (the
"then0
" optimization). If you have to create it, avoid the
creation of three closures and a PromiseCapability
object
if you can (the "PromiseCapability
" optimization). And then
finally, if you must, at least take care to avoid creating
unnecessary reaction arrays (the "Promise fields" optimization).
It is also the case that avoiding the naive use of setImmediate
confers a significant performance advantage, which I don't
believe is widely discussed.
The primary limitation of the then0
and PromiseCapability
optimizations is that they require the Promise
subclass'
constructor to be side-effect-free, and not to do anything with
the provided executor
function except pass it unmodified to
its superclass constructor. If you wish to use a Promise
subclass constructor which does not adhere to these rules,
then you should use require('babybird/strict-constructors')
(or require('babybird/strict')
if your Promise
subclass
is especially badly-behaved). You can then turn on these
You can then turn on these optimizations on a subclass-by-subclass
basis by setting PromiseSubclass.noSideEffects = true
where
appropriate. (The prfun
library already sets noSideEffects
on the subclass it creates.)
Copyright (c) 2015-2016 C. Scott Ananian
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.