When using v10.15.1 we are seeing processes exiting with:
Error: shutdown ENOTCONN
File "net.js", line 364, col 25, in TLSSocket.Socket._final
File "_stream_writable.js", line 612, col 10, in callFinal
File "internal/process/next_tick.js", line 63, col 19, in process._tickCallback
This appears in services connecting to Amazon RDS instances via TLS. We are using
https://registry.npmjs.org/mysql/-/mysql-2.15.0.tgz
and https://registry.npmjs.org/knex/-/knex-0.14.4.tgz for connection pooling.
We do not see these errors on earlier node image (I have tested v10.15.0, v10.14.2 and v10.0.0).
Could this be impacted by #24290 and related commits?
The errors could possibly be bettered handled higher up (by knex or the mysql client), but this does seem to be a significant change in behaviour.
I've run a git bisect between v1.15.0 and v1.15.1. c84b4204571e8f87fae300e4ab331e688912fb10 seems to be the cause of the problem.
@addaleax any thoughts on this?
@tcolgate I don’t suppose you have an easy reproduction available? Does this also occur for you with v11.x (v11.4.0 and above)?
I don't have a cut down test at the moment. But I suspect the problem is with idle TLS connections in the knex pool. It's also awkward to test as it shows up after "about 5 minutes" in a lightly loaded app. I'll try and reduce the problem down a bit though.
Unfortunately the app wont install with node 11, npm install fails. I can try again if I get a cut down test working.
Sorry, I'm not really a node developer :/
We're running into the same issue. I don't have a fully runnable repro that I can share, but it reproduces when the connection is closed while the query is still in flight. It looks roughly like this:
const connection = mysql.createConnection(...);
await connection.connect();
connection.query('SOME QUERY', []); // Important: missing await
await util.setTimeoutAsync(100 * Math.random()); // wait a bit; not sure if this is necessary.
connection.end();
There is just [email protected], no pooling.
@tcolgate @syrnick Would either of you be able to try a patch like this?
diff --git a/lib/net.js b/lib/net.js
index b649c6779a94..71feba0cfd10 100644
--- a/lib/net.js
+++ b/lib/net.js
@@ -36,7 +36,8 @@ const {
const assert = require('internal/assert');
const {
UV_EADDRINUSE,
- UV_EINVAL
+ UV_EINVAL,
+ UV_ENOTCONN
} = internalBinding('uv');
const { Buffer } = require('buffer');
@@ -359,7 +360,7 @@ Socket.prototype._final = function(cb) {
req.callback = cb;
var err = this._handle.shutdown(req);
- if (err === 1) // synchronous finish
+ if (err === 1 || err === UV_ENOTCONN) // synchronous finish
return afterShutdown.call(req, 0);
else if (err !== 0)
return this.destroy(errnoException(err, 'shutdown'));
That seems like a cleaner solution to me, but either way, I’m a bit wary of adding or re-adding code that we do not test for and that should not be triggered in the first place…
I'll try some time this week. It seems like it's either
We're also seeing this using [email protected]
in node v10.15.3 and not on node v10.15.0
We cannot update to node v12 at the moment so it would be great if this bug can be looked into.
We are experiencing this issue as well when using knex
and the latest version of node v10.15.3
. Would really appreciate anytime that could be put in on this issue. For now we have reverted to Node v10.15.0
as well which has resolved the issue, but unfortunately aws lambda's node v10
is stuck on v10.15.3
due to it always using the latest node v10
. We have reverted our lambdas back to Node v8.10
for now.
We are also facing this issue using knex and node 10.16. Any update on this?
Adding log trace:
[email protected]
[email protected]
require( 'knex' )({
client: client,
connection: connection,
pool: {
afterCreate: function( conn, done ) {
conn.query("select 1",function(err,response){
done(err,conn);
})
conn.on( 'error', function( err ) {
console.trace(); // Line 28
});
}
}
});
at Connection.<anonymous> (/app/knex.js:28:14)
at Connection.emit (events.js:203:15)
at Connection.EventEmitter.emit (domain.js:448:20)
at Connection._handleProtocolError (/usr/src/app/listener/node_modules/mysql/lib/Connection.js:433:8)
at Protocol.emit (events.js:198:13)
at Protocol.EventEmitter.emit (domain.js:448:20)
at Protocol._delegateError (/usr/src/app/listener/node_modules/mysql/lib/protocol/Protocol.js:392:10)
at Protocol.handleNetworkError (/usr/src/app/listener/node_modules/mysql/lib/protocol/Protocol.js:365:10)
at Connection._handleNetworkError (/usr/src/app/listener/node_modules/mysql/lib/Connection.js:428:18)
at TLSSocket.<anonymous> (/usr/src/app/listener/node_modules/mysql/lib/Connection.js:313:20)
at TLSSocket.emit (events.js:198:13)
at TLSSocket.EventEmitter.emit (domain.js:448:20)
at TLSSocket._tlsError (_tls_wrap.js:612:8)
at TLSSocket.emit (events.js:198:13)
at TLSSocket.EventEmitter.emit (domain.js:448:20)
at emitErrorNT (internal/streams/destroy.js:91:8)
I am also experiencing this issue on ghost 2.22.0 and node 10.16.0 with Amazon RDS:
events.js:174
throw er; // Unhandled 'error' event
Error: shutdown ENOTCONN
at TLSSocket.Socket._final (net.js:363:25)
at callFinal (_stream_writable.js:617:10)
at process._tickCallback (internal/process/next_tick.js:63:19)
Emitted 'error' event at:
at Connection._handleProtocolError (/var/app/current/node_modules/mysql/lib/Connection.js:425:8)
at Protocol.emit (events.js:198:13)
at Protocol._delegateError (/var/app/current/node_modules/mysql/lib/protocol/Protocol.js:390:10)
at Protocol.handleNetworkError (/var/app/current/node_modules/mysql/lib/protocol/Protocol.js:363:10)
at Connection._handleNetworkError (/var/app/current/node_modules/mysql/lib/Connection.js:420:18)
at TLSSocket.(/var/app/current/node_modules/mysql/lib/Connection.js:305:20)
at TLSSocket.emit (events.js:198:13)
at TLSSocket._tlsError (_tls_wrap.js:612:8)
at TLSSocket.emit (events.js:198:13)
at emitErrorNT (internal/streams/destroy.js:91:8)
at emitErrorAndCloseNT (internal/streams/destroy.js:59:3)
at process._tickCallback (internal/process/next_tick.js:63:19)
Also seeing this issue (with AWS Lambda 10.x, RDS and [email protected]):
db error { Error: shutdown ENOTCONN
at TLSSocket.Socket._final (net.js:361:25)
at callFinal (_stream_writable.js:612:10)
at process._tickCallback (internal/process/next_tick.js:63:19)
errno: 'ENOTCONN',
code: 'ENOTCONN',
syscall: 'shutdown',
fatal: true }
This error is killing me. I was forces to return to Node version 10.15.0 just to keep things working.
I would really like to move to 12.9.1, but I am not stuck and can not take advantage of improvements and new features until this is fixed.
We tried with 12.9.1 and the bug is still there.
Just to reiterate, anybody who can either a) provide a reproduction or b) confirm that the patch in https://github.com/nodejs/node/issues/26315#issuecomment-471173736 solves the issue is likely to get this resolved really, really fast.
I ran into this issue as well. I went ahead and applied the patch mentioned in the comment, and built node from master (+ the patch). As expected, it looks like the patch resolved the error. If I switch back to node 12.x, the issue returns immediately. Let me know if you want me to throw a project together to attempt reproduction.
Some notes:
Now that Richard has verified the fix, can we get this in? There have already been two releases since Richard verified. Can we _please_ get it in the next release?
We have also been experiencing this issue in our app. Downgrading to 10.15.0 resolved the issue. We confirmed the bug exists in all newer versions. Any update on getting this fixed in latest release?
I’ve opened https://github.com/nodejs/node/pull/29912 with the patch above.
Fwiw, anybody can open PRs here, you don’t need to wait for somebody else to do it, especially when it has been confirmed to resolve an issue.
@addaleax Thanks for raising that PR. Quite a few people were having issues with AWS Lambda and node10.x
. So what is the process for getting this into the next v10.x
release (ideally the upcoming v10.17.x
)? Happy to raise a PR against another branch if required.
@anthonynovatsis It should get picked up automatically; our rules generally say that something must have lived in a current release for 2 weeks before it is backported to LTS, but I’ve commented on #29875 so that maybe we can skip that part for the patch, and opened a backport PR @ https://github.com/nodejs/node/pull/29968.
Thanks for that @addaleax. Much appreciated..!
How can we know what versions sill have this fix?? Is there an overall bug # to build # lookup table?
@intervalia You can check the releases tab to verify which commits are included in which releases. This particular patch was landed in v10.17.0 and in the latest v13 releases. Thank you @addaleax for pushing this through!
What's the process for getting it into v12.x LTS branch? I checked that (https://github.com/nodejs/node/blob/v12.x/lib/net.js is unpatched)
@syrnick Yeah, thanks for pointing that out … I think this is unfortunate timing because it was merged so close to the Node.js 12 LTS release. I’ve added it to the v12.x-staging
branch, meaning it will go into the next v12.x release; that should be unprolematic given that this has already been accepted for v10.x.
Most helpful comment
I ran into this issue as well. I went ahead and applied the patch mentioned in the comment, and built node from master (+ the patch). As expected, it looks like the patch resolved the error. If I switch back to node 12.x, the issue returns immediately. Let me know if you want me to throw a project together to attempt reproduction.
Some notes:
10.15.0
, but is in10.15.1+
.