Javalin: Async implementation feedback

Created on 14 Apr 2018 · 10Comments · Source: tipsy/javalin

We just released async support in Javalin 1.6.0!

However, I have a feeling that the current implementation isn't perfect. Please use this thread to suggest improvements, or just to complain (then we'll think up improvements).

The current (v1.6.0) implementation:

// usage: 
app.get("/") { ctx -> ctx.result(string) } // blocking
app.get("/") { ctx -> ctx.result(completableFuture) } // async

// what happens behind the scenes:
tryBeforeAndEndpointHandlers()
val future = ctx.resultFuture()
if (future == null) {
    tryErrorHandlers()
    tryAfterHandlers()
    writeResult(ctx, res)
} else {
    req.startAsync().let { asyncContext ->
        future.exceptionally { throwable ->
            if (throwable is Exception) {
                exceptionMapper.handle(throwable, ctx)
            }
            null
        }.thenAccept {
            when (it) {
                is InputStream -> ctx.result(it)
                is String -> ctx.result(it)
            }
            tryErrorHandlers()
            tryAfterHandlers()
            writeResult(ctx, asyncContext.response as HttpServletResponse)
            asyncContext.complete()
        }
    }
}

Users who don't care about async are not affected by the functionality in any way.

INPUT WANTED

Source

tipsy

👍1

Most helpful comment

I put together a sample async benchmark application based on my PR. The application can be found here: https://github.com/joatmon/spark-async-benchmark

I've included some sample scripts that demonstrate that, under heavy load, using async request processing for long requests improves the throughput of short synchronous requests by 750x.

When the short queries are competing for jetty threads with the long queries, the application is able to process 250 short requests in ten seconds. With the long queries handled by the async thread pool, the applications processes 188,454 short requests in ten seconds.

joatmon on 15 Apr 2018

👍3

All 10 comments

@keuller, @mj1618, @ruurd, @joatmon you have all been active in the async-issues on the Spark tracker (#549, #208). What do you think about this solution? It should be possible to use a similar approach for Spark.

tipsy on 14 Apr 2018

Hey man looks pretty good to me. A bit of refactoring required for Spark but yep same approach is possible. My PR on spark was attempting to refactor it but ended up being a lot of changes. Overall it can only be a positive if it doesn’t impact existing users. Allows people to use things like non blocking fb login without tying up a thread.

I’m assuming this does allow non-blocking and not just async? Or does it still tie up a thread?

mj1618 on 14 Apr 2018

Maybe some benchmarks to how some numbers on this.

that it can handle more non blocking requests than it has threads.
that there is less error rate at high throughput
determine the length a request has to be at certain throughputs to get lower response time with async

mj1618 on 14 Apr 2018

I did a couple benchmarks on my PR in wrk and locust - happy to pull it up for you. Was testing on a Mac but should probably be run on some kind of decent hardware. I still saw some good improvements for requests over 50ms. Let me know if you want me to help out with this.

mj1618 on 14 Apr 2018

I’m assuming this does allow non-blocking and not just async? Or does it still tie up a thread?

Yes, unless we've missed something fundamental in our implementation and tests. The futures are resolved like this:

future.exceptionally { throwable ->
    if (throwable is Exception) {
        exceptionMapper.handle(throwable, ctx)
    }
    null
}.thenAccept {
    when (it) {
        is InputStream -> ctx.result(it)
        is String -> ctx.result(it)
    }
}

I did a couple benchmarks on my PR in wrk and locust - happy to pull it up for you. ... Let me know if you want me to help out with this.

That would be great, thanks!

tipsy on 14 Apr 2018

@rmaes4 @Adeynack, you guys asked about async in Javalin some time ago. Please check out the implementation if you're still here.

tipsy on 15 Apr 2018

I put together a sample async benchmark application based on my PR. The application can be found here: https://github.com/joatmon/spark-async-benchmark

I've included some sample scripts that demonstrate that, under heavy load, using async request processing for long requests improves the throughput of short synchronous requests by 750x.

joatmon on 15 Apr 2018

👍3

In the benchmark above I implemented an even cleaner way of presenting async request handing to the sparkjava application developer. With this approach developers write async request handlers exactly the same way that they do normal request handlers. They just use a different API to register the handler. Here is example code from the benchmark. Note that I am able to use the exact same method to process requests synchronously or asynchronously.

public class DataController {
    DataService service;

    public DataController(DataService service) {
        this.service = service;
        Spark.get("/data", this::getData);
        SparkAsync.getAsync("/asyncdata", this::getData);
    }

    private String getData(Request req, Response resp) throws Exception {
        long delay = 0;
        String delayParam = req.queryParams("delay");
        if (delayParam != null) {
            delay = Long.parseLong(delayParam);
        }
        return service.getData(delay);
    }
}

joatmon on 15 Apr 2018

I'm not familiar with your source base, but I looked through your implementation and tests. It looks like it should work fine to me.

joatmon on 16 Apr 2018

I got some great input from /u/0x256 on /r/java, I'm posting it here with his permission. In short, the current implementation is good for long running tasks with short responses (the problem we set out to solve), but there are some things we need to fix if we want Javalin to be truly async. We probably don't want that, but the info he provided is great nonetheless:

That [our implementation] is not async IO, that is just delaying the response. Do not use JAX-RS as an example on how async should work in web frameworks, because they got it wrong (or incomplete, IMHO):

In JAX-RS, there is no way to read the request body in a non-blocking way. There is only InputStream and that blocks on slow clients.

You can delay the response (and not block a request thread while waiting for the response to be created) using @Suspended AsyncResponse, but once your response is ready, you have to write it all at once, and that will block if your response is bigger than the write buffer. No support for streaming large responses without blocking on slow clients or buffering to disk.

You might think that StreamingOutput is perfect for, you know, streaming output. Nah, it sucks: It reads from a (blocking) InputStream and writes to an unbounded and/or blocking write buffer (no back-pressure control). For large downloads, you either run into OOMs, end up buffering to temporary files, or block a thread most of the time (depending on the implementation).

Raw Servlets (3.1+) do support real async IO. You can read from the request and write to the response, both in a non-blocking way and with full (albeit really hart to get right) back-pressure control. The only problem is that it is really easy to 'drop the ball' with async servlets and end up in a state where you are mistakenly waiting for the client because you missed a listener call or forgot to call ServletOutputStream.isReady() until it returns false. Or to issue read or write calls more often then you are allowed to and get mixed results. Or re-use a buffer you are not supposed to re-use before some event happened. Writing a simple async echo client (just copy input to output) is really harder than it should be. But it works. Not so with JAX-RS (or Javalin).

A simple test for a web framework that claims to support async IO is to ask: Can I write an 'echo' application that accepts requests of arbitrary size, copies the bytes to the response body of the same request, and serves (significantly) more concurrent clients than there are threads? If you have blocking read/write calls or unbounded buffers (memory or disk) anywhere in your pipeline, then the answer is no. Slow clients will eventually exhaust your thread pool, heap or disk space.

I wrote an abstraction layer on top of javax.servlet.AsyncContext that basically offers this non-blocking API (simplified):

ctx.read(ByteBuffer, ReadyCallback<ByteBuffer>) Reads bytes from the request body into the supplied buffer and triggers the callback on success. The buffer will contain at least one byte as long as the request body is not fully consumed, and zero bytes after that.

ctx.write(ByteBuffer, ReadyCallback<ByteBuffer>) Writes bytes to the response body and triggers the callback once the bytes are fully written and the ByteBuffer can be re-used.

There are three timeouts: The read timeout is triggered if a read request is not completed in time. The write timeout works the same for write requests. The third timeout (application timeout) is triggered if the application does not issue a read or write request (and does not 'ping' the context) for a long time (prevents drop-the-ball bugs).

It is also an error to issue a read or write request if there is still a pending request of the same type. This prevents some out-of-order races and concurrency bugs and ensures that intermediate buffers do not grow out of bounds (pressure control).

It should be easy to use this API with CompleteableFuture if you prefer that.