When we can statically determine that a defer is called (at most) once, we should allocate the Defer structure in the stack frame instead of from the DeferChunk mechanism. We suspect most defers fit into this category.
I sketched this during gophercon. The 6g-only CL is in https://golang.org/cl/100300043 although it also has an unrelated ... fix for escape analysis mixed in. It made basically no difference for things that hit Dmitriy's latest defer cache. For strangely-sized defers it might help and be worth doing.
The code in stack.c suggests the opposite:
for(dp = &gp->defer, d = *dp; d != nil; dp = &d->link, d = *dp) {
if(adjinfo->old.lo <= (uintptr)d && (uintptr)d < adjinfo->old.hi) {
// The Defer record is on the stack. Its fields will
// get adjusted appropriately.
// This only happens for runtime.main and runtime.gopanic now,
// but a compiler optimization could do more of this.
// If such an optimization were introduced, Defer.argp should
// change to have pointer type so that it will be updated by
// the stack copying. Today both of those on-stack defers
// set argp = NoArgs, so no adjustment is necessary.
*dp = (Defer*)((byte*)d + adjinfo->delta);
continue;
}
_Labels changed: added release-go1.5, removed release-none._
_Status changed to Accepted._
Russ is a bit ahead of himself - this will be the case when https://golang.org/cl/141490043/ is in.
It is probably possible to make work. However, I benchmarked it (patch in the CL above) and it just doesn't matter. The defer cache takes care of nearly all the win. It is not worth it. I will leave this open but it is not targeted to a release.
_Labels changed: added release-none, removed release-go1.5._
The compiler also needs to *fill* in the struct and link it into g->defer, and on successful return unlink from g->defer inline. That would provide the speedup. Today the single defer in net.Conn.Read/Write consumes 2.5% of time in the end-to-end HTTP benchmark: http://build.golang.org/log/33d55778f33682e4e1d45e7c825a73d2ea500de7 Defer is not something that must be visible in a profile of such program. We don't want to wipe defers from std lib due to performance, right? I guess there are Go users who already do this with their libraries.
Okay, well that might work. Defers will need more care in the stack copier if they get stack-allocated, because the Defer chain will weave between allocated memory and the stack, but I think it is possible to make it work. (In contrast it was fundamentally not possible to make stack-allocated Panics work when they were also used during traceback, because a Panic near the bottom of the stack wasn't actually needed until later up the stack, and by then it had been rewritten. The CL 141490043 fixed this by not using Panics during traceback anymore.)
Moving to Go 1.9 since we already doubled the performance of defer via other means for Go 1.8.
Related: #5712.
Punting to unplanned, too late for anything major in 1.12.
Change https://golang.org/cl/171758 mentions this issue: cmd/compile,runtime: allocate defer records on the stack
Change https://golang.org/cl/181258 mentions this issue: cmd/link: fix deferreturn detector
Should this be closed? From what I understand, this has been temporarily reverted due to some bugs, and then reverted back: fff4f599 -> 49200e3f -> 8f296f59de
Yes, this is done and should be closed.
Most helpful comment
Change https://golang.org/cl/171758 mentions this issue:
cmd/compile,runtime: allocate defer records on the stack