Go: Proposal: add a "merge" built-in function to join several slices.

Created on 18 Feb 2018 · 20 Comments · Source: golang/go

The problem

Sometimes we need to insert a slice (call it x) into another slice (call it y) at index i of y.
If the capacity of y is large enough to hold all elements of x, things are simple.

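func insert1(y, x []T, i int) []T {
    // Extend y in place; valid only if len(y)+len(x) <= cap(y).
    s := y[:len(y)+len(x)]
    // Shift the tail right to make room (copy handles the overlap).
    copy(s[i+len(x):], s[i:])
    copy(s[i:], x)
    return s
}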

However, if the capacity of y is not large enough to hold all elements of x,
we must use make to allocate a new slice that is large enough to hold all elements of both x and y.

func insert2(y, x []T, i int) []T {
    s := make([]T, 0, len(x)+len(y))
    s = append(s, y[:i]...)
    s = append(s, x...)
    s = append(s, y[i:]...)
    return s
}

The problem here is that the make function zeroes all of the allocated memory,
which is unnecessary in this case because every element is overwritten immediately afterwards.

The proposal

So I propose a merge (or join, or concat) built-in function to merge several slices.

func merge(slices ...[]T) []T

so that we can call

merge(y[:i], x, y[i:])

which will be more efficient than the insert2 function.

Go2 LanguageChange Proposal

Source

dotaheor

Most helpful comment

Quoting @dotaheor: "The core of this issue is to avoid the make function zeroing a just allocated memory."

You don't need a language change for that. You just need a compiler optimization that infers regions of slices that are unconditionally overwritten, which should be fairly straightforward for the simple case of appending a sequence of slices.

bcmills on 3 Mar 2018

👍2

All 20 comments

Append already reuses the array if the bytes fit in the destination.

as on 18 Feb 2018

Duplicate of https://github.com/golang/go/issues/18605, I believe.

josharian on 18 Feb 2018

👍2

@as
append is only useful to merge two slices.

dotaheor on 18 Feb 2018

@josharian
It looks like https://github.com/golang/go/issues/18605 is about extending the ways variadic arguments can be passed.
This proposal is about merging multiple slices into a newly allocated slice without zeroing the new slice when it is allocated.

dotaheor on 18 Feb 2018

@dotaheor if you could do append(s, y[:i]..., x..., y[i:]...) then you would get all of this, with no new built-ins. And that's what #18605 is about.

josharian on 18 Feb 2018

Ah, yes.

dotaheor on 18 Feb 2018

But with append(s, y[:i]..., x..., y[i:]...), the slices y[:i], x, and y[i:] would first be merged into one slice to serve as the variadic argument of append, and that merged slice would then be merged again with the first argument of append. That means two allocations, whereas my proposal needs only one.

dotaheor on 18 Feb 2018

There’s no reason I see that it would have to be implemented that way.

josharian on 18 Feb 2018

Ok, it would be great if that change can satisfy this need.

dotaheor on 18 Feb 2018

Or we just do this in library code once we have generics (#15292).

bradfitz on 18 Feb 2018

👍1

@bradfitz generics do not help with this problem.
The core of this issue is to avoid the make function zeroing a just allocated memory.
Surely a make_without_zeroing proposal could never get approved.
So I propose the merge function instead; it may also be useful beyond the case in my first comment.

dotaheor on 18 Feb 2018

So why not a grow function instead, that acts like realloc in C? Then you could do more than merge (which would be grow+copy).

andlabs on 28 Feb 2018

@andlabs grow still needs to zero new elements, which is unnecessary.

dotaheor on 1 Mar 2018

But I think that sometimes we really need a grow function (which doesn't zero old elements if a new underlying memory block is allocated), or an assureSliceCap function.
We can't implement a grow in the most efficient way by using make, copy and append.

dotaheor on 1 Mar 2018

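Quoting @dotaheor: "The core of this issue is to avoid the make function zeroing a just allocated memory."

You don't need a language change for that. You just need a compiler optimization that infers regions of slices that are unconditionally overwritten, which should be fairly straightforward for the simple case of appending a sequence of slices.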

bcmills on 3 Mar 2018

👍2

It would be great if a compiler optimization could achieve the goal of a merge function.
I think it is possible at least for some scenarios.

dotaheor on 3 Mar 2018

This can be done either via #18605, or via generics, or via a compiler optimization. We're not going to accept this as a builtin function directly.

ianlancetaylor on 24 Apr 2018

OK. I hope a tangible solution is being planned to resolve this inefficiency
and to close this gap in Go's slice manipulation facilities.

dotaheor on 25 Apr 2018

I do have to wonder, and I forget if anyone said this at all in this thread, if the cost of zero-initializing is really significant enough to optimize it away. (This means actual experiments and benchmarks to confirm or deny this.)

andlabs on 25 Apr 2018

Quoting @andlabs: "I do have to wonder, and I forget if anyone said this at all in this thread, if the cost of zero-initializing is really significant enough to optimize it away."

The current compiler spends about 15% of its execution time zero initializing new allocations.

I doubt that much of that can be optimized away, but the answer is yes: It matters.

josharian on 26 Apr 2018
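One way to run the kind of experiment asked for above is a small stand-alone benchmark. The sketch below compares copying into a slice obtained from make with letting append allocate the destination; the function names and the 1 MiB size are arbitrary assumptions. No results are claimed here: the numbers depend heavily on the compiler version, which may already optimize one or both patterns.

```go
package main

import (
	"fmt"
	"testing"
)

// cloneWithMake allocates with make and then overwrites every byte.
func cloneWithMake(src []byte) []byte {
	dst := make([]byte, len(src))
	copy(dst, src)
	return dst
}

// cloneWithAppend lets append allocate the destination itself.
func cloneWithAppend(src []byte) []byte {
	return append([]byte(nil), src...)
}

var sink []byte // keeps results alive so the calls are not optimized away

func main() {
	src := make([]byte, 1<<20) // 1 MiB of input, an arbitrary choice
	for name, clone := range map[string]func([]byte) []byte{
		"make+copy": cloneWithMake,
		"append":    cloneWithAppend,
	} {
		// testing.Benchmark runs a benchmark outside of "go test".
		r := testing.Benchmark(func(b *testing.B) {
			for i := 0; i < b.N; i++ {
				sink = clone(src)
			}
		})
		fmt.Println(name, r)
	}
}
```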
