Rust: #[repr(transparent)] for Box<T>?

Created on 2 Aug 2018 · 39Comments · Source: rust-lang/rust

https://github.com/rust-lang/rust/issues/43036 added #[repr(transparent)] with one of the goals being safe FFI interaction for newtypes, and added that attribute to bunch of built-in types such as NonZero, Waker, Unique etc.

One use-case that could be really helpful for FFI though is adding it to Box<T>. I haven't found any previous discussions on pros and cons of doing so, so apologies if this is way off and there are obvious reasons why it can't be such, but as far as I can tell, this is not a breaking change neither from API nor from memory representation perspective.

Looking from definition, Box is just a wrapper around Unique which is already #[repr(transparent)] and wraps NonNull which is also #[repr(transparent)] which, in turn, wraps raw pointer which is FFI-safe.

Adding this attribute to Box would make it transparent wrapper all the way down around a raw pointer, and so would allow to do FFI interactions by simply leaking Box as return type in allocator functions and accepting it as-is in deallocator function without manual into_raw and from_raw conversions.

Aside from ergonomics improvement (allocation / deallocation is pretty common operation in cdylibs), this would also allow code to be more self-documenting as just from looking at rustdoc you would be able to tell which pointers are owned and will be allocated or consumed by Rust side.

Counter-argument might be that one can implement custom transparent CBox on top of NonNull, but this involves reimplementing many APIs and guarantees Box already provides e.g. Unique is not exposed at the moment, and then you also have various standard traits that "just work" with Box, so if it would be possible to avoid reimplementing all of that with no obvious downsides, it might be still useful to shared library authors.

C-feature-request T-libs

Source

RReverser

👍4

Most helpful comment

PhantomData<A> instead of an embedded A, which could not be zero sized.

CryZe on 6 Aug 2018

👍2

All 39 comments

How does this interact with Drop? And how are foreign functions expected to allocate (using Rust's allocator) the boxed memory?

mjbshaw on 5 Aug 2018

They don't. I already do this in my C API even though it's not entirely defined behavior at the moment: https://github.com/LiveSplit/livesplit-core/blob/3ac0834f70a3963f1b7e4fec84b0803c51e6349e/capi/src/fuzzy_list.rs#L12-L24

CryZe on 5 Aug 2018

👍1

Yeah, what @CryZe said. This is not for importing foreign functions, this is for easily and safely exposing Rust functions that allocate+return+deallocate Box via C API.

RReverser on 5 Aug 2018

How would this interact with #50882?

glandium on 5 Aug 2018

Sadly, they wouldn't be compatible when allocator is not zero-sized. I'm surprised Box is going to store allocator by value (it seems so from the change?) even though in theory same allocator state might be reused for many allocations and must be shared between them... Also it seems quite inefficient for most libs/apps out there that use just one allocator. I guess I'm going to need to dig into that PR deeper tomorrow when I'm not asleep.

RReverser on 6 Aug 2018

Also it seems quite inefficient for most libs/apps out there that use just one allocator

The allocator value that would be used in most cases is Global, a zero sized type.

sfackler on 6 Aug 2018

Nothing in Box<T, A: Alloc>(Unique<T>, A) says the allocator is stored by value. It all depends what Alloc is impl'ed for. It could be a ZST, like Global, or a reference (impl Alloc for &MyAlloc).

glandium on 6 Aug 2018

@sfackler The question is more whether this would be guaranteed for any allocators (I suspect not). In that case, repr(transparent) would indeed not work, but I suspect there are other parts of compiler that currently rely on lang item behind Box being just a pointer...

RReverser on 6 Aug 2018

I feel like if it would / should be guaranteed, then PhantomData would /
should be used instead.

Ingvar Stepanyan notifications@github.com schrieb am Mo. 6. Aug. 2018 um
13:23:

@sfackler https://github.com/sfackler The question is more whether this
would be guaranteed for any allocators (I suspect not). In that case,
repr(transparent) would indeed not work, but I suspect there are other
parts of compiler that currently rely on lang item behind Box being just
a pointer...

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/rust-lang/rust/issues/52976#issuecomment-410676327,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ABYmbsshbFXIf5D0N2GfHm6UuPDPtwFOks5uOCcigaJpZM4VsMCs
.

CryZe on 6 Aug 2018

@CryZe

hantomData would / should be used instead

Sorry, I didn't get it - instead of what?

RReverser on 6 Aug 2018

PhantomData<A> instead of an embedded A, which could not be zero sized.

CryZe on 6 Aug 2018

👍2

Agreed. IIUC the motivation for storing A is to give flexibility and allow several instances of the same allocator, each with its own state, but this looks like quite a niche usecase to be covered by stdlib.

RReverser on 6 Aug 2018

Also, I suppose, if allocator does want to share some extra data, it can always allocate pointer with larger size and store that metadata (or a pointer to one) alongside actual data in the heap.

RReverser on 6 Aug 2018

Why does Box need to differ from unique_ptr in this respect?

sfackler on 6 Aug 2018

Do you mean C++ unique_ptr? Why would Box be similar to it? For one, Rust already has lots of differences with C++, including in memory management, and in particular availability of zero-sized types which is what @CryZe and me are suggesting to use for custom allocators.

RReverser on 6 Aug 2018

Box is the direct analogue of C++'s unique_ptr. If I understand it correctly, you want to prohibit the use of stateful allocators (e.g. an arena) with Box because it will allow you to treat Box the same as a raw pointer for FFI?

sfackler on 6 Aug 2018

Yes, but not necessarily prohibit - as I said, even for cases where allocator does need to store some unique state (that can't be put into lazy_static! or similar), it can do so on the heap alongside actual data, like e.g. some implementations of malloc do for the size of allocated data.

So this preserves flexibility for allocators while allows to use Box across FFI boundary without transformations, just like references.

RReverser on 6 Aug 2018

Right now, Box<T, A> is not a thing: we only have Box<T>, which is a wrapper over Unique<T> which is indeed transparent. I think Box<T> not being transparent is a serious bug.

I understand the motivation of extending Box<T> to support Box<T, A> in the future. That's compatible with Box<T> being repr(transparent), it just constraints A to be a ZST.

I understand the desire to allow A's that aren't ZSTs, but it is IMO the job of those extending Box with that to figure out a way of doing so without breaking the guarantee of Box<T> being transparent.

I haven't heard any arguments in support of not making this change now. So are there any? Otherwise we should just make Box<T> transparent now.

gnzlbg on 7 Jul 2019

👍1

I'm absolutely still in favour of this change, but the backpressure was hard when I raised the issue. Did it change / can we agree allocators to be limited to PhantomData<...whatever...> or any other ZST now?

RReverser on 9 Jul 2019

FWIW, we don't even have to apply the repr(transparent) attribute to Box<T>, we just need to guarantee that Box<T> is layout compatible with *mut T under certain conditions. That can be simply done in the documentation of Box<T> for the time being.

gnzlbg on 9 Jul 2019

👍1

That can be simply done in the documentation of Box for the time being.

Well, it can't because it's not guaranteed by the compiler at all (even if happens to be compatible on popular platforms).

RReverser on 9 Jul 2019

If the Box documentation says that Box is guaranteed to have the same layout as a *mut T, and the compiler violates it, the compiler has a bug.

gnzlbg on 9 Jul 2019

Yeah, but documentation currently doesn't say that, and merely writing that down won't guarantee that it actually works that way everywhere :)

RReverser on 9 Jul 2019

I'd much rather have a strict #[repr(transparent)] so that this guarantee would derive naturally, and future extensions to Box would be constrained to adhere to it too.

RReverser on 9 Jul 2019

Yeah, but documentation currently doesn't say that,

Hence this proposal.

and merely writing that down won't guarantee that it actually works that way everywhere :)

Of course not, that would need to come with tests, but AFAIK the compiler currently gives structs with default layout and a single field the same layout as that field on all platforms that Rust currently supports. So beyond adding a test to liballoc checking that this is the case for Box, not much more would need to be done.

I'd much rather have a strict #[repr(transparent)] so that this guarantee would derive naturally,

That would prevent extending Box with an allocator type parameter, which is something that we want to do, and an entire working group is trying to solve. Since repr(transparent) would be part of the Box API, removing it later would be a breaking change, so I don't think we can do that.

gnzlbg on 9 Jul 2019

That would prevent extending Box with an allocator type parameter, which is something that we want to do, and an entire working group is trying to solve.

It wouldn't prevent it, just constraint to using ZST as an allocator parameter, which is what I suggested originally and you seemed to support above as well?

RReverser on 9 Jul 2019

It wouldn't prevent it, just constraint to using ZST as an allocator parameter

We want to support non zero-sized allocator parameters, and this is incompatible with that.

you seemed to support above as well?

I support guaranteeing that Box<T> has the same layout as *mut T. As mentioned, supporting non ZSTs allocators would be a backward incompatible change if Box<T> is made repr(transparent), so we can't make it repr(transparent).

gnzlbg on 9 Jul 2019

Okay, I guess I misread

That's compatible with Box being repr(transparent), it just constraints A to be a ZST.

as encouragement of supporting only ZST A's.

RReverser on 9 Jul 2019

@RReverser

@Lokathor pointed out, that this "implementation detail" is "documented" in the repr(transparent) RFC:

As a matter of optimisation, eligible #[repr(Rust)] structs behave as if they were #[repr(transparent)] but as an implementation detail that can't be relied upon by users.

Box<T> is not user code, but part of the implementation, so it could rely on this, and document that, at least for Box<T> this will always hold.

gnzlbg on 10 Jul 2019

Fine by me - not the outcome I initially hoped for, but it does the trick and reaches same goal anyway :)

RReverser on 10 Jul 2019

I understand the desire to allow A's that aren't ZSTs, but it is IMO the job of those extending Box with that to figure out a way of doing so without breaking the guarantee of Box<T> being transparent.

The goal of using standard library containers with different allocators, including non-zero-size handles, is part of an accepted RFC that precedes the new transparency guarantee being proposed here: https://rust-lang.github.io/rfcs/1398-kinds-of-allocators.html#what-about-standard-library-containers

Ideally we would have Box<T, A> is repr(transparent) if and only if A is zero-size, but as far as I know there is no way to express that in the language today.

SimonSapin on 25 Jul 2019

👍1

@SimonSapin

Ideally we would have Box is repr(transparent) if and only if A is zero-size, but as far as I know there is no way to express that in the language today.

This PR solves that: https://github.com/rust-lang/rust/pull/62514

Once we stabilize Box we just need to clarify that that only holds if A is zero-sized, and that's pretty much it AFAICT.

We plan to submit an RFC to guarantee this for all types for which this holds as part of the UCGs RFC. There is a PR in the UCG repo with proposed wording for that as well: https://github.com/rust-lang/unsafe-code-guidelines/pull/164 (is blocked on some nit-picking on how to abbreviate one-aligned ZSTs but there is consensus that we want to guarantee that). So when that happens the documentation comment guaranteeing this wouldn't be technically necessary anymore, since it would be a given, but it would obviously be documentation worth having.

gnzlbg on 25 Jul 2019

I think #62514 is only adding comments, and there seems to be needing language level mechanism if we want to move forward.

crlf0710 on 17 Aug 2019

I think #62514 is only adding comments,

The comment there guarantees that box can be used in C FFI. Isn't that enough to resolve this?

gnzlbg on 17 Aug 2019

Isn't that enough to resolve this?

Is it? From FFI ABI perspective, adding those comments doesn't turn ABI for Box from representing a struct to representing a pointer, which is all #[repr(transparent)] all about, does it?

crlf0710 on 17 Aug 2019

As long as it is guaranteed and there is some test verifying that it is the case, it doesn't need a special attribute.

CryZe on 17 Aug 2019

From FFI ABI perspective, adding those comments doesn't turn ABI for Box from representing a struct to representing a pointer, which is all #[repr(transparent)] all about, does it?

Box is as special as &mut T or i32, the compiler already sets the ABI of Box to that of a single pointer. The only thing the comment does is guarantee to users that this will never change (otherwise that would be a breaking change).

gnzlbg on 17 Aug 2019

ok, i guess that's the magic of lang_item then. thank you for the elaboration.

crlf0710 on 17 Aug 2019

We now have some extensive documentation about use of Box with FFI and generally the properties we guarantee: https://doc.rust-lang.org/nightly/std/boxed/index.html#memory-layout. Closing.