Rfcs: C++-style initializer list

Created on 1 May 2016 · 31Comments · Source: rust-lang/rfcs

In C++ you can use an initializer to initialize a map, which looks like a map literal and feels nice.

#include <map>
int main() {
    auto m = std::map<int, int> { { 1, 2 }, { 3, 4 } };
}

playground

So I suggest we can do something like this:

let m = <HashMap<i32, i32>> { // wrapping type name with <> to indicate type like in UFCS
  ( 1, 2 ),
  ( 3, 4 ),
};

which do similar thing as above C++ code.

We already have initializer list built-in so we don't need to tackle about the initializer side.

Let me explain how above Rust code work:
It is desugared to:

let m: HashMap<i32, i32> = vec![(1, 2), (3, 4)].into_iter().collect(); // I hope we can drain array here

playground
The trick is that impl<K, V, S> FromIterator<(K, V)> for HashMap<K, V, S> is done already.

To sum up, points are:

start with a type here so we know it's about a type, not for the propose of mentioning some associated item and not to start struct literals and such
wrap a list in braces and separate with comma
each item in the list is plain expression, so the syntax is very flexible

Now you might argue that if the desugared form is already okay why should we accept this fancy sugar.
I say this sugaring form is much more human-parsable and can make life easier.

T-lang

Source

bombless

👎12 👍2

Most helpful comment

@bombless I don't like C++. I am working as C++ programmer, but C++ is ugly. We shouldn't accept any feature from C++ just because it is from C++. Let's wait for another people opinions.

KalitaAlexey on 1 May 2016

❤4 👎4 👍3

All 31 comments

I'd prefer

hash_map!([i32, i32], 1 => 2, 3 => 4)

hash_map!([i32, i32],
          1 => 2,
          3 => 4)

KalitaAlexey on 1 May 2016

👎6 👍1

@KalitaAlexey what I suggest is that we introduce less sugaring, this way the syntax will be more powerful.

For example, my proposal is not about pair, it just happened to involve (U, V), because the list item is (U, V).

This (U, V) can be something else, like plain T.
This way, after we can drain array, maybe after we introduce type-level integer, we can remove vec! macro entirely.

let v: Vec<i32> = <Vec<_>> { 1, 2 };

will be desugared to

let v: Vec<i32> = [1, 2].drain_to_iter().collect();

bombless on 1 May 2016

👍1

@bombless When we introduce type-level integer generics we could make impl

impl<N, T> From<[T; N]> for Vec<T>

And use it in the following form:

let v: Vec<i32> = [1, 2].into();

I am against this syntax because it adds complexity to parser, developers without any significant profit.

KalitaAlexey on 1 May 2016

👍1

@KalitaAlexey
I'm not sure but

hash_map!([i32, i32],
          1 => 2,
          3 => 4)

looks very ugly.

I will try to implement my syntax.

bombless on 1 May 2016

@bombless Uglier than:

<HashMap<i32, i32>> { // wrapping type name with <> to indicate type like in UFCS
  ( 1, 2 ),
  ( 3, 4 ),
};

KalitaAlexey on 1 May 2016

@KalitaAlexey
Well I didn't know you don't like my proposed syntax.

As I mentioned above, initializer list is very powerful, otherwise C++ community won't accept it.

bombless on 1 May 2016

👍2

@bombless I don't like C++. I am working as C++ programmer, but C++ is ugly. We shouldn't accept any feature from C++ just because it is from C++. Let's wait for another people opinions.

KalitaAlexey on 1 May 2016

❤4 👎4 👍3

Plus you don't add one macro for each type, that looks insane.

bombless on 1 May 2016

@bombless Why not? Why does it look insane?

KalitaAlexey on 1 May 2016

@KalitaAlexey Macros exist only because type system is not powerful enough.
Let's build a type-riched language and don't look back.

bombless on 1 May 2016

Macros exist because it allows many things without extending syntax for each case. Rust will end like C++ with many-many features, but with ugly-ugly syntax.

KalitaAlexey on 1 May 2016

And about type system.
With type-level integer generics we could write

impl<K, V, N> From<[(K, V); N]> for HashMap<K, V>

And use:

let m: HashMap<i32, i32> = [(1, 2), (3, 4)].into();

Without extra syntax.

KalitaAlexey on 1 May 2016

👍1

Like I said,

Now you might argue that if the desugared form is already okay why should we accept this fancy sugar.
I say this sugaring form is much more human-parsable and can make life easier.

You see,

let m: HashMap<i32, i32> = vec![(1, 2), (3, 4)].into_iter().collect();

already looks okay, so I'd suggest macro is off-topic here.

bombless on 1 May 2016

I think @KalitaAlexey's integer-generics idea looks pretty nice, and it may be worth just waiting until then.

My main issue with that approach is that it seems to involve pushing the entire array onto the stack and then moving each element into the map individually, which in extreme cases could lead to stack overflow if the optimizer isn't clever enough to figure that out.

How do C++ initializer lists utilize stack space?

rphmeier on 1 May 2016

@rphmeier Same. There is no way to utilize any more efficient.
According to http://www.cplusplus.com/reference/initializer_list/initializer_list

KalitaAlexey on 1 May 2016

I see, type-level integer deserve a RFC.

bombless on 1 May 2016

@bombless It already has. #1038

KalitaAlexey on 1 May 2016

Absolutely agree about preferring type-level integers. I am strongly against additional Rust syntax without a very good motivation, and this motivation doesn't seem strong enough.

alilleybrinker on 1 May 2016

I agree; this syntax is ugly. It should at _least_ use type ascription instead of putting the type name in anglebrackets, and even then, I think we're better off with #542.

BlacklightShining on 2 May 2016

:-1: to extra syntax. There doesn't seem to be enough to pull it's weight. A macro would make sense, however.

ticki on 2 May 2016

You see,

let m: HashMap = vec![(1, 2), (3, 4)].into_iter().collect();

already looks okay, so I'd suggest macro is off-topic here.

The point isn't the syntax, it is insertion at compile time, to improve runtime performance.

ticki on 2 May 2016

Plus you don't add one macro for each type, that looks insane.

The type system want to talk to you.

ticki on 2 May 2016

The point isn't the syntax, it is insertion at compile time, to improve runtime performance.

One can't insert into a HashMap at compile time: for the default hasher, the keys required are generated randomly at runtime, and a runtime allocation is required. Anyway, calling .insert should be able to do a pile of optimisation already (e.g. it can be inlined and/or evaluated at compile time when the arguments are known statically).

There's two reasons I would think special syntax might be faster than calling insert repeatedly or .collect:

preallocating an exact size (possible with HashMap::with_capacity() and .collect does a best-effort preallocation, that works fine for iterators like Vec's)
constructing values in-place inside the hashmap, rather than copying onto the stack and then into the map (this would require defining the syntax to work like that, and will be manually possible soon: https://github.com/rust-lang/rust/issues/27779).

huonw on 2 May 2016

One can't insert into a HashMap at compile time: for the default hasher, the keys required are generated randomly at runtime, and a runtime allocation is required.

Of course, but if the value is known at compile time it is very much possible to hash it at compile time. Even though, my points were mainly the two you listed afterwards.

ticki on 2 May 2016

This is really a job for a macro (like vec!).

nrc on 18 Aug 2016

👍6

Here is a valid macro:

macro_rules! map {
   ($($key:expr => $val:expr),*) => (vec![$(($key, $val)),*].into_iter().collect())
}

playground

Nokel81 on 27 Jun 2017

Just to be clear, initializer_list is one of the most hated features of C++11.

strega-nil on 28 Jun 2017

👎4

(We should add variadics, basically)

strega-nil on 28 Jun 2017

👍4

@ubsan I'll have to strongly disagree with "most hated" since the benefits of initializer_list are among the biggest wins of C++11 for me in day-to-day programming, but I'd agree that it's a massive backwards-compatibility hack that should not be blindly copied, especially since Rust can probably get all the same benefits without the ugly corner cases by doing proper variadics instead.

Ixrec on 28 Jun 2017

👍1

@Ixrec maybe only among my friends, then - it's so bad for two reasons. First of all, uniform initialization means something different than parenthesized initialization, and no standard library functions use uniform initialization. Second, it provides a const view of the data, which is awful for non-trivially-copyable types - basically, you're forced to copy every single value. Don't use uniform initialization for std::vector<std::string>! It should never have been put into C++11 - variadics should have been used.

strega-nil on 28 Jun 2017

Closing in favor of https://github.com/rust-lang/rfcs/issues/542.