Related #14361, #14273
Bug. Nilability. Resizing an associative domain to a smaller size when an associated array has a managed, non-nilable class element type attempts to default-assign `nil` to the disappearing elements.
```chpl
class MyInt {
  var x: int;
}

var D: domain(string);
var table: [D] owned MyInt;

proc add(key: string, val: int) {
  D += key;
  table[key] = new MyInt(val);
}

add("1", 1);
add("2", 2);
writeln(table);

proc remove(key: string) {
  // table[key].clear();
  D -= key;
}

remove("1");
writeln(table);
```
```
# chpl version 1.21.0 pre-release (8b3bca92)
{x = 1} {x = 2}
nilable-string-domain.chpl:19: error: halt reached - assigning nil to non-nilable owned
```
As in issue #14631, this behavior seems specific to the `owned` type; changing the array element type to `shared MyInt` seems to work. But as with that issue and #14637, I suspect there are still underlying problems.
This issue is becoming my brainteaser of the week. I've got a simple patch that avoids sticking the nil into the array and generates the "right" result, but it also doesn't transfer ownership (so the owned object persists). I could transfer the ownership out of the array's slot and drop it on the floor, but then what to put into that slot? I feel stuck in a mental dead-end.
Maybe collections of non-nilable like this one should give the appearance of storing non-nilable values, but should actually store nilable and then just assert non-nil at any interface boundaries that return the value back to the user?
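The idea can be sketched in Rust, used here only as a stand-in for Chapel (`NonNilVec` and its methods are invented names): `Option<T>` plays the role of the nilable type, the collection stores it internally, and every boundary that hands a value back asserts non-nil.

```rust
// Sketch only: "store nilable, expose non-nilable".
// Option<T> plays the role of T?; None plays the role of nil.
struct NonNilVec<T> {
    slots: Vec<Option<T>>, // internally nilable
}

impl<T> NonNilVec<T> {
    fn new() -> Self {
        NonNilVec { slots: Vec::new() }
    }

    fn push(&mut self, v: T) {
        self.slots.push(Some(v));
    }

    // Interface boundary: assert non-nil before returning to the user.
    fn get(&self, i: usize) -> &T {
        self.slots[i]
            .as_ref()
            .expect("internal invariant violated: empty slot exposed")
    }

    // Shrinking just drops the trailing owned elements.
    fn truncate(&mut self, new_len: usize) {
        self.slots.truncate(new_len);
    }

    // Removing without shrinking: write the nil placeholder into the
    // slot. The owned element is dropped, but the slot itself remains,
    // which is exactly the move the OP's associative array can't make.
    fn clear_slot(&mut self, i: usize) {
        self.slots[i] = None;
    }
}
```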
Bryant, in filing this do you have a different/better solution in mind?
For this issue, where the domain is removing an element, one approach is to have an unsafe, nilability-ignoring function such as `<management intent>.clear()` that drops the object implicitly (instead of explicitly setting it to nil). The documentation for `.clear()` could indicate that it is a nilability-ignoring/bypassing function. However, this papers over the issue and introduces a rules-breaking function. Maybe that's okay (it's at least grep-able), but it certainly doesn't pair well with object creation (e.g., when the domain is sized up).
A better solution would be some variation of #14273 where an annotated "unsafe" block can temporarily treat the object as nilable to do whatever, so long as the end result when closing the block keeps the non-nilable invariant. It's on the user to make sure the invariant holds, so the compiler can continue to make optimizing assumptions. Maybe this is too laborious, but it's certainly easier to debug / blame when you know the full scope of when nilable behaviors can happen to non-nilable types.
```chpl
class MyVector {
  type eltType; // where isClass(eltType)
  var buffer: /* some non-nilable 0-based array treated as raw memory */;
  var size: uint = 0; // current size of this vector.

  /* Resize this container to smaller than its current size; otherwise a no-op. */
  proc truncate(newSmallerSize: uint) {
    if newSmallerSize < this.size {
      AssumeAsNilable(buffer) {
      // ^^^^^^^^^^^^^^^^^^^^^
        if isUnmanagedClassType(eltType) {
          /* explicitly delete instead of implicitly dropping it */
        } else {
          buffer[newSmallerSize..(this.size - 1)] = nil;
        }
      }
      this.size = newSmallerSize;
    }
  }

  // The rest of the methods in this container
  // don't need to assume any nil objects because
  // there aren't any in the buffer up to this.size.
}
```
This is a somewhat contrived example, but it shows that this approach allows a user to manually manage an array, with all the dangers that come with it (the user manages their own invariants).
> one approach is to have an unsafe nilability-ignoring function such as `.clear()` to drop the object implicitly
I don't think I understand this description. Is your thought that the owned object would have its ownership transferred and dropped on the floor (de-initing the object), but that the object pointer that is stored in memory would continue to point to the stale object rather than being reassigned to nil?
> A better solution would be some variation of #14273 where an annotated "unsafe" block can temporarily treat the object as nilable to do whatever, so long as the end result when closing the block keeps the non-nilable invariant.
To me, this feels equivalent to having the collection store the nilable variant of the user-visible type and then to ensure that only non-nilable values are returned at the public interfaces. Specifically, I'm noting that your proposal still uses `nil` as a placeholder value and relies on the inherent semantics of the data structure to ensure that `nil` is never returned (relying on language changes when an escape is needed). If instead the collection were just to store collections of `C?` internally to implement a collection of `C`, we wouldn't need to change the language, and the collection's developer would still have to worry about similar concerns when implementing it (e.g., making sure that they returned the right types at boundaries and didn't dereference nils in the meantime).
Your comment, "The rest of the methods in this container don't need to assume any nil objects because...", suggests that storing `C` and providing an escape clause when a `nil` needs to be stored into an element might permit most of the code to live in a `C` world and lean on the existing nilability machinery, but I think that by defining the right core helper functions to deal with `C?` and return `C`, a collection author could still get those benefits without language changes.
Do you see any major difference between the two?
Note that in the case of the OP (an associative array), the default implementation today is for Chapel to use an open-addressing hash table in which empty (nil) and full slots are intermingled arbitrarily, so there isn't as clean a boundary between nil and non-nil elements as in your vector example. I don't think this changes anything much (since in both cases, the datatype has to semantically understand which elements are valid vs. not to interact with them safely), but I'm noting it because I could imagine some solutions that might work for the vector case, yet not for an open addressing hash table.
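To make the contrast concrete (in Rust rather than Chapel, used here only as a stand-in; `Slot`, `OpenTable`, and the toy hash are invented for illustration): in an open-addressing layout, the empty/full distinction is a per-slot property that every probing operation must understand, not just the shrink path as in the vector example.

```rust
// Sketch only: an open-addressing table where Empty slots play the
// role of nil elements and are intermingled with Full slots.
enum Slot<V> {
    Empty,           // the "nil" placeholder
    Full(String, V), // key, value
}

struct OpenTable<V> {
    slots: Vec<Slot<V>>,
}

impl<V> OpenTable<V> {
    fn with_capacity(n: usize) -> Self {
        let mut slots = Vec::with_capacity(n);
        for _ in 0..n {
            slots.push(Slot::Empty);
        }
        OpenTable { slots }
    }

    // Toy hash, for illustration only.
    fn bucket(&self, key: &str) -> usize {
        key.bytes().map(|b| b as usize).sum::<usize>() % self.slots.len()
    }

    // Linear probing; assumes the table never fills up.
    fn insert(&mut self, key: &str, val: V) {
        let mut i = self.bucket(key);
        loop {
            match &self.slots[i] {
                Slot::Empty => break,
                Slot::Full(k, _) if k == key => break,
                _ => i = (i + 1) % self.slots.len(),
            }
        }
        self.slots[i] = Slot::Full(key.to_string(), val);
    }

    fn get(&self, key: &str) -> Option<&V> {
        let mut i = self.bucket(key);
        for _ in 0..self.slots.len() {
            match &self.slots[i] {
                Slot::Empty => return None,
                Slot::Full(k, v) if k == key => return Some(v),
                _ => i = (i + 1) % self.slots.len(),
            }
        }
        None
    }

    // Removal writes the "nil" placeholder back into the slot, dropping
    // the value. (A real table needs tombstones so probe chains survive
    // removal; omitted here.)
    fn remove(&mut self, key: &str) {
        let mut i = self.bucket(key);
        for _ in 0..self.slots.len() {
            match &self.slots[i] {
                Slot::Empty => return, // key absent
                Slot::Full(k, _) if k == key => {} // found: clear below
                _ => {
                    i = (i + 1) % self.slots.len();
                    continue;
                }
            }
            self.slots[i] = Slot::Empty;
            return;
        }
    }
}
```

Note how `remove` reintroduces the placeholder into the middle of the slot array, which is exactly the intermingling that makes the hash-table case messier than the vector case.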
> one approach is to have an unsafe nilability-ignoring function such as `.clear()` to drop the object implicitly
My thought is that, today, `.clear()` will drop the object, set it to nil, and ignore all nilability. This is a bug, ... but it could also just be the unsafe behavior that `.clear()` does (i.e., make it a feature and not a bug). There would be two coding guidelines: for nilable objects, explicitly assign `nil` to drop the value; for non-nilable objects, call `.clear()` to violate nilability and drop it, knowing full well that you're doing something dangerous. ([[1]]: most performant, least type-safe.)
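Rust has a close analogue of this "feature, not bug" reading: `std::mem::ManuallyDrop`, whose `drop` is an explicitly `unsafe`, grep-able escape hatch that runs the destructor while leaving the (now stale) bits in place, with no nil written. A sketch, where `clear` is an invented helper name:

```rust
use std::mem::ManuallyDrop;

// Invented helper: the moral equivalent of the proposed .clear().
// It runs the destructor but writes no nil placeholder; the slot now
// holds a logically dead value that must never be used again.
fn clear<T>(slot: &mut ManuallyDrop<T>) {
    // SAFETY: the caller promises not to touch *slot afterwards --
    // the same "you know you're doing something dangerous" contract.
    unsafe {
        ManuallyDrop::drop(slot);
    }
}
```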
> To me, this feels equivalent to having the collection store the nilable variant of the user-visible type and then to ensure that only non-nilable values are returned at the public interfaces.
Agreed.
> Do you see any major difference between the two?
Yeah, I admit it was a contrived example. For low-level data structures with only a single container, it is reasonable to use a nilable container to represent the non-nilable interface. For higher-level data structures where some construction using multiple, dependent arrays is required, this gets trickier.
I know this issue is for destruction, but if there's a simpler way to do construction than via an annotated code block, I haven't thought of one. Expecting the user to wrap all their complicated cases into nilable arrays and present a non-nilable interface is likely too much work, especially if after construction, the array is non-nilable for the rest of its (long) life.
> I could imagine some solutions that might work for the vector case, yet not for an open addressing hash table.
This trickiness is why many of Rust's low-level data structures are implemented using unsafe blocks. A library developer is allowed to side-step some stringent compiler checks (in cases where the compiler cannot definitively prove that what is being done is truly safe) so long as they still present a safe interface to users.
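For example, here is a Rust sketch of the earlier `MyVector.truncate` idea (names like `RawVec` are invented): the buffer is raw, possibly-uninitialized storage, and the `unsafe` blocks fence off exactly the code that relies on the "initialized up to `len`" invariant, much as the proposed `AssumeAsNilable` block would.

```rust
use std::mem::MaybeUninit;
use std::ptr;

// Sketch only: the invariant is that buf[0..len] is initialized.
struct RawVec<T> {
    buf: Vec<MaybeUninit<T>>,
    len: usize,
}

impl<T> RawVec<T> {
    fn new() -> Self {
        RawVec { buf: Vec::new(), len: 0 }
    }

    fn push(&mut self, v: T) {
        self.buf.push(MaybeUninit::new(v));
        self.len += 1;
    }

    fn get(&self, i: usize) -> &T {
        assert!(i < self.len);
        // SAFETY: i < len, so buf[i] is initialized.
        unsafe { self.buf[i].assume_init_ref() }
    }

    // The analogue of MyVector.truncate: the unsafe block plays the
    // role of AssumeAsNilable -- a clearly scoped region where the
    // checker's guarantees are suspended and the author is on the hook.
    fn truncate(&mut self, new_len: usize) {
        if new_len < self.len {
            for i in new_len..self.len {
                // SAFETY: buf[i] is initialized and never used again.
                unsafe { ptr::drop_in_place(self.buf[i].as_mut_ptr()); }
            }
            self.buf.truncate(new_len);
            self.len = new_len;
        }
    }
}

impl<T> Drop for RawVec<T> {
    fn drop(&mut self) {
        self.truncate(0); // drop the still-initialized elements
    }
}
```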
Chapel certainly shouldn't go to this extreme conclusion if it doesn't have to, but there's a tradeoff between performance and safety [[1]]. The most type-safe thing to do is to force users to define a default value for their type and do the construction/destruction by creating this default value and swapping it into the affected slot. If creating this default value is non-trivial work, there could be a huge performance penalty.
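That trade-off can be made concrete with Rust's `std::mem::take`, which implements exactly this swap-in-a-default move (`remove_at` is an invented helper, not anything from the thread):

```rust
use std::mem;

// The most type-safe option: removal never leaves a hole, because a
// freshly constructed T::default() fills the vacated slot. The cost is
// constructing that default, which may be nontrivial for some types.
fn remove_at<T: Default>(slots: &mut [T], i: usize) -> T {
    mem::take(&mut slots[i])
}
```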
I ran into a similar problem today. This code doesn't work:
```chpl
class MyInt {}

use List;

var l1 = new list((int, owned MyInt));
var l2 = new list((int, shared MyInt));
```
```
$CHPL_HOME/modules/standard/List.chpl:407: In function '_makeArray':
$CHPL_HOME/modules/standard/List.chpl:408: error: Cannot default-initialize a tuple component of the type owned MyInt
note: non-nil class types do not support default initialization
note: Consider using the type owned MyInt? instead
$CHPL_HOME/modules/standard/List.chpl:295: Function '_makeArray' instantiated as: _makeArray(this: list((int(64),owned MyInt),false))
```
One of the functions in List assumes default initialization is available.
Also, without a better default error backtrace, errors will become incomprehensible to library users because they are reported too deep in the call stack (e.g., inside List), far from the point where the user made the error. But that's a different concern.
@BryantLam: This latest case may have an easy fix. I'd be curious whether the following edit works for you (if you're working from a copy of Chapel that you can edit):
```diff
diff --git a/modules/internal/ChapelBase.chpl b/modules/internal/ChapelBase.chpl
index 307d194624..c4bf3153df 100644
--- a/modules/internal/ChapelBase.chpl
+++ b/modules/internal/ChapelBase.chpl
@@ -995,7 +995,7 @@ module ChapelBase {
   inline proc _ddata_allocate(type eltType, size: integral,
                               subloc = c_sublocid_none,
-                              initElts: bool=true) {
+                              param initElts: bool=true) {
     pragma "fn synchronization free"
     pragma "insert line file info"
     extern proc chpl_mem_array_alloc(nmemb: size_t, eltSize: size_t,
```
That worked!
I'll do more extensive testing and put together a PR.
I've opened PR #14412 against this most recent case; it doesn't do anything to try and address the original issue (I'll probably collapse this sub-conversation once it's merged).
When I run the example from the top of the issue today, I get:
```
./bryant.chpl:6: error: Cannot default initialize associative array because element type owned MyInt is a non-nilable class
```
It's possible to disable that error but we still have problems with the pattern
```chpl
D += key;
table[key] = new MyInt(val);
```
because the `D += key` line will cause the array element for `key` to be default-initialized. I'm sure we've discussed this somewhere, but one idea is to write this sequence differently, e.g. as a straw-person `addKeyAndSetValue(D, key, A, new MyInt(val));`.
Edit: #15703 also proposes that such a `D += key` would be OK if all the arrays declared over the domain are currently in a `noinit` state.
> I'm sure we've discussed this somewhere but one idea is to write this sequence differently, e.g. as a straw-person `addKeyAndSetValue(D, key, A, new MyInt(val));`.
A challenge to generalizing this idea is that it'd need to be a varargs routine to take any non-default-initable arrays defined over `D`.