Note: I can only reproduce this _reliably_ on macOS 10.15.4.
I see it _occasionally_ on Ubuntu 18.04, and can never reproduce it on Windows.
It may be connected to using Start-Sleep
inside the script block; not sure if the fact that a _custom class_ is used across thread boundaries is relevant [_update_: [it is](https://github.com/PowerShell/PowerShell/issues/12801#issuecomment-634344380)]
class Foo { static [object] Echo($val) { return $val } }
$cls = [Foo]
while ($true) {
$res = 1..10 | ForEach-Object -Parallel {
Start-Sleep -ms 100; ($using:cls)::echo($_)
} | Sort-Object
Compare-Object $res (1..10) | Should -BeNullOrEmpty
Write-Host -NoNewline .
}
The loop should run indefinitely, because the test should alway succeed.
The loop eventually - after a varying number of iterations - breaks due to intermittent test failures, stemming from a missing output; e.g.:
Expected $null or empty, but got @(@{InputObject=5; SideIndicator==>})
That is, not all input objects were echoed.
PowerShell Core 7.1.0-preview.3 on macOS 10.15.4
not sure if the fact that a _custom class_ is used across thread boundaries is relevant.
Yeah. It tries to marshal back to the original pipeline execution thread, but since a pipeline is already running it just runs in the current thread while pretending it's the pipeline thread. As a result a whole bunch of state corruption and other weird side effects are possible.
You can see it by doing this:
class Foo {
static [object] Echo($val) {
return [PSCustomObject]@{
ThreadId = [Threading.Thread]::CurrentThread.ManagedThreadId
RunspaceId = [runspace]::DefaultRunspace.Id
}
}
}
$cls = [Foo]
while ($true) {
1..10 | ForEach-Object -Parallel {
Start-Sleep -ms 100
($using:cls)::echo($_)
}
}
You'll get a whole bunch of different thread ID's with the same runspace. Also occasionally some other evidence of state corruption like "Global scope cannot be removed" errors (note this is on Windows too, I'm guessing your Windows machine just has hardware that makes the race conditions less frequently hit).
That's a long winded way of saying it's basically a dupe of #4003. Though a significantly more likely to happen in the wild repro.
/cc @PaulHigin @daxian-dbw
A possible "solution" is to try to block PowerShell classes from being carried over. Sort of like how script blocks currently work.
If #4003 was fixed, this would just dead lock. Personally I'd prefer if runspace affinity was just removed from classes but I understand it's next to impossible to know the impact of a change like that. Something should probably be done though, many folks are going to turn to classes as a way to "get around" the script block problem. It might even appear to work for awhile.
Thanks for the detailed analysis, @SeeminglyScience.
If removing runspace affinity is too risky, perhaps another option is to simply _recreate_ classes in the target runspace, similar to what is already being discussed for script blocks - #12378 - and in line with making the planned opt-in for transferring runspace state comprehensive - #12240.
I mean doing that would already be effectively removing runspace affinity anyway -- you'd just be binding their affinity to the new runspace. It would be largely indistinguishable from always having the code execute in whichever thread it's called and just looking for the default runspace for that thread. More or less about the same there, I'd guess.
Just straight up recreating it is a little sketchy though because you'd lose state from static properties. Property accessors don't (currently) invoke any PowerShell so they aren't subject to the same problems necessarily. Not that you should use them for state sharing in all your threads, but more that it would be confusing if it "appeared" to be gone randomly. Also recreating the classes would create a lot of confusion around type identity.
While disallowing cross-thread class use is definitely the safest option, it also takes away potentially useful functionality.
Given the "less solid" nature of PowerShell classes my guess is that loss of type identity would be less of an issue - it's a case where documenting the issue may suffice (just as users will have to understand that script blocks that are recreated in a different thread no longer form closures around the original runspace's variables, if any); the .FullName
properties will be identical, but -is
and -as
won't work (perhaps a little white lie could fix that)?
Good point about static properties, however - that sounds more problematic.
But couldn't we special-case this to copy any static property values after recreating (just like script blocks will be special-cased, as discussed in https://github.com/PowerShell/PowerShell/issues/12378#issuecomment-616660763)?
I'd prefer if runspace affinity was just removed from classes but I understand it's next to impossible to know the impact of a change like that.
Has anyone ever looked into this?
Isn't the current behavior _never_ useful, or are there legitimate scenarios where existing code relies on it?
NOTE: The examples below should never be purposefully relied on, they are incredibly dangerous and can corrupt state in unexpected ways and/or cause general instability.
Isn't the current behavior _never_ useful, or are there legitimate scenarios where existing code relies on it?
It's probably accidently relied on. Apparently a class method converted to a delegate can run on any thread, even one without a runspace. I was having trouble coming up with a scenario where this would appear useful because I assumed you needed a thread with a default runspace other than the one it was originally bound to. Knowing this, it makes a lot of .NET interop appear to work as expected.
class Test {
static [psobject] GetInfo() {
return [PSCustomObject]@{
ThreadId = [Threading.Thread]::CurrentThread.ManagedThreadId
RunspaceId = [runspace]::DefaultRunspace.Id
}
}
}
[Threading.Tasks.Task[psobject]]::
Run([Func[psobject]][Test]::GetInfo).
GetAwaiter().
GetResult()
[Test]::GetInfo()
Gives some variant of:
ThreadId RunspaceId
-------- ----------
19 1
15 1
Here's something that really surprised me, if you don't block until completion, it'll actually properly marshal it to the right thread:
class Test {
static [psobject] GetInfo() {
return [PSCustomObject]@{
ThreadId = [Threading.Thread]::CurrentThread.ManagedThreadId
RunspaceId = [runspace]::DefaultRunspace.Id
}
}
}
$task = [Threading.Tasks.Task[psobject]]::Run([Func[psobject]][Test]::GetInfo)
while (-not $task.AsyncWaitHandle.WaitOne(200)) { }
$task.GetAwaiter().GetResult()
[Test]::GetInfo()
ThreadId RunspaceId
-------- ----------
15 1
15 1
So the answer is yes, it probably accidently makes a lot of complicated .NET interop work where it would otherwise throw. Any API where you pass a delegate and that delegate gets called on another thread.
Personally I think these behaviors would be the most ideal for method invocations.
If called on:
SessionState
affinity like normalSessionState
affinity. Maybe with TopLevelSessionState
affinityScriptBlockDelegateInvokedFromWrongThread
like ScriptBlock.Invoke
currently doesThough 2 and 3 would be breaking changes, I think they're worth it considering how dangerous they can be.
cc @rjmholt @PaulHigin @daxian-dbw if any of y'all want to weigh in.
With #3, is it more desirable to throw there, or take the time to spin up a runspace to execute in anyway, so as to "make" it work without corrupting the state of another runspace?
With #3, is it more desirable to throw there, or take the time to spin up a runspace to execute in anyway, so as to "make" it work without corrupting the state of another runspace?
I've thought about that a lot but there's a lot of questions
InitialSessionState.CreateDefault()
?As much as the error has been frustrating for me in the past, it's also been pretty helpful in informing me that I basically just shouldn't use that API. It's a hard call. I'd like the option, but I'm also not super sure how you'd go about surfacing that option. Or if it'd be worth the development time.
Another consequence of this is that if you call a second method from the first one, you get a dead lock.
class Test {
static [void] FirstMethod() {
[Console]::WriteLine('First method starting')
[Test]::SecondMethod()
[Console]::WriteLine('First method finishing')
}
static [void] SecondMethod() {
[Console]::WriteLine('Second method starting')
[Console]::WriteLine('Second method finishing')
}
}
[System.Threading.Tasks.Task]::Run([action][Test]::FirstMethod).GetAwaiter().GetResult()
The first method isn't actually running on the pipeline thread, so the second method call also tries to use PowerShell eventing to retake the pipeline thread. The first method will never complete, because it's waiting on the second method, which is waiting on the first method.
I accidently ran into this trying to use the PSDataCollection<>.DataAdding
event with PowerShell class methods as event handlers.
Most helpful comment
/cc @PaulHigin @daxian-dbw
A possible "solution" is to try to block PowerShell classes from being carried over. Sort of like how script blocks currently work.
If #4003 was fixed, this would just dead lock. Personally I'd prefer if runspace affinity was just removed from classes but I understand it's next to impossible to know the impact of a change like that. Something should probably be done though, many folks are going to turn to classes as a way to "get around" the script block problem. It might even appear to work for awhile.