Was discussing with @jpsnover a concern that if a cmdlet opens a resource (like a database connection) in Begin, and later in the pipeline an exception is thrown, End is not called to dispose of the connection.
function a {
[cmdletbinding()]
param()
begin { "a:begin" }
process { 1 }
end { "a:end" }
}
function b {
[cmdletbinding()]
param([parameter(ValueFromPipeline=$true)]$a)
begin { "b:begin" }
process { $a; throw 'oops' }
}
a | b
b:begin
a:begin
a:end
oops
At /Users/steve/test/end_not_called.ps1:15 char:19
+ process { $a; throw "oops" }
+ ~~~~~~~~~~~~ + CategoryInfo : OperationStopped: (oops:String) [], RuntimeException
+ FullyQualifiedErrorId : oops
b:begin
a:begin
oops
At /Users/steve/test/end_not_called.ps1:15 char:19
+ process { $a; throw "oops" }
+ ~~~~~~~~~~~~ + CategoryInfo : OperationStopped: (oops:String) [], RuntimeException
+ FullyQualifiedErrorId : oops
Name Value
---- -----
PSVersion 6.1.0-preview.1
PSEdition Core
GitCommitId v6.1.0-preview.1
OS Darwin 17.5.0 Darwin Kernel Version 17.5.0: Mon Mar 5 22:24:32 PST 2018; root:xnu-4570.51.1~1/RELEASE_X86_64
Platform Unix
PSCompatibleVersions {1.0, 2.0, 3.0, 4.0...}
PSRemotingProtocolVersion 2.3
SerializationVersion 1.1.0.1
WSManStackVersion 3.0
@SteveL-MSFT I had some further discussions with @jpsnover. Basically EndProcessing isn't called if there is an exception because it's not a destructor. EndProcessing is just part of the normal processing. Consider a counter function function count {begin {$c=0} process {$c++} end {$c}}
. An exception should terminate function execution not resume execution at end
. If it ran end
then whatever was in $c
would be incorrectly emitted to the pipeline. As far as clean-up goes, for compiled cmdlets, we call Dispose
on the cmdlet which allows for clean-up. For scripts, we don't expose Dispose
in a useful way so perhaps we should add a "cleanup { }" block for functions scripts. (IIRC our thinking wrt scripts was that scripts call cmdlets and cmdlets get disposed therefore most things in a script will be disposed. Things that don't get disposed that way will eventually be disposed by the garbage collector/finalizer q.e.d there was no need for a cleanup block.)
Seems reasonable
I don't like the idea of waiting on garbage collection to close a database connection. There is a need for a Cleanup {}
or Finally {}
block for the exact scenario outlined.
I wrote an "advanced function" that opens a DataReader
connection in the Begin {}
block so that it could pass the record to other cmdlets to do stuff. I could never figure out how to properly close it if a terminating error happened anywhere in the pipeline. Adding a new block would solve that problem.
Interesting how cmdlet dispose local variables? We could add a script property (or something like that) with code to mandatory execute a dispose code.
Reusing this issue to consider having a Finally{}
type block for cleanup
Please consider providing syntax like C# using block.
@iSazonov
Interesting how cmdlet dispose local variables? We could add a script property (or something like that) with code to mandatory execute a dispose code.
Compiled cmdlets can just implement IDisposable
and PowerShell engine properly call it to do cleanup.
I meant
$con = OpenDatabase
$con.SetDisposeScriptBlock( { $using:con.Close() } )
@aetos382 This is a slightly different scenario where a resource _internal to the command_ is opened in Begin
, used in Process
and then closed in End
. A using
-style statement isn't applicable since there are three separate blocks and you don't want to dispose the resource when a block completes, only when _all_ blocks have completed. Now you could wrap each of the blocks in a try/catch
statement
try
{
... do stuff with resource....
}
catch {
... close the resource
}
Note that you want to use catch
instead of finally
because you only want to close the resource if there's an exception. In the normal flow, the resource would be released in the end
block.
WRT the finally {}
cmdlet block @SteveL-MSFT, proposes, I expect that it would be run when the command is disposed by the pipeline processor. (Consequently, one might argue that it should be called Dispose
instead of Finally
. Another name might be trap { }
in which case we would have to think about break
and continue
semantics.
@iSazonov How/when would the dispose scriptblock get called? The pipeline processor has a list of all of its commands so it can dispose them but it doesn't have a list of the variables. I suppose it could grab all of the variables in the current scope but that could cause problems with objects being returned. Maybe we should have a Register-ForDispose
cmdlet to tell the pipeline processor what variables to dispose? And is a scriptblock really necessary? In what scenarios do you think we'll need it? Maybe it could be optional?
when would the dispose scriptblock get called?
We could consider local variables for the feature (or cmslet local).
Maybe we should have a Register-ForDispose cmdlet to tell the pipeline processor what variables to dispose?
If we'll use syntax like $connection.SetDisposeScriptBlock() - the method could do the registration.
And is a scriptblock really necessary?
How close the DB connection without explicit code?
@BrucePay In your mind, would trap {}
only run in an exception scenario?
IMO, Dispose {}
would be best and Ideally it would always run when the pipeline disposes the command, Code in this block might not be exception code, just cleanup code ensured to run when the pipeline completes regardless of presence or lack of exception.
Trap {}
would infer to me that it only runs when there is an exception. I think that should be handled with the "ubiquitous error handling" instead, however that ends up being implemented.
Some other considerations for consistency:
Support for Dispose {}
in scripts, ScriptBlock, and advanced functions. I'm not sure there is any difference in implementation, or not just want to ensure consistency and easily modifying code to be any of the, Users should easily be able to take the same code and use it as a ScriptBlock, Function, or Script.
Also, possibly exposing the same Dispose {}
inside .psm1 to run on Remove-Module
(shortcut to $MyInvocation.MyCommand.ScriptBlock.Module.OnRemove
) .
Is this something that should also be considered for Configuration {}
? Perhaps there is some cleanup that needs to be done there as well like the block querying a database for config data or something.
I would propose defer
block for naming as of other languages already use for similar use case.
Or.... processDefer, endDefer would be....?
Another option - annotate the variable assignment somehow - e.g. a type annotation or a use
statement like F#:
function foo {
# Attribute
[Dispose()]$v = Get-Connection
# or statement
use $v = Get-Connection
}
Semantics: Dispose is called when the scope of the variable is disposed.
This is important when the script/function is dot sourced. The script/function may complete, but the variable is now in scope.
If going with a block instead of an annotation - it's important to consider dot sourcing when determining when the block is called, possibly disallowing dot sourcing if such a block exists.
@lzybkr Variables don't (currently) implement IDisposable. Unless you changed something, as far as I they just get GC'd (of course we could change this). Anyway, your proposal is essentially what I was proposing as Register-ForDispose
which would register an object (not a variable) for disposal once the command/scope exits. Do you see there being an implementation advantage to making it a language element or attribute over a command?
@iSazonov
How close the DB connection without explicit code?
Assuming the object implementsIDisposible
, simply adding it to a list of objects to dispose on scope (or command) exit is all that would be needed.
@markekraus In commands today, it's normal to dispose resources in the end
block. If we just wanted to handle the exception case, then having a trap
block makes sense. But perhaps changing the pattern to _always_ deal with disposable resources in the Dispose
block would be better.
@guitarrapc The pattern in .NET is IDisposable
so naming the block Dispose
aligns with existing practise. Also, to my mind, Dispose
not quite the same semantic as Golang's defer
.
FWIW I have a proof-of-concept that seems to ensure disposal using out-of-the-box PowerShell control flow. It seems to be robust to downstream throw
, break
, continue
, $PSCmdlet.ThrowTerminatingError()
, and Ctrl-C. Usage is described in the usingObject {}
section of this stackoverflow answer.
There's also a variant I've been calling Afterward {}
that can be used to perform arbitrary cleanup from a scriptblock for things that don't implement IDisposable
.
I don't mean to dissuade language support for this sort of thing. Rather, I'd like to suggest that whatever language support is added should probably be an improvement over what is already possible with a library function and design pattern.
@alx9r But how to call it with pipeline command? Your code imply only one uninterrupted block of user code, but that is not true when command accept pipeline input.
@alx9r This scenario is for when a resource _internal_ to the command is opened in Begin
, used in Process
and then closed in End
. The resource is not externally visible so you can't use a using-object
approach to wrap the command from the outside. From the inside, you'd have to wrap the code inside each block introducing another scriptblock dispatch for every begin/process/end call which is significant overhead. Having a Dispose
block is just simpler and more efficient. That said, having a Using-Object
cmdlet is clearly a good idea that we just never got around to. Perhaps you can open a separate issue to track adding this? Thanks.
@BrucePay @PetSerAl
I think I understand the distinction now.
I think you are describing this sort of function
class d : System.IDisposable {
Dispose() { Write-Host 'Dispose' }
}
function f {
param ([Parameter(ValueFromPipeline)]$x)
begin {
$d = [d]::new() # d should be disposed in every case
# when this function goes out of scope
}
process {
# use $d with $x here...
}
end {
# and/or here
}
}
used like this
1,2 | f | UsesBreakKeyword
in a scenario like the command line where you can't easily refactor to
usingObject { [d]::new() } {
1,2 | f $_ | UsesBreakKeyword
}
Is that correct?
How close the DB connection without explicit code?
Assuming the object implements IDisposible, simply adding it to a list of objects to dispose on scope (or command) exit is all that would be needed.
Question is just how an user can implement this IDisposible
in script? If this already implemented internally in the object it is enough for us to have $localdisposable:conn
. Otherwise the user should attach a script block with a dispose code to the variable.
BEGIN {
$localdisposable:conn = New-Object System.Data.SqlClient.SqlConnection
$con.Dispose = { $this.Close() }
}
END {
# Implicitly $conn.Dispose()
}
I'd like to point out that not all things in PowerShell that required cleanup in these situations are objects/types that have a Dispose
method. There area many modules out there where the session state is handled inside the module that exports the command the user is calling.
function Get-Foo {
[CmdletBinding()]
param ()
begin {
Connect-Foo
}
process {
<# doo stuff with foo #>
}
end {
Disconnect-Foo
}
}
In that case, there is no object to dispose but the connection still lingers if an exception kills the pipeline.
@BrucePay That's what I was thinking. The End {}
block would become a place for completing Process {}
operations (for example, when the pipeline is collected to perform operations such as measure) and the Dispose {}
block would become the de facto place to close connections and perform cleanup as its ensured to run.
What is not clear to me is how to handle output and errors from the Dispose {}
. I'm inclined to think output in the Dispose {}
block should be ignored (similar to how PowerShell Class methods are not leaky). I definitely don't think the output from this block should sent to the output stream, but I'm sure this will cause confusion. I'm also certain users will want to be able to collect output from this block somehow. Perhaps a -DisposeVariable
common parameter could be added to allow for collection of output from the Dispose {}
block, similar to -OutVariable
. And, I'm not sure if Errors/exceptions in the Dispose block should be populated in -ErrorVariable
.
@BrucePay - I don't think PSVariable
should implement IDisposable
- but as an implementation detail, maybe SessionStateScope
would.
A language construct offers a stronger promise than a command, e.g. it can't be overridden, it could enable better code generation.
@markekraus - The disposable pattern isn't used in PowerShell today because it doesn't really work. I think the correct question to ask is - is another pattern needed if PowerShell had clean support for disposable objects?
The right answer might be - you need two things - better disposable support and a way to register arbitrary cleanup code - this would be analogous to C# using
and finally
statements.
@lzybkr That's a fair point.
I think that given the history, the pattern of not returning Disposable objects will continue and it would certainly persist in older code. I can update my code to make use of disposable objects, but I may not be able to update someone else's code I depend on. I'd be left with same situation if only better support for disposable objects was added.
One benefit of adding a new pattern would be the ability to wrap 3rd party code. I suppose that could still be done by warping 3rd party functions in PowerShell classes that implement IDisposable. However, my experience is that the majority of PowerShell users are not comfortable with or interested in using PowerShell classes. (I make it a habit to ask if people are using classes, why/why not, and what for). I suspect the comfort level with a Dispose {}
would be higher, but that is just my guess.
I agree, though, it probably does need both.
I'm wondering what should happen if .Stop()
is called on the runspace during the obtain-and-assign step for the resource. Using @lzybkr's hypothetical [Dispose()]
, for example:
function foo {
[Dispose()]$v = Get-Connect # what if .Stop() is invoked during this call?
}
Is Get-Connect
's execution allowed to continue on .Stop()
until the assignment is complete? I _think_ the repro in #7155 demonstrates that, currently, an unfortunately-timed .Stop()
can allow resources to be allocated without the scope that would do the cleanup ever getting a handle to it. Could that be overcome by [Dispose()]
too?
If Stop
is called while Get-Connect
is running, then Get-Connect
is responsible for cleaning up anything not written to the pipeline. Once the object is written to the pipeline, the downstream command or script takes on the responsibility.
There is a window where pipeline objects live only in the pipeline - in which case it's the engine's responsibility to dispose and should not require a new feature to properly dispose - it should just happen, but it sounds like that doesn't actually work. I don't recall the history of that code to know if it was a design decision or an oversight, my guess is the later.
I believe that addressing this would be extremely valuable. IMHO it is one of the roughest edges on the language and I don't believe a real solution is possible without language support.
Even trying to explain all of the considerations involved to someone without a fairly deep knowledge of PowerShell is quite difficult. Experienced C# developers often struggle just as much as beginners with little programming experience because you have to think about all of the ways a function might stop executing, terminating vs. non-terminating errors, etc.
I'd vote for a new named scriptblock that was guaranteed to execute regardless of how the command stopped. I could imagine some kind of defer-like thing where you pushed scriptblocks onto a stack as a solution, but that seems more race-y, verbose and error prone.
I'd also love some using-like sugar to cut down on the boilerplate involved in handling cleanup in general (even worse with strict mode on), but that is something that might be doable without explicit language support.
One scenario that I don't think has been explicitly called out is a desire to do conditional cleanup. In many cases, I want to have a "convenience" ParameterSet that takes something like a computername or a path and an "optimized" ParameterSet that takes some existing connection/handle/thing that requires cleanup. In the convenience case, it's the function's responsibility to create the necessary disposable and then deterministically clean it up when it's done. In the optimized case, the function does not take ownership of the disposable, it just uses it and is expected to leave it open. The convenience set is for use in a shell or one-shot operations, the optimized path is used to amortize the setup cost for the disposable in more complex scenarios where it might be used for multiple things.
Some of the suggestions (like an attribute) seem less compatible with this scenario. Here's a contrived example, don't read too much into this being a CimSession:
function Get-Example {
[CmdletBinding(DefaultParameterSetName = 'ByComputerName')]
param(
[Parameter(Position = 0, ParameterSetName = 'ByComputerName')]
[string]
$ComputerName = $env:ComputerName,
[Parameter(Mandatory = $true, ParameterSetName = 'ByCimSession')]
[CimSession]
$CimSession
)
try {
# Normalize things so the rest of the body can assume/work with $CimSession
if ($PSCmdlet.ParameterSetName -eq 'ByComputerName') {
$CimSession = New-CimSession -ComputerName $ComputerName -ErrorAction Stop
}
Get-CimInstance -ClassName Win32_ComputerSystem -CimSession $CimSession
}
catch {
# Using a catch as a glorified goto to handle a terminating error initializing the CimSession
Write-Error -ErrorRecord $_
}
finally {
# If we "own" the CimSession, clean it up.
if ($CimSession -and $PSCmdlet.ParameterSetName -ne 'ByCimSession') {
Remove-CimSession -CimSession $CimSession
}
}
}
Correct me if I'm wrong but C# Cmdlets don't have this issue because StopProcessing()
is called when a terminating error is encountered or the user presses CTRL+C? If that is the case advanced functions should have a stop
block added to the existing begin
, process
, end
code blocks for feature parity with C# Cmdlets.
[Cmdlet(VerbsCommon.Get, "Foo")]
public class GetFoo : Cmdlet
{
protected override void BeginProcessing()
{
// open connection
}
protected override void ProcessRecord()
{
// do stuff
}
protected override void EndProcessing()
{
// close connection
}
protected override void StopProcessing()
{
// close connection
}
}
function Get-Foo {
[CmdletBinding()]
param ()
begin {
# open connection
}
process {
# do stuff
}
end {
# close connection
}
stop {
# close connection
}
}
I think it would be preferable to have something else. That would lead to a lot of duplicated code, since StopProcessing isn't called during normal operation.
A proper finally{}
type block would be a much better implementation in this instance, I think.
@tnieto88 AFAIK, StopProcessing
is usually called in different thread, so that it can send interrupt signal to the work in the main pipeline thread. Cleaning up resources here can easily cause concurrent access problem, if main thread is not yet done with them. Also, PowerShell's Runspace
is single-threaded, so defining behavior of PowerShell's variant of StopProcessing
is not trivial task.
@vexx32 I don't think there would be any major amount of duplicated code if the author placed the close connection logic into a helper function called by both the end
and stop
block. It would also allow the author to handle normal post processing and a stopped cmdlet differently.
Correct me if I'm wrong but your recommendation is to have the finally
block in a try/catch
get executed even though the cmdlet is told stop. If that assumption is correct I believe that would require the connection to be built and torn down on each pipeline object creating unnecessary overhead. It would remove the ability to have the connection created in the begin
block. Interact with the resource one or many times in the process
block. Then close the connection in the end
block.
function Get-Foo {
[CmdletBinding()]
param ()
begin { }
process {
try {
# open connection
# interact with resource
}
catch {
# do something
}
finally {
# gets called even if cmdlet is stopped
# close connection
}
}
end { }
}
So you want a helper function... For a function. Potentially every function that needs to deal with resources that may require disposal work other cleanup steps. Call me crazy, but that's still far too much duplicated code.
And no, I'm not talking about try/finally. The reason we're discussing these kinds of solutions is precisely because that's insufficient. Whatever you wanted to name it, it would need to effectively be a finally-type black that effectively wraps the whole function. If at any point in the function a terminating error is thrown or StopProcessing is called, it would be triggered.
I haven't put a lot of thought into it, but my gut reaction would be to oppose mirroring C#'s StopProcessing
facility in PowerShell script, because TL;DR: StopProcessing
is annoying.
The trouble with StopProcessing
was already touched on by PetSerAl: you can't call any commands that interact with the pipeline in StopProcessing
. If you did, for example by calling Write-Verbose, a) it would throw, b) if it were fixed to just be ignored, it would be unexpected (you can already envision the Issues that would be filed: "Write-Verbose
inexplicably does nothing when called from StopProcessing"). So your shared function would have to have a switch to enable StopProcessing
-safe operation... which seems terrible, given that the shared function might want to call other shared functions, etc., until you've poisoned whole libraries.
In production code that I've written where I really, really needed to be able to properly handle cancellation and cleanup, I went to the length of using C# and P/Invoke to intercept Ctrl+c signals and completely handled cancellation myself (the limitations of StopProcessing
were too onerous for me).
This was called out very early in the thread, but regarding this comment from earlier on this thread, is Dispose
the right name to use here?
For some people, the Dispose
name might imply that the script block would be run with garbage collection. In C#, Dispose
methods are automatically invoked as part of garbage collection, or manually if you don't want to wait. In PowerShell, the Dispose
block would be automatically invoked on command completion, never manually, and never as part of garbage collection. Would that inconsistency cause some confusion? Maybe not, but coming from C#, that name originally made me wonder how to invoke it manually if I didn't want to wait for garbage collection, which made me wonder if it would cause confusion for users going from PowerShell to C# as well.
Using Dispose
also means that the new parameter name added to ForEach-Object to provide the same functionality would be named DisposeScript
instead of just Dispose
, since you cannot create a property with the same name as a method in the underlying C# class (although it would be possible to extend the Parameter attribute to allow for a name that is different than the underlying property name if it was wanted badly enough). Regardless, that's not a huge issue, but I wanted to mention it here.
I don't have a great alternative name, but I wanted to ask the question to avoid potential confusion with C#'s Dispose method anyway.
A few of the names we tossed around when discussing this on Slack are: Cleanup
, Finalize
(could be confused with finally
), Close
(but there is no Open
).
Personally I think dispose
is the best option, maybe cleanup
as a close second (really that's just "describing what it _does_" vs "describing what it's _for_"). I think I prefer dispose
as it refers to what the pipeline processor _does_ with it, same as begin
, process
, and end
.
However, I think that whatever name is picked, it's only a matter of time before "it was always that way" and the actual name will end up being essentially unimportant, as long as it's distinctive.
I think dispose
implies the right intent. It's important to note that because this block will always execute, it should be reserved for only critical clean up. If that isn't communicated properly then commands that essentially ignore Ctrl + C could become more common.
Dispose name might imply that the script block would be run with garbage collection
When the Dispose
pattern was added to C#/.NET back in the early days, it was because folks wanted a way to clean up unmanaged resources themselves. They did not want to rely on non-deterministic garbage collection.
I think the confusion about this arises because both the IDisposable.Dispose()
method and a type's finalizer typically call a shared bit of code with the name/sig Dispose(bool disposing)
. It is a type's finalizer that is invoked directly by the garbage collector.
Thanks guys. Question about the naming withdrawn.
I have a working implementation of this that I can PR as soon as the PRs for the formatting changes in the associated files go in. 🙂
I've done some pretty thorough manual testing, currently the only thing remaining broken is tab completion for command parameters inside a dispose{} block. Not sure why, still looking at that.
Beyond that, just gotta write up some pester tests for it and it'll be about ready to go. :sparkles:
EDIT: @SeeminglyScience helped me figure out what the heck was up with tab completion. Pester tests are all that remain.
I just watched the PowerShell Community Call from April, 16th 2020.
There seems to be a concern about users who wants to stop a command line immediately, without having to wait for a cleanup to be done.
Why not just create a new shortcut association for the stop&cleanup?
For example:
@fullenw1 if there is such a desire, it isn't related to this intrinsically; if it's something that's important to you, I'd suggest opening a new issue.
Even without this being implemented, there are a great many scenarios in PowerShell where the user is unable to cancel an operation. @SeeminglyScience and I mentioned a few of those in the RFC discussion.
If we would like something like that to be implemented, it would need to be implemented at a much wider scope than this suggestion applies.
To play devil's advocate for a moment, I would generally advise against allowing users to cancel every operation. Some operations are safe to cancel; cancelling others may cause memory leaks, unstable behaviour, or lead to the shell crashing completely.
_That_ is why a dispose{} block is needed in the first place; without it, script module authors have to accept that their code may at any time become unstable, or find obscure and complicated workarounds to ensure that users can't cancel critical cleanup operations.
Some operations are safe to cancel, others... may result in instability. 🙂
Hi @vexx32,
Personally, I am one of those people who can patiently wait a few minutes, and even more, for the system to give me the hand back :)
I was just trying to help you because you raised the question during the community call about this long-standing feature request...
However, @joeyaiello replied from 33'10" "to 34'15" that there could be some concern regarding users who want CTRL + C to stop the process immediately as it has always done.
While we can be technically accurate (like you @vexx32), we must also remember (like @joeyaiello) the wast majority of users.
Because they never read release notes and don't follow any rich PowerShell blog (like yours), they will suddenly think that this new version of PowerShell is constantly freezing because it doesn't give the hand back immediately as former versions did.
Thus, they will probably always kill the host instead of waiting for the Dispose block to finish its work while cursing this new version.
This is why I made this suggestion, which could be a possible compromise and accelerate the progress of this feature request...
But let's not waste more time on this. Your solution is perfect for an eager to learn and a patient guy like me :)
The solution I'm proposing won't suddenly add delays to everything, so my apologies if I gave that impression. Any such delays would already be happening. As script authors pick up the feature, some small amount of delays may start to crop up depending on how they choose to implement their disposal routines, if they choose to add any.
I think for the most part there won't be a noticeable added delay in the vast majority of cases, even when authors do start to make use of it. There will, of course, always be the odd exception from time to time, I'm sure. 🙂
Because they never read release notes and don't follow any rich PowerShell blog (like yours), they will suddenly think that this new version of PowerShell is constantly freezing because it doesn't give the hand back immediately as former versions did.
They are significantly more likely to run into an existing C# cmdlet that forgot to implement StopProcessing
or a PowerShell function that's blocking on a dotnet API call.
Honestly the majority of cases where a dispose
block will take a lot of time, it'll be because they are using a dotnet API and that API call is blocking. If that's the case, there's no solution. The only place that a extra "really cancel" function would do anything positive is if the writer was explicitly running an infinite loop in dispose
for some reason. In all other cases it's just a race condition that will occasionally leave resources open to save a few milliseconds.
this term "Dispose" should be replaced with a more "powershelly" one.
The main idea of PowerShell is following English to help beginners. How can they understand "dispose"? I tried to translate to Russian and get some useless cases including "allocation"! This term will mislead beginners. I found that translations for "CleanUp" and "Finally" are much better. I guess for English men too.
finally
is an existing keyword that I would very much hesitate to override. It also doesn't really communicate the actual purpose of the feature.
Cleanup
is closer and could perhaps be used, but personally I feel it's more appropriate to introduce the existing concept of Dispose
rather than to confuse those already familiar with that with a different term.
Either way, this concept is going to be a little unfamiliar to folks who've lived primarily in PowerShell land. I would prefer to bring those familiar and unfamiliar closer together rather than further apart, and any gaps that need to be filled will be filled with documentation. Either way, we'd need to get some documentation written; I'd think it best that it teach the existing concept properly rather than introduce a wholly new idea that more or less mirrors an existing concept anyway.
finally is an existing keyword that I would very much hesitate to override.
It is another context. No conflicts at all. (If you think so, then you should be more worried about $begin = 1
😄 )
I remind you of the __fundamental PowerShell concept__ and the references to "documentation" and "teach" are not correct in the context.
This concept is something that has largely been ignored in PowerShell up till now.
There isn't anything comparable to pull from. This is going to be a new concept for PowerShell folks. We're better off following established patterns in .NET than trying to create an entirely new concept that can be confused with the existing ones already established in the .NET ecosystem.
As a very timely example, I'll just quote Immo here:
https://twitter.com/terrajobst/status/1252504713762766848
Naming conventions for APIs shouldn’t be innovated in. Otherwise you create a mess. You get one shot in V1. If you fucked it up, too bad.
I don't feel strongly, but I might suggest picking a keyword like cleanup
specifically because it is different. PowerShell is already full of keywords that look like those in other languages but behave a bit differently. Having something look familiar to a C# developer and then making it behave differently doesn't do anyone any favors (see try/catch).
As for delays, there's a million things you can do already that will hang your shell and cause it not to respond to a ctrl+c. This seems fairly unlikely to make that much worse. Dispose()
methods that block for a meaningful amount of time are not that common.
@mattpwhite I don't agree myself, but I've been know to be stubborn. Could you add that feedback to the rfc discussion? It might be worth discussing more thoroughly :slightly_smiling_face: https://github.com/PowerShell/PowerShell-RFC/pull/207#discussion_r393935727
French dictionnary has 7 definitions for the verb, one is to prepare someone to death.
@vexx32 For PowerShell scripts UX our common approach is always to priority PowerShell concepts over C#.
one is to prepare someone to death
Oh... It could fall under MSFT compliance requirements.
@iSazonov no, it's a common word used for a lot of meaning. i spoke only about one usage in french, i dont know for english. It depends of the context. Like Prepare an object to be destroyed as a final goal. In restaurant, they 'dispose' the table because the final goal is to serve the customer.
@iSazonov
@vexx32 For PowerShell scripts UX our common approach is always to priority PowerShell concepts over C#.
As I said:
This concept is something that has largely been ignored in PowerShell up till now.
This concept is not something that PowerShell (currently) has any tangible reference to. The closest you can find is directly calling Dispose()
methods on disposable resources. Unless there's a truly pressing reason (outside of personal preferences and nice-to-haves) to change the name of the existing concept, I see no reason for PowerShell to be a special snowflake here.
The idea was expressed by @BrucePay, then in another form @lzybkr, then @alx9r.
I would not want to lose this idea and put it in https://github.com/PowerShell/PowerShell-RFC/pull/250
@iSazonov appreciate the writeup!
I had a bit of a read just a little bit ago, but it seems to me the concept is flawed from the beginning. Automatic resource management is effectively trying to predict what users want from the code, which IMO is never a great way to try to create a reliable experience for users.
I'll add more detailed comments to the RFC itself.
@vexx32 The RFC comes from the fact that I like when PowerShell makes _smart_ things :-)
it seems to me the concept is flawed from the beginning
As joke - it seems to you :-) Do you really deny the garbage collector in C#?!
Not at all. But I think us attempting to re-create a garbage collector when there's already one available in .NET is a bit of wasted effort, frankly. Can we improve how we work with it? To a point, sure! But trying to do too much there is liable to create an incredibly confusing UX.
We need to look script examples where it could be a problem.
There is nothing that says both of these cannot be done.
I'd really (speaking as someone who prefers language to change slowly!!) prefer to not wait another year or two to have a dispose-type capability in PowerShell. It's an enormously powerful concept especially for network, database, and file-system (i.e., most) scripts that are long-running and robust.
So I'd hate for @vexx32 's implementation to get derailed by a similar-but-not-identical resource-tracking proposal from @iSazonov .
I don't necessarily see a problem with scope based disposal, but it's gotta be explicit. For instance, a nested/non-public function that creates a database connection for use in a larger function/module is gonna break.
For instance, a nested/non-public function that creates a database connection for use in a larger function/module is gonna break.
It is developer duty to make this correctly. If it is a module developer should set "DisposeKind.AtModuleUnload" for the connection object. Now consumer gets PowerShell magic - automatically closing the connection.
I would not want us to dive into the implementation details - it is clear that this is not easy.
Yeah, I'm not sure I see a lot of value in making this a "magic" sort of implementation. If there is desire for that kind of functionality on module unload, authors can already register tasks for module unload. I wonder if we'd be perhaps better off making that functionality more easily accessible in general rather than for a specific targeted purpose you're suggesting..
It is developer duty to make this correctly. If it is a module developer should set "DisposeKind.AtModuleUnload" for the connection object. Now consumer gets PowerShell magic - automatically closing the connection.
If I do this:
$rs = & {
$temp = [runspacefactory]::CreateRunspace()
$temp.Open()
$temp
}
How am I as a script writer supposed to know that $rs
is now closed. Even as someone who would be well aware of the change, it would be incredibly difficult to troubleshoot.
Plus, you can assign objects you don't own to variables:
$currentRunspace = [runspace]::DefaultRunspace
I would not want us to dive into the implementation details - it is clear that this is not easy.
Yeah psuedo code is fine of course. To be clear though, if automatic disposal is opt-out then this would be an incredibly large breaking change.
I believe PowerShell can be smart enough to understand that an object is published in pipeline and shouldn't be destroyed. Can? :-) It is mentioned in RFC too. In fact, we can come up with any scenario and find a way to get around it or vice versa to fix it.
I believe PowerShell can be smart enough to understand that an object is published in pipeline and shouldn't be destroyed.
It's not going to be that easy, you'd need full escape analysis
Plus you'd have to intimately familiar with all of those restrictions, if not then troubleshooting why your object breaks in script A but works great in script B will be very difficult.
I don't see why it has to be automatic. Adding another use to the using
keyword for assignments ala using $b = [runspacefactory]::CreateRunspace()
seems like the move to me. If you don't explicitly request disposal, waiting until it hits the finalizer is probably fine. If not, it's the developers duty to quickly mark the assignment with using
.
Most helpful comment
It's not going to be that easy, you'd need full escape analysis
Plus you'd have to intimately familiar with all of those restrictions, if not then troubleshooting why your object breaks in script A but works great in script B will be very difficult.
I don't see why it has to be automatic. Adding another use to the
using
keyword for assignments alausing $b = [runspacefactory]::CreateRunspace()
seems like the move to me. If you don't explicitly request disposal, waiting until it hits the finalizer is probably fine. If not, it's the developers duty to quickly mark the assignment withusing
.