Cli: v2 feature: Args (an alternative to Flags)

Created on 24 Feb 2020  ยท  17Comments  ยท  Source: urfave/cli

Checklist

  • [x] Are you running the latest v2 release? The list of releases is here.
  • [x] Did you check the manual for your release? The v2 manual is here
  • [x] Did you perform a search about this feature? Here's the Github guide about searching.

What problem does this solve?

Undocumented arguments can be a burden and forces developers to generate write their own ArgsUsage

Solution description

Add a new field to App and Command called Args of type []ArgSpec which is an interface like Flag. There would many structs that implement this interface, like those for parsing flags.

type ArgSpec interface {
    // Name returns the name that this arg will be accessed by
    Name() string
    // Required returns true if this arg is required 
    Required() bool
    // Priority means the order (highest to lowest) which are parsed after the required ones
    Priority() uint
    // Min returns the minimum args to be parsed
    Min() uint
    // Max returns the maximum args to be parsed, if 0 then any number can be parsed 
    Max() uint
}

Along with this there are several other rules that will result in a panic (always no matter what the provided args to be parsed are)

  • All Optional fields must either come before all required fields or after all required fields
  • At most one Max() == 0 arg per field

Args never mean anything for sub-comands.

Args will be accessed in the same manner as Flags currently, they are parsed after flags so will override them (that is how one can have a EnvVar be the default to an arg)

Implementation

If there are any []ArgSpec then all provided args must be used, otherwise a error is printed as if a required flag was not supplied.

This also means that the Arg(n) and Args() won't return anything after parsing args.

Describe alternatives you've considered

  • Do nothing, this a hard problem to solve and this solution is complicated to use.
arev2 help wanted statuconfirmed

All 17 comments

I think this idea will need a few iterations - but it's off to a good start! I think we should really consider something like this ๐Ÿ‘

Hey, I think this is a good idea, a few thoughts from my side:

  • I think we should call them positional arguments
  • Positional arguments can only apply to commands
  • They should support specific value sets, like we do for flags
  • I think we do not need an index if we inherit that information from a slice index (if app.PositionalArgs = []cli.PositionalArg{}
  1. I like the name positional arguments
  2. Would that mean that an application like ls would not be able to be written using this feature?
  3. And specific types?
  4. I agree we would not need indexes anymore, I was thinking of having them accessible by name using the *Command.<Type>() methods.

Having done some more preliminary investigation by doing a mock implementation I have found the following to be necessary:

  • Remove "priority" and have it strictly left to right
  • Only the last "required" pos-arg can be of slice + unlim length
  • There is no parsing of slice args based on commas, just on the args specified whitespace (namely 1.0 2.0 3.0 would be the three elements of a float64slice
  • All previous positional args get filled before going on to the next (important for slices)

I'm marking this as help wanted for someone to work on the design, since it'll be a few steps before this is ready for implementation

Nice proposal!
Just some drive-by-bikeshedding from a random user:

Wouldn't naming the interface Args instead of ArgSpec fit better with the existing Flag interface? Maybe even Arguments or ArgumentGroup, since this wouldn't be typed often enough to justify sacrificing readability for "writeability".

Also, wouldn't it be possible to implement Required: true via Min: 1? This would leave us with a leaner interface:

type ArgumentGroup interface {
  Name() string
  Min() uint
  Max() uint
}

And as a bonus we wouldn't have to deal with the corner-case of Required: true + Min: 0.

Implementing something like mv or cp would be a matter of:

app := &cli.App{
    Name:    "cp",
    Arguments: []cli.ArgumentGroup{
        cli.StringArguments{
            Name: "paths",
            Min: 2,
        },
    },
}

(assuming cli.StringArgument could default to Max:0)

But I wonder a bit what the main use-case for multiple argument groups would be.
The relatively common cmd [foo [bar]] idiom could be implemented using multiple groups with something like:

app := &cli.App{
    Name:    "cmd",
    Arguments: []cli.ArgumentGroup{
        cli.OptionalStringArgument{
            Name: "foo",
        },
        cli.OptionalStringArgument{
            Name: "bar",
        },
    },
}

Where cli.OptionalArgument is a cli.ArgumentGroup with Min() โ†’ 0 and Max() โ†’ 1 (assuming such a helper would be acceptable).

But OTOH, the same could be accomplished with a single argument group and simple slice logic. I'm not sure the increase in complexity is worth it, just to get a more comfortable access pattern for the individual arguments. :thinking:

We could also allow for some sort of regular expression based system where the Argument has a "is valid" method

Whatever we do for this feature request (eg. https://github.com/urfave/cli/issues/1074), it should also most likely solve this request too => https://github.com/urfave/cli/issues/1069

Current idea formulation:

Arguments have at least the following:

  • func PlacementName() rune: this value is the value that must be used in the argument placement regex. It is a singular rune because that makes the regex both faster and not ambiguous. If any of these rune's collide then a panic will happen at run-time.
  • func IsValid(arg string) bool: this function is used to build up the placement string for the regex matching. This must be required to be stateless and deterministic. It should also be cheap to run, since it will be run multiple times.
  • func DocName() string: this function is used in the generation of documentation.

Using this idea:

Arguments will be provided in some manner. The ordering doesn't matter at all as all arguments will be "placed" (like flags but I think it should be required).

With the arguments, a "placement string" must also be supplied. The syntax is a subset of full regular expressions.

  • All the counting formulations are allowed: one, ?, *, +, {n, m}, etc....
  • Only "single character" character classes or | not [...] nor [^...].
  • No escape sequences or ^ or $ (those are assumed always)
  • No regex flags
  • No whitespace
  • The only characters allowed are those that are returned by the PlacementName() methods on the arguments for a specific command.

The following is then done in order:

  1. The set of "possible placement" strings are constructed based on the IsValid() method and the "placement string". Creating a string that looks like a concatenation of possible values of that argument, for example if arg[1] could either be A or B the resulting substring would be [AB]. If arg[2] could either be A or C or H then the resulting substring would be [ACH].
  2. The placement string is transformed into matches that match those substrings for each type. So for the placement of A+ it would be transformed into ((?:\[[^\]]*(?:A)[^\]*]*\])+). This way each substring can be match for a specific type. This is also why the placement name must be single characters, that way this regex is unambiguous. This is called the "matching string".

    • Those characters are implementation details, the documentation generation should use the argument's display name.

    • Example 2: (A|B)+: ((?:\[[^\]]*(?:A|B)[^\]*]*\])+). If there is some ambiguity (for example an argument gets assigned the pps of [AB] then the precedence is left->right.

  3. Since each argument's "type" is represented by a string of the form A(|B)* (where A and B are any character (and the repeats are different) once a match has been found the ordering from left to right is used to choose which type the match really refers too.

Example:

  1. placement string: AB{1, 3}A
  2. args:

    • A: Int32

    • B: string

  3. input: 31 h 2 "hello world" 789
  4. Possible placement strings: [AB][B][AB][B][AB]
  5. Matching string: ^((?:\[[^\]]*(?:A)[^\]*]*\]))((?:\[[^\]]*(?:B)[^\]*]*\]){1,3})((?:\[[^\]]*(?:A)[^\]*]*\]))$
  6. Which gives the following substring matches:

    1. [AB][A][BC][B][AB]

    2. [AB]

    3. [B][AB][B]

    4. [AB]

  7. Then each substring match (the part enclosed by [...]) are checked to decide which type to use. In this example, we find it to be: A, B, B, B, A.
  8. The argument parsing and placing is done

This issue or PR has been automatically marked as stale because it has not had recent activity. Please add a comment bumping this if you're still interested in it's resolution! Thanks for your help, please let us know if you need anything else.

I'm still super interested in seeing this happen ^^

This issue or PR has been bumped and is no longer marked as stale! Feel free to bump it again in the future, if it's still relevant.

I would really like to see this too

This issue or PR has been automatically marked as stale because it has not had recent activity. Please add a comment bumping this if you're still interested in it's resolution! Thanks for your help, please let us know if you need anything else.

Closing this as it has become stale.

๐Ÿ‘€

This issue or PR has been bumped and is no longer marked as stale! Feel free to bump it again in the future, if it's still relevant.

As far as I remember, we would love to see this but it needs more design work ๐Ÿ™๐Ÿผ

Was this page helpful?
0 / 5 - 0 ratings

Related issues

lynncyrin picture lynncyrin  ยท  22Comments

skillful-alex picture skillful-alex  ยท  15Comments

Nokel81 picture Nokel81  ยท  14Comments

VirrageS picture VirrageS  ยท  17Comments

marwan-at-work picture marwan-at-work  ยท  24Comments