Fd: Add option for natural ordering (a la `-v` in `ls`)

Created on 13 Sep 2018 · 15 Comments · Source: sharkdp/fd

It would be really, really handy for scripting to have a flag to arrange output using natural ordering.

Given the following tree:

```
.
├── file-1
├── file-11
├── file-12
├── file-2
├── file-22
└── file-3

0 directories, 6 files
```

The output of `ls -1`/`fd` is currently:

```
file-1
file-11
file-12
file-2
file-22
file-3
```

The proposed output for natural order (`ls -1v`, proposed to be something like `fd -v`) would be:

```
file-1
file-2
file-3
file-11
file-12
file-22
```

If you need a dependency for this, rust-natord is small and seems like it could fit the bill.
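To make "natural ordering" concrete, here is a minimal dependency-free sketch of such a comparison. It is not how `fd`, `ls -v`, or rust-natord implement it; `natural_cmp` and `strip_zeros` are hypothetical names, and the sketch only handles runs of ASCII digits.

```rust
use std::cmp::Ordering;

/// Compare two names so that runs of ASCII digits are ordered by numeric
/// value and everything else is ordered byte by byte.
/// Note: names that differ only in leading zeros ("file-01" vs "file-1")
/// compare as equal in this sketch.
fn natural_cmp(a: &str, b: &str) -> Ordering {
    let (a, b) = (a.as_bytes(), b.as_bytes());
    let (mut i, mut j) = (0, 0);
    while i < a.len() && j < b.len() {
        if a[i].is_ascii_digit() && b[j].is_ascii_digit() {
            // Consume the whole digit run on both sides.
            let (start_i, start_j) = (i, j);
            while i < a.len() && a[i].is_ascii_digit() {
                i += 1;
            }
            while j < b.len() && b[j].is_ascii_digit() {
                j += 1;
            }
            // Without leading zeros, a longer run is a larger number;
            // runs of equal length compare digit by digit.
            let x = strip_zeros(&a[start_i..i]);
            let y = strip_zeros(&b[start_j..j]);
            let ord = x.len().cmp(&y.len()).then_with(|| x.cmp(y));
            if ord != Ordering::Equal {
                return ord;
            }
        } else {
            let ord = a[i].cmp(&b[j]);
            if ord != Ordering::Equal {
                return ord;
            }
            i += 1;
            j += 1;
        }
    }
    // Whichever name still has characters left sorts last.
    (a.len() - i).cmp(&(b.len() - j))
}

/// Drop leading '0' bytes from a run of digits.
fn strip_zeros(digits: &[u8]) -> &[u8] {
    let first = digits.iter().position(|&d| d != b'0').unwrap_or(digits.len());
    &digits[first..]
}
```

Sorting the six names from the tree above with `names.sort_by(|a, b| natural_cmp(a, b))` puts `file-2` and `file-3` ahead of `file-11`, matching the proposed output.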

question

ErichDonGubler

All 15 comments

As a workaround, you can use `fd | sort -V`.

tmccombs on 13 Sep 2018

@ErichDonGubler Thank you for your feedback.

I think this is better left to external tools, as @tmccombs suggests. Another way is to use `xargs` to do the sorting via `ls -1v`:

▶ fd -0 | xargs -0 ls -1v
file-1
file-2
file-3
file-11
file-12
file-22

sharkdp on 13 Sep 2018

Also, see #196 and #159.

sharkdp on 13 Sep 2018

@sharkdp: I don't really have anything to add after seeing the discussion you've linked. You definitely call the shots! I wonder if there IS a case where there's significant enough gain by using internal sorting (which would NOT be the default, of course -- I agree with the opinion you expressed there in #159). Let me see if I can find some numbers that form a convincing case -- if I can't find something in the next few days, I'll happily close this. :)

ErichDonGubler on 13 Sep 2018

@ErichDonGubler Thank you for the feedback.

I'd definitely be interested in hearing use cases for such a feature! However, I am still following the "80% of the use cases" philosophy with fd, as mentioned in the README.

sharkdp on 16 Sep 2018

I'm going to close this for now. Feel free to comment here and I can reopen the ticket.

sharkdp on 5 Oct 2018

Windows user here to complain about order inconsistency between launches.

Let’s say I want to compute hashes like this: `fd -tf -d 1 -x rhash --sha256`

| Expected order | Launch 1 | Launch 2| Launch 3 |
| ------------- | ------------- | ------------- | ------------- |
| AUTHORS | AUTHORS | AUTHORS | AUTHORS |
| ccguess.1 | ccguess.1 | ccguess.1 | ccguess.1 |
| ccguess.html | ccguess.html | ccguess.html | ccrypt.1 |
| ccrypt.1 | ccrypt.1 | ccrypt.1 | ccguess.html |
| ccrypt.html | ccrypt.html | ccrypt.html | ChangeLog |
| ChangeLog | ChangeLog | ChangeLog | ccrypt.html |
| COPYING | COPYING | COPYING | COPYING |
| cygwin1.dll | cypfaq01.txt | cypfaq01.txt | cypfaq01.txt |
| cypfaq01.txt | NEWS | NEWS | cygwin1.dll |
| NEWS | ps-ccrypt.el | cygwin1.dll | NEWS |
| ps-ccrypt.el | cygwin1.dll | ps-ccrypt.el | ps-ccrypt.el |
| ps-ccrypt.elc | ps-ccrypt.elc | ps-ccrypt.elc | ps-ccrypt.elc |
| README | README | README | README-WIN |
| README-WIN | README-WIN | README-WIN | README |

sergeevabc on 9 Mar 2019

@sergeevabc: Was there something in the documentation that gave you the impression that a certain order of output was guaranteed? AFAIK, fd doesn't make any such guarantee.

ErichDonGubler on 9 Mar 2019

@ErichDonGubler, some kind of processing order is usually expected from CLI file-related utils (archivers like 7zip and Zstandard, backup managers like Duplicacy and Restic, defrag managers like Contig, duplicate killers like Jdupes, hash calculators like Rhash, even media encoders like LAME and FLAC gave me that impression).

sergeevabc on 9 Mar 2019

So, first, let me see if I can address your immediate problem by asking a question: can your environment be expected to have common POSIX tools like `sort` and `xargs`? If it can, and I understand how you want to use rhash, then you can do something like:

fd -tf -d 1 | sort -V | xargs -I {} rhash --sha256 {}

With regard to fd itself, perhaps the best way to handle your complaint is to make the lack of an order guarantee explicit in the documentation? How do you feel about that? EDIT: see @sharkdp's suggestion below. This is probably the solution to add to the documentation.

Second, it's true that applications can (and often do) enforce a specific order of file walking results -- even if it is only defined by the filesystem implementation. However, not all applications or tools guarantee it, particularly those that traverse file trees asynchronously and without a cleanup or sorting pass. fd is one of those tools.

To illustrate my point, let's analyze where asynchronous operations happen in the relevant paths of fd's source by stepping through manually:

1. `main` enters `walk::scan`.
2. A channel sender and receiver pair is created in `walk::scan` that acts as the work queue for printing results to stdout later, with the results sent by a later usage of a parallel directory walker constructed here. This introduces at least two places where async conditions (which are effectively non-deterministic) will affect the order of results.
3. `walk::scan` enters `walk::spawn_receiver`, where the thread receiving results to print is born. If we're executing the invocation with job execution you referenced above (`fd <expr> -x <job_template>`), the passed `FdConfig` has a command and it's not a batch command, so a pool of threads is spun up, each running `exec::job`.
4. Once a file is found and pulled by a job worker thread in `exec::job`, it calls `exec::CommandTemplate::execute_command`, which calls `exec::command::execute_command`.
5. `execute_command` finally executes the job command and locks a printing mutex, first printing the command's stdout and then stderr. This means that even if a command starts first, if it ends AFTER another command then the second command will still print first.
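The pattern described in these steps can be sketched with the standard library alone. This is not fd's actual code: `run_jobs` and `fake_job` are hypothetical names, and `fake_job` merely stands in for an external command such as `rhash`. The point of the sketch is that results are collected in the order the workers finish, not in the order the paths were queued.

```rust
use std::sync::mpsc;
use std::sync::{Arc, Mutex};
use std::thread;

/// Hypothetical stand-in for running an external command on one path.
fn fake_job(path: &str) -> String {
    format!("done: {}", path)
}

/// Run `job` on every input using `workers` threads and return the outputs
/// in the order in which the workers finished them.
fn run_jobs(inputs: Vec<String>, workers: usize, job: fn(&str) -> String) -> Vec<String> {
    // Work queue: discovered paths go in, worker threads pull them out.
    let (path_tx, path_rx) = mpsc::channel::<String>();
    let path_rx = Arc::new(Mutex::new(path_rx));
    // Result queue: workers push finished output, the caller drains it.
    let (out_tx, out_rx) = mpsc::channel::<String>();

    let handles: Vec<_> = (0..workers.max(1))
        .map(|_| {
            let path_rx = Arc::clone(&path_rx);
            let out_tx = out_tx.clone();
            thread::spawn(move || loop {
                // The lock is held only while taking the next path.
                let next = path_rx.lock().unwrap().recv();
                match next {
                    Ok(path) => out_tx.send(job(&path)).unwrap(),
                    Err(_) => break, // queue closed and drained
                }
            })
        })
        .collect();
    drop(out_tx); // from here on, only the workers hold result senders

    for path in inputs {
        path_tx.send(path).unwrap();
    }
    drop(path_tx); // signal that no more work is coming

    // Arrival order here is completion order, which the scheduler decides.
    let results: Vec<String> = out_rx.iter().collect();
    for handle in handles {
        handle.join().unwrap();
    }
    results
}
```

With several workers, two runs of `run_jobs(paths, 4, fake_job)` may return the same outputs in different orders. With a single worker, paths are taken and finished one at a time, so the result order matches the input order.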

I'll let @sharkdp correct me if I'm wrong here about the intent of the code, but my assumption is that it's optimized for speed: don't add another pass, keep work between file discovery and printing output as simple as possible.

ErichDonGubler on 9 Mar 2019

@ErichDonGubler is correct. You can use --threads 1 / -j 1 if you want to have a deterministic output order.

sharkdp on 9 Mar 2019

@sharkdp, indeed, `-j 1` fixes the issue of output sorting.
Consider adding a remark about sorting both to the docs here and next to that switch (via `-h` and `--help`).

@ErichDonGubler, your bio says ‘dedicated to building software for other humans’. Being an average human with calloused hands, I’m looking for tools that first and foremost deliver predictable output based on previous experience. A human-friendly tool is expected to have a name and version, a licence and the author’s contact data, and a manual with command explanations and usage examples. But above all, its tangible visual part should resemble the behaviour of other tools from the same niche (unless the author is some kind of revolutionary who believes that customs are obsolete or ineffective). For example, ag, grep, pt, ripgrep, and sift are made to search files for patterns; ripgrep is the fastest among them and it delivers that speed without quirks: switches are mostly kept intact for the sake of consistency, so as not to retrain users, and output looks like what a user rooted in (pioneering) Western digital culture expects to see (e.g. left-to-right, a-z). The other way round inevitably leads to lengthy justifications about ‘asynchronicity’ and other peculiarities under the hood, which might impress enthusiasts and the academic milieu, but would likely confuse and alienate our human.

sergeevabc on 19 Mar 2019

@sergeevabc: I see the value in having a reproducible order with the tools we're discussing here, and I'm glad you are teaching me about it! You're the first human I've encountered that has A) expressed a preference for a reliable order and B) has actually taken time to write about it. I would imagine that many humans might also not care or prefer speed to that ordering (because they may not have the same previous experience as you!) -- so I don't consider your point generally applicable, but I do think it's a valuable perspective to keep in mind.

ErichDonGubler on 19 Mar 2019

This is now supported (in a particular way) by the new -l/--list-details option, see #556.

sharkdp on 3 Apr 2020

This has now been released in fd v8.0.

sharkdp on 16 Apr 2020
