1303795 – RFE: df output is non intuitive when reporting on multiple files residing in current directory (doc only)

Bug 1303795 - RFE: df output is non intuitive when reporting on multiple files residing in current directory (doc only)

Summary: RFE: df output is non intuitive when reporting on multiple files residing in ...

Keywords:
Status:	CLOSED NOTABUG
Alias:	None
Product:	Fedora
Classification:	Fedora
Component:	coreutils
Sub Component:
Version:	rawhide
Hardware:	All
OS:	Linux
Priority:	unspecified
Severity:	low
Target Milestone:	---
Assignee:	Kamil Dudka
QA Contact:	Fedora Extras Quality Assurance
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2016-02-02 01:20 UTC by Noel McLoughlin
Modified:	2016-06-16 16:13 UTC (History)
CC List:	9 users (show)
Fixed In Version:
Clone Of:
Clones:	1303964 (view as bug list)
Environment:
Last Closed:	2016-06-16 16:13:23 UTC
Type:	Bug
Embargoed:
Dependent Products:

Attachments	(Terms of Use)

Links
System	ID	Private	Priority	Status	Summary	Last Updated
CentOS	0010296	0	None	None	None	2016-02-02 01:21:26 UTC

Description Noel McLoughlin 2016-02-02 01:20:07 UTC

Description of problem:

Description	

sort(1) in coreutils 8.24 has an OPTION to check for strict ordering.

       -u, --unique
               with -c, check for strict ordering; without -c, output only the first of an equal run

However, df(1) in coreutils 8.24 has no OPTION to check for strict ordering.

This is inconsistent considering both df and sort reside in same coreutils package, and generated df output can contain duplicates.

Feature Request: The df utility should support strict ordering OPTION (-u).


Version-Release number of selected component (if applicable):

coreutils-8.24, 

How reproducible:

Always

Steps to Reproduce:

1. # cd /

2. Execute these commands:
# df *
# df /*
# df bin lib lib64 sbin

3. Execute these commands:

# cd /
# df --totals *
# df --totals /l*
# df --totals bin lib lib64 sbin


Expected results:

N/A

Additional info:

Comment 1 Noel McLoughlin 2016-02-02 09:24:55 UTC

Example of non intuitive output for directory containing 7 files.

7423bfmp:~/workspace/ISO\ ]$ df * --total
Filesystem              1K-blocks      Used Available Use% Mounted on
/dev/mapper/my_vg-root  163012068 138180884  16527568  90% /
/dev/mapper/my_vg-root  163012068 138180884  16527568  90% /
/dev/mapper/my_vg-root  163012068 138180884  16527568  90% /
/dev/mapper/my_vg-root  163012068 138180884  16527568  90% /
/dev/mapper/my_vg-root  163012068 138180884  16527568  90% /
/dev/mapper/my_vg-root  163012068 138180884  16527568  90% /
/dev/mapper/my_vg-root  163012068 138180884  16527568  90% /
total                  1141084476 967266188 115692976  90% -

Comment 2 Ondrej Vasik 2016-02-03 08:39:01 UTC

Thanks for both BZs, I can see why you request this, but can you please clarify "expected results" so we can prevent possible confusion?
In addition, as this is not something what should be solved downstream, I think it would be best to bring this up on upstream mailing list - to see what the upstream community thinks about this enhancement. 

Can you please send an email about this - either to coreutils or to bug-coreutils (the bugs-coreutils email will automatically create a ticket in bug tracking system? They may have some additional questions and having me as "man in the middle" can slow the things and discussion.

Comment 3 Noel McLoughlin 2016-02-03 10:30:45 UTC

I emailed coreutils asking for comments.  

The current situation is as follows:

7423bfmp:~/workspace/ISO\ ]$ df * --total
Filesystem              1K-blocks      Used Available Use% Mounted on
/dev/mapper/my_vg-root  163012068 138180884  16527568  90% /
/dev/mapper/my_vg-root  163012068 138180884  16527568  90% /
/dev/mapper/my_vg-root  163012068 138180884  16527568  90% /
/dev/mapper/my_vg-root  163012068 138180884  16527568  90% /
/dev/mapper/my_vg-root  163012068 138180884  16527568  90% /
/dev/mapper/my_vg-root  163012068 138180884  16527568  90% /
/dev/mapper/my_vg-root  163012068 138180884  16527568  90% /
total                  1141084476 967266188 115692976  90% -


IMHO, the desired situation is as follows:

7423bfmp:~/workspace/ISO\ ]$ df * --total -u
Filesystem              1K-blocks      Used Available Use% Mounted on
/dev/mapper/my_vg-root  163012068 138180884  16527568  90% /
total                   163012068 138180884  16527568  90% -

thanks,
Noel

Comment 4 Noel McLoughlin 2016-02-11 18:12:41 UTC

** The only feedback from coreutils is from Bernhard Voelker ***

I'm not too enthusiastic for such an option.  This is the first time
I've seen someone doing "df *".  Although it doesn't seem to be explicitly
specified by POSIX, I'd assume a consistent behavior like many other tools
which do their job for each command line argument regardless whether it
may make sense (stat, du) or maybe not (df, dirname).

Furthermore, since the introduction of the output field 'file', it really
makes sense to not unify the output:


  $ df -h --out=source,target,file df du mv
  Filesystem     Mounted on      File
  /dev/sda5      /media/big_data df
  /dev/sda5      /media/big_data du
  /dev/sda5      /media/big_data mv

A --unique option would clash with this output format.

Finally, I'm not sure if everyone is aware that also regular files can
be a mount point (admittedly, the output of 'findmnt' is a little more
useful in this case):

  $ touch a b c

  $ mount --bind /proc/cpuinfo b
  $ mount --bind /sys/kernel/notes c

  $ df -h a b c
  Filesystem      Size  Used Avail Use% Mounted on
  /dev/sda2        20G   14G  5.2G  73% /
  proc               0     0     0    - /root/tmp/b
  sysfs              0     0     0    - /root/tmp/c

Still, when 2 lines are really identical (because they are on the
same file system and not mounted over), it may be valid to squeeze
the output, but I don't think this is following the principle of
least suprise: someone might be confused why some lines are missing
... or even relying on that in a script.  Anyway, one may simply
do that with "... | sort -u" if needed.

Have a nice day,
Berny

Comment 5 Noel McLoughlin 2016-02-11 18:13:51 UTC

I would also note the inconsistency between df and sort when invoked with "*".

# pwd
/usr/share
# sort * | wc -l
sort: read failed: abrt: Is a directory
0
# df -l * | wc -l
233

Comment 6 Noel McLoughlin 2016-02-11 18:26:40 UTC

Due to lack of consensus, I think no action will be taken downstream.

Still .. it is open to Linux distribution vendors to consider updating the df(1) man page to explain how '*' globbing can produce output which may not make sense (df * --total) or need further processing (df *) to remove duplicates.

Comment 7 Ondrej Vasik 2016-02-12 13:38:35 UTC

Thanks for bringing it to upstream. Yes, I don't expect this implemented downstream only.
Keeping opened for now so we can later come up with some documentation adjustment. Please note that will likely be only adjustment to info documentation , man pages in coreutils are intended to be compact.

Comment 8 Kamil Dudka 2016-05-12 09:37:11 UTC

Running 'df *' does not make any sense in most of the cases.

Did not you want to run 'du *' instead?

Comment 9 Noel McLoughlin 2016-05-12 10:45:29 UTC

Hi Kamil,
Thanks for comment. Yes, "du *" is better user space choice, but this BZ is a comment on df behaviour in this case, not choices. However, I would recommend your comment is taken on board in df info-docs.  i.e. 
---
  
   Output from "df *" may be non intuitive. Consider "du *" instead.

---

Comment 10 Fedora Admin XMLRPC Client 2016-06-13 09:05:04 UTC

This package has changed ownership in the Fedora Package Database.  Reassigning to the new owner of this component.

Comment 11 Kamil Dudka 2016-06-16 16:13:23 UTC

In my opinion, the output of "df *" is not non-intuitive.  It is just useless in most of the cases.  I am not sure how the info documentation could be improved.  You are welcome to send a proposal to the bug-coreutils mailing list.

Note You need to log in before you can comment on or make changes to this bug.