Bug 142948
Summary: | wc -m option unable to count the number of multibyte characters in a file | ||||||
---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Linux 4 | Reporter: | Lawrence Lim <llim> | ||||
Component: | coreutils | Assignee: | Tim Waugh <twaugh> | ||||
Status: | CLOSED NOTABUG | QA Contact: | David Lawrence <dkl> | ||||
Severity: | medium | Docs Contact: | |||||
Priority: | medium | ||||||
Version: | 4.0 | CC: | eng-i18n-bugs, tools-bugs | ||||
Target Milestone: | --- | ||||||
Target Release: | --- | ||||||
Hardware: | All | ||||||
OS: | Linux | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2004-12-15 09:29:48 UTC | Type: | --- | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | |||||||
Bug Blocks: | 142941 | ||||||
Attachments: |
|
Description
Lawrence Lim
2004-12-15 09:16:21 UTC
Created attachment 108596 [details]
script with a single Test Case from LI18NUX Test for testing wc -m option
Here is my analysis of this for LSB 1.3. Note that this has nothing to do with glibc, but with coreutils: 10|570 /tset/LI18NUX2K.L1/utils/wc/wc 00:21:33|TC Start, scenario ref 574-0 520|570 1 6181 1 1|* When -m option is specified, verify this utility outputs the number of characters in each input file even if the characters are multibyte characters. 520|570 1 6181 1 2| 520|570 1 6181 1 3|Can't count number of characters. 220|570 1 1 00:21:33|FAIL 520|570 2 6181 1 1|* When this utility writes to the standard output the number of words, this utility correctly recognizes the boundaries of words. The boundaries are shown as white-space characters constituted in current locale. 520|570 2 6181 1 2| 520|570 2 6181 1 3|Can't count number of words. 220|570 2 1 00:21:33|FAIL LSB here expects that wc violates POSIX standard and prints say 26 text.txt instead of 26 text.txt POSIX requires "%d %d %d %s\n", <newlines>, <words>, <bytes>, <file> format (with omitting the numbers that are not printed), see http://www.opengroup.org/onlinepubs/009695399/utilities/wc.html, while: The output file format pseudo- printf() string differs from the System V version of wc: "%7d%7d%7d %s\n" which produces possibly ambiguous and unparsable results for very large files, as it assumes no number shall exceed six digits. I'd say LSB testsuite should be changed to accept both at least. This is fixed in LSB 2.0 already. *** Bug 142949 has been marked as a duplicate of this bug. *** |