Bug 98119
Summary: | cat <file> | sort -u > <file2>, without some words with accent | ||
---|---|---|---|
Product: | [Retired] Red Hat Linux | Reporter: | hotmail <luisuebel> |
Component: | coreutils | Assignee: | Tim Waugh <twaugh> |
Status: | CLOSED WORKSFORME | QA Contact: | Mike McLean <mikem> |
Severity: | medium | Docs Contact: | |
Priority: | medium | ||
Version: | 9 | ||
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | i686 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2005-09-08 15:29:04 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
hotmail
2003-06-26 20:26:57 UTC
Could you send me a minimal test case (or provide a pointer to one) that demonstrates the problem? Perhaps obscuring the words with "tr '[a-z]' x" would help? Also what locale are you using? What does 'locale' say? I am trying to find a minimum file that appers this error. I really cannot send you the original file. The problem is related to very large files. The original file has 8Mbytes with 1.3Mwords and 65K unique words. I couldn't reproduce the problem with a smaller version of the file. I notice that RedHat 9.0 and RedHat 7.2 have bugs in this case, but they are differents bugs. In RedHat 7.2, there are a couple of non accent words missing, but in RedHat 9.0, there are accented words missing. I cannot reproduce this error with a small file. I don't know if you can arrange a very big text file to test this. Unfortune, I really cannot send you the file. Luis Need a test case before I can analyse the problem. :-/ |