Bug 235420 - File does not recognize a UTF-16, little-endian encoded XML file
File does not recognize a UTF-16, little-endian encoded XML file
Status: CLOSED ERRATA
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: file (Show other bugs)
5.0
All Linux
medium Severity medium
: ---
: ---
Assigned To: Tomas Smetana
:
Depends On:
Blocks:
  Show dependency treegraph
 
Reported: 2007-04-05 13:29 EDT by Dave Malcolm
Modified: 2009-01-20 17:02 EST (History)
1 user (show)

See Also:
Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2009-01-20 17:02:49 EST
Type: ---
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---


Attachments (Terms of Use)
Input file (sticking to 7-bit characters) (49 bytes, text/plain)
2007-04-05 13:29 EDT, Dave Malcolm
no flags Details
output file, should be UTF-16 little-endian encoding (136 bytes, text/xml)
2007-04-05 13:30 EDT, Dave Malcolm
no flags Details

  None (edit)
Description Dave Malcolm 2007-04-05 13:29:46 EDT
Description of problem:
Attached is an XML file, which I believe is a correctly encoded UTF-16,
little-endian encoded XML file (c.f. :
http://www.w3.org/TR/REC-xml/#sec-guessing )

xmllint reads it fine.

However running "file" on it gives blank output:
file output.xml
output.xml: 

Version-Release number of selected component (if applicable):
file-4.17-8

How reproducible:
100%


Steps to Reproduce:
1. Take an XML file, run "xmllint --encode UTF-16 input.xml > output.xml"
2. xmllint output.xml
3. file output.xml
  
Actual results:
xmllint reads the file fine but file gives no useful output:
file output.xml 
output.xml: 
(i.e. blank output)

Expected results:
either:
  output.xml: XML 1.0 document text
or:
  output.xml: XML 1.0 document text (UTF-16 little-endian encoding)
or somesuch
Comment 1 Dave Malcolm 2007-04-05 13:29:46 EDT
Created attachment 151788 [details]
Input file (sticking to 7-bit characters)
Comment 2 Dave Malcolm 2007-04-05 13:30:35 EDT
Created attachment 151789 [details]
output file, should be UTF-16 little-endian encoding
Comment 3 RHEL Product and Program Management 2008-06-04 18:49:44 EDT
This request was evaluated by Red Hat Product Management for inclusion in a Red
Hat Enterprise Linux maintenance release.  Product Management has requested
further review of this request by Red Hat Engineering, for potential
inclusion in a Red Hat Enterprise Linux Update release for currently deployed
products.  This request is not yet committed for inclusion in an Update
release.
Comment 8 errata-xmlrpc 2009-01-20 17:02:49 EST
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on therefore solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2009-0208.html

Note You need to log in before you can comment on or make changes to this bug.