Bug 2158115

Summary: `file` misdetects sjis encoded cvs file with CRLF as "DIY-Thermocam raw data"
Product: Red Hat Enterprise Linux 8 Reporter: Jing Huo <jhuo>
Component: fileAssignee: Vincent Mihalkovič <vmihalko>
Status: CLOSED ERRATA QA Contact: Karel Volný <kvolny>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 8.7CC: kdudka, kvolny, vmihalko
Target Milestone: rcKeywords: Triaged
Target Release: ---Flags: jhuo: needinfo-
pm-rhel: mirror+
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: file-5.33-25.el8 Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2023-11-14 15:48:54 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Jing Huo 2023-01-04 10:10:04 UTC
Description of problem:
`file` misdetects sjis encoded cvs file with CRLF line terminators as "DIY-Thermocam raw data".

Version-Release number of selected component (if applicable):
file-5.33-21.el8.x86_64

How reproducible:


Steps to Reproduce:
1.run `file cvs_filename` against a sjis encoded cvs file with CRLF line terminators
2.
3.

Actual results:

# file file-cmd.csv 
file-cmd.csv: DIY-Thermocam raw data (Lepton 2.x), scale 38590-38590, spot sensor temperature -0.294423, color scheme 150, maximum point enabled, calibration: offset -0.000000, slope 7916750543304341681668233308762275840.000000

Expected results:

# file file-cmd.csv 
file-cmd.csv: Non-ISO extended-ASCII text, with CRLF line terminators


Additional info:
On both RHEL 7 and RHEL 9, `file` can recognize the file correctly.

On RHEL 8, after converting to UTF8, the result is expected.

# file file-utf8.csv 
file-utf8.csv: UTF-8 Unicode text, with CRLF line terminators

After removing CRLF by dos2unix, the result is expected.

# file file-cmd.csv 
file-cmd.csv: Non-ISO extended-ASCII text

Comment 11 errata-xmlrpc 2023-11-14 15:48:54 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (file bug fix and enhancement update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2023:7104