Bug 2118726

Summary: [RGW][s3-select][parquet]: query on parquet object without .parquet extension fails with "out of memory" error
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Madhavi Kasturi <mkasturi>
Component: RGW Assignee: gal salomon <gsalomon>
Status: CLOSED ERRATA QA Contact: Madhavi Kasturi <mkasturi>
Severity: medium Docs Contact: Rivka Pollack <rpollack>
Priority: unspecified    
Version: 6.0 CC: akraj, cbodley, ceph-eng-bugs, cephqe-warriors, gsalomon, kbader, kdreyer, kkeithle, mbenjamin
Target Milestone: ---   
Target Release: 7.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: ceph-18.2.0-1 Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2023-12-13 15:19:19 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 2237662    

Comment 2 gal salomon 2022-08-16 14:59:06 UTC
Without an extension, the object is handled as CSV.
Because the Parquet object is binary, the query probably failed while trying to read the object as CSV.

Comment 12 gal salomon 2023-03-30 09:35:38 UTC
This behavior was changed in PR #49411 (merged).
RGW now identifies the input type from the AWS CLI request parameters rather than from the object extension.
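
For illustration only, here is a minimal sketch of a request that declares the input type explicitly, so that the object name plays no role. The helper name `build_select_request`, the bucket `bkt1`, and the key `rowgroup_object` are hypothetical; the dictionary keys follow the standard S3 SelectObjectContent parameters as exposed by boto3.

```python
def build_select_request(bucket, key, expression, input_format="Parquet"):
    """Build keyword arguments for an S3 Select call (hypothetical helper).

    The input format is stated in the request itself, so the key does not
    need a .parquet or .csv extension.
    """
    if input_format not in ("CSV", "Parquet"):
        raise ValueError(f"unsupported input format: {input_format}")
    return {
        "Bucket": bucket,
        "Key": key,
        "ExpressionType": "SQL",
        "Expression": expression,
        # The format name is the dictionary key; empty options mean defaults.
        "InputSerialization": {input_format: {}},
        "OutputSerialization": {"CSV": {}},
    }


# Hypothetical bucket and key; note that the key has no extension.
params = build_select_request(
    "bkt1", "rowgroup_object", "select count(*) from s3object;"
)
```

With boto3 installed, these arguments could be passed as `client.select_object_content(**params)`, where `client` is a `boto3.client("s3", endpoint_url=...)` pointed at an RGW endpoint. Whether the query succeeds still depends on running a build that includes the fix.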

Comment 13 gal salomon 2023-08-17 08:53:40 UTC
This issue is resolved in https://github.com/ceph/ceph/pull/49411/

Comment 21 errata-xmlrpc 2023-12-13 15:19:19 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat Ceph Storage 7.0 Bug Fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2023:7780