Bug 2118726

Summary: [RGW][s3-select][parquet]: query on parquet object without .parquet extension fails with " out of memory " error
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Madhavi Kasturi <mkasturi>
Component: RGWAssignee: gal salomon <gsalomon>
Status: ASSIGNED --- QA Contact: Madhavi Kasturi <mkasturi>
Severity: medium Docs Contact:
Priority: unspecified    
Version: 6.0CC: cbodley, ceph-eng-bugs, cephqe-warriors, kbader, kkeithle, mbenjamin
Target Milestone: ---   
Target Release: 7.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Comment 2 gal salomon 2022-08-16 14:59:06 UTC
without an extension, the object is handled as CSV.
and since the Parquet object is binary, it probably failed upon trying to read the object.

Comment 12 gal salomon 2023-03-30 09:35:38 UTC
this behavior was changed in PR #49411 (merged)
RGW identified the input type according to AWS-CLI parameters and not according to object extension.

Comment 13 gal salomon 2023-08-17 08:53:40 UTC
this issue resolved in https://github.com/ceph/ceph/pull/49411/