Bug 2182421 - Trino / Ceph integration requires changes in S3select engine and RGW
Summary: Trino / Ceph integration requires changes in S3select engine and RGW
Keywords:
Status: ASSIGNED
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: RGW
Version: 6.0
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: medium
Target Milestone: ---
Target Release: 7.0
Assignee: gal salomon
QA Contact: Madhavi Kasturi
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2023-03-28 15:16 UTC by gal salomon
Modified: 2023-08-17 08:46 UTC
CC List: 3 users

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:
Embargoed:


Attachments


Links
System                | ID          | Private | Priority | Status | Summary | Last Updated
Red Hat Issue Tracker | RHCEPH-6338 | 0       | None     | None   | None    | 2023-03-28 17:30:49 UTC

Description gal salomon 2023-03-28 15:16:13 UTC
Description of problem:

Trino gains efficiency by issuing multiple requests per single query. The results returned by RGW must align with Trino's expectations; otherwise queries are rejected or the results are inaccurate.
For an aggregation statement (count), Trino pushes down a non-aggregation statement that retrieves an empty column. Trino issues multiple s3select requests in parallel, and the deviation in the result appears to be related to the number of parallel requests.
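
The expectation described above can be illustrated with a small sketch. This is not RGW, s3select, or Trino code. It assumes a newline-delimited CSV object, assumes each parallel request covers one contiguous byte range, and assumes a request counts only the rows that begin inside its range. Under those assumptions the summed count should not depend on the number of parallel requests. All function names here are hypothetical.

```python
def split_ranges(size, parts):
    """Split [0, size) into at most `parts` contiguous byte ranges (start, end)."""
    if size == 0:
        return []
    step = max(1, -(-size // parts))  # ceiling division, never zero
    return [(s, min(s + step, size)) for s in range(0, size, step)]


def count_rows_in_range(data, start, end):
    """Count rows whose first byte lies in [start, end).

    Assumption: a row starts at offset 0 or immediately after a newline,
    so every row is attributed to exactly one range.
    """
    count = 0
    for pos in range(start, end):
        if pos == 0 or data[pos - 1:pos] == b"\n":
            count += 1
    return count


def parallel_count(data, parts):
    """Sum the per-range row counts, as a client would for count(*)."""
    return sum(
        count_rows_in_range(data, start, end)
        for start, end in split_ranges(len(data), parts)
    )


if __name__ == "__main__":
    obj = b"1,a\n2,bb\n3,ccc\n4,d\n"
    for parts in range(1, 9):
        print(parts, parallel_count(obj, parts))
```

If the total changes as `parts` changes, some row is being counted twice or dropped at a range boundary, which is the kind of deviation described in this report.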

Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:

Comment 2 gal salomon 2023-08-17 08:46:04 UTC
The following PRs deal with the Trino / Ceph integration and resolve various issues related to it.

https://github.com/ceph/ceph/pull/49411
https://github.com/ceph/ceph/pull/50471
https://github.com/ceph/ceph/pull/52651

