Bug 2214981 - [CEE/sd][RGW] RGWSI_Notify::robust_notify(const DoutPrefixProvider*, RGWSI_RADOS::Obj&, const RGWCacheNotifyInfo&, optional_yield):402 Notify failed on object: (110) Connection timed out [NEEDINFO]
Summary: [CEE/sd][RGW] RGWSI_Notify::robust_notify(const DoutPrefixProvider*, RGWSI_RA...
Keywords:
Status: ASSIGNED
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: RGW
Version: 5.3
Hardware: x86_64
OS: Linux
unspecified
medium
Target Milestone: ---
: 7.0
Assignee: Matt Benjamin (redhat)
QA Contact: Madhavi Kasturi
URL:
Whiteboard:
Depends On:
Blocks: 2228874 2228875 2230445
TreeView+ depends on / blocked
 
Reported: 2023-06-14 10:41 UTC by Tridibesh Chakraborty
Modified: 2023-08-11 02:54 UTC (History)
9 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
: 2228874 (view as bug list)
Environment:
Last Closed:
Embargoed:
aemerson: needinfo-
mbenjamin: needinfo? (mkogan)
trchakra: needinfo? (mbenjamin)
trchakra: needinfo? (cbodley)


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Ceph Project Bug Tracker 59495 0 None None None 2023-06-28 13:10:50 UTC
Github ceph ceph pull 51161 0 None Merged rgw/sts: AssumeRole no longer writes to user metadata 2023-06-28 13:10:50 UTC
Red Hat Issue Tracker RHCEPH-6827 0 None None None 2023-06-14 10:42:50 UTC

Description Tridibesh Chakraborty 2023-06-14 10:41:59 UTC
Description of problem:
Customer is observing frequent connection timed out error in the RGW logs and because of this jobs are getting cancelled and users have to rerun the jobs. There is no pattern for the connection timed out BTW as per customer this is happening when they are coping huge data via RGW. 

~~~
2023-05-30T06:02:26.098+0000 7f8cbf8b5700  1 req 2251681747474232490 10.014173508s int RGWSI_Notify::robust_notify(const DoutPrefixProvider*, RGWSI_RADOS::Obj&, const RGWCacheNotifyInfo&, optional_yield):402 Notify failed on object xx.rgw.meta:users.uid:xxx: (110) Connection timed out
2023-05-30T06:02:26.098+0000 7f8cbf8b5700  1 req 2251681747474232490 10.014173508s int RGWSI_Notify::robust_notify(const DoutPrefixProvider*, RGWSI_RADOS::Obj&, const RGWCacheNotifyInfo&, optional_yield):418 Invalidating obj=xx.rgw.meta:users.uid:xxx tries=0
~~~

Version-Release number of selected component (if applicable):
RHCS 5.3z3 (16.2.10-160.el8cp)
RHEL 8.4

How reproducible:
Customer environment specific

Steps to Reproduce:
NA

Actual results:
Connection timed out happening 

Expected results:
There should not be connection timed out messages

Additional info:

Customer is using Hadoop credential provider for OIDC+STS authentication


Note You need to log in before you can comment on or make changes to this bug.