Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 1402471

Summary: SampleTimeKeepingJob failed due to - commands ignored until end of transaction block
Product: [oVirt] ovirt-engine-dwh Reporter: Eldad Marciano <emarcian>
Component: Database Assignee: Shirly Radco <sradco>
Status: CLOSED NOTABUG QA Contact: Pavel Stehlik <pstehlik>
Severity: high Docs Contact:
Priority: unspecified    
Version: 4.0.6 CC: bugs, emarcian, kshukla, oourfali, sradco, ykaul
Target Milestone: ovirt-4.0.7 Flags: sradco: ovirt-4.0.z?
sradco: ovirt-4.1?
sradco: planning_ack?
sradco: devel_ack?
sradco: testing_ack?
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2016-12-12 12:10:41 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Metrics RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Eldad Marciano 2016-12-07 15:50:50 UTC
Description of problem:

2016-12-05 17:50:45|Krot3R|wmN1ng|TycDii|OVIRT_ENGINE_DWH|SampleTimeKeepingJob|Default|5|tWarn|tWarn_1|Can not sample data, oVirt Engine is not updating the statistics. Please check your oVirt Engine status.|9704
Exception in component tJDBCInput_5
Exception in component tJDBCInput_10
org.postgresql.util.PSQLException: ERROR: smallint out of range
        at org.postgresql.core.v3.QueryExecutorImpl.receiveErrorResponse(QueryExecutorImpl.java:2157)
        at org.postgresql.core.v3.QueryExecutorImpl.processResults(QueryExecutorImpl.java:1886)
        at org.postgresql.core.v3.QueryExecutorImpl.execute(QueryExecutorImpl.java:255)
        at org.postgresql.jdbc2.AbstractJdbc2Statement.execute(AbstractJdbc2Statement.java:555)
        at org.postgresql.jdbc2.AbstractJdbc2Statement.executeWithFlags(AbstractJdbc2Statement.java:403)
        at org.postgresql.jdbc2.AbstractJdbc2Statement.executeQuery(AbstractJdbc2Statement.java:283)
        at ovirt_engine_dwh.statisticssync_4_0.StatisticsSync.tJDBCInput_5Process(StatisticsSync.java:4056)
        at ovirt_engine_dwh.statisticssync_4_0.StatisticsSync$3.run(StatisticsSync.java:15979)
org.postgresql.util.PSQLException: ERROR: current transaction is aborted, commands ignored until end of transaction block
        at org.postgresql.core.v3.QueryExecutorImpl.receiveErrorResponse(QueryExecutorImpl.java:2157)
        at org.postgresql.core.v3.QueryExecutorImpl.processResults(QueryExecutorImpl.java:1886)
        at org.postgresql.core.v3.QueryExecutorImpl.execute(QueryExecutorImpl.java:255)
        at org.postgresql.jdbc2.AbstractJdbc2Statement.execute(AbstractJdbc2Statement.java:555)
        at org.postgresql.jdbc2.AbstractJdbc2Statement.executeWithFlags(AbstractJdbc2Statement.java:403)
        at org.postgresql.jdbc2.AbstractJdbc2Statement.executeQuery(AbstractJdbc2Statement.java:283)
        at ovirt_engine_dwh.statisticssync_4_0.StatisticsSync.tJDBCInput_10Process(StatisticsSync.java:8515)
        at ovirt_engine_dwh.statisticssync_4_0.StatisticsSync$5.run(StatisticsSync.java:16071)
2016-12-05 17:54:26|RHRRaC|wmN1ng|0zMypw|OVIRT_ENGINE_DWH|StatisticsSync|Default|6|Java Exception|tJDBCInput_10|org.postgresql.util.PSQLException:ERROR: current transaction is aborted, commands ignored until end of transaction block|1
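
A note on reading the log above: PostgreSQL rejects every further statement in a transaction once one statement in it has failed, until that transaction is rolled back. The `smallint out of range` error on tJDBCInput_5 is therefore the likely root cause, and the `commands ignored until end of transaction block` error on tJDBCInput_10 looks like a follow-on failure, assuming both components run in the same transaction (the log does not confirm this). The sketch below shows one way to tell the two apart by SQLSTATE and to roll back after a failure. The class and method names are hypothetical and are not taken from the DWH code; the two SQLSTATE values are PostgreSQL's standard error codes.

```java
import java.sql.Connection;
import java.sql.SQLException;
import java.sql.Statement;

// Hypothetical helper, not part of ovirt-engine-dwh.
final class AbortedTransactionSketch {
    // PostgreSQL SQLSTATE for "current transaction is aborted, commands ignored ..."
    static final String IN_FAILED_SQL_TRANSACTION = "25P02";
    // PostgreSQL SQLSTATE for numeric overflow errors such as "smallint out of range"
    static final String NUMERIC_VALUE_OUT_OF_RANGE = "22003";

    private AbortedTransactionSketch() {
    }

    // True when the exception only reports that an earlier statement already
    // aborted the transaction, so it is not the root cause itself.
    static boolean isFollowOnFailure(SQLException e) {
        return IN_FAILED_SQL_TRANSACTION.equals(e.getSQLState());
    }

    // Runs one statement. On failure, rolls the transaction back so that later
    // statements on the same connection are not rejected, then rethrows.
    static void executeOrRollback(Connection conn, String sql) throws SQLException {
        try (Statement st = conn.createStatement()) {
            st.execute(sql);
        } catch (SQLException e) {
            // rollback() is only valid when auto-commit is off
            if (!conn.getAutoCommit()) {
                conn.rollback();
            }
            throw e;
        }
    }
}
```

With a check like `isFollowOnFailure`, a log reader or the job itself could report only the first, non-25P02 error as the cause.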

Version-Release number of selected component (if applicable):
4.0.6-1

How reproducible:
100%

Steps to Reproduce:
1. Use a scale-out environment with 27 hosts and 2.2k VMs.

Actual results:
SampleTimeKeepingJob fails.

Expected results:
SampleTimeKeepingJob runs with no issues.

Additional info:

Comment 3 Yaniv Kaul 2016-12-09 16:20:54 UTC
Severity?

Comment 4 Eldad Marciano 2016-12-11 09:48:36 UTC
(In reply to Yaniv Kaul from comment #3)
> Severity?

high (Shirly already set it)

Comment 6 Eldad Marciano 2016-12-11 12:21:11 UTC
After chatting with Shirly, we will keep monitoring the issue.
It might be related to the monitoring lock we found in BZ https://bugzilla.redhat.com/show_bug.cgi?id=1364791

Comment 7 Yaniv Kaul 2016-12-12 12:10:41 UTC
(In reply to Eldad Marciano from comment #6)
> After chatting with Shirly, we will keep monitoring the issue.
> It might be related to the monitoring lock we found in BZ
> https://bugzilla.redhat.com/show_bug.cgi?id=1364791

Closing for the time being. Please re-open if relevant.

Comment 8 Eldad Marciano 2016-12-12 12:35:58 UTC
I'm not sure we want to close this bug; it means DWH can't sample data for some reason.
What about retry / failure recovery logic, a nicer error message, and printing the stack trace only at debug level?
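
To make the suggestion concrete, here is a minimal sketch of such retry logic, using `java.util.logging`. It logs a one-line message at WARNING for each failed attempt and the full stack trace only at FINE (debug). The class name, method name, and parameters are hypothetical; this is not how the DWH jobs are currently structured.

```java
import java.util.concurrent.Callable;
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical helper, not part of ovirt-engine-dwh.
final class RetrySketch {
    private static final Logger LOG = Logger.getLogger(RetrySketch.class.getName());

    private RetrySketch() {
    }

    // Calls the task up to maxAttempts times, sleeping delayMillis between
    // attempts. Returns the first successful result, or rethrows the last
    // failure once all attempts are used up.
    static <T> T retry(Callable<T> task, int maxAttempts, long delayMillis) throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        Exception last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return task.call();
            } catch (Exception e) {
                if (e instanceof InterruptedException) {
                    // Do not retry an interrupted task; restore the flag and stop.
                    Thread.currentThread().interrupt();
                    throw e;
                }
                last = e;
                // Short message for the regular log
                LOG.warning("Attempt " + attempt + " of " + maxAttempts
                        + " failed: " + e.getMessage());
                // Full stack trace only when debug logging is enabled
                LOG.log(Level.FINE, "Stack trace for failed attempt " + attempt, e);
                if (attempt < maxAttempts) {
                    Thread.sleep(delayMillis);
                }
            }
        }
        throw last;
    }
}
```

A caller would wrap the sampling query, for example `RetrySketch.retry(() -> runSample(), 3, 5000L)`, where `runSample` stands in for whatever performs the query.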

Comment 11 Red Hat Bugzilla 2023-09-14 03:35:47 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days