Bug 1402471 - SampleTimeKeepingJob failed due to - commands ignored until end of transaction block [NEEDINFO]
Summary: SampleTimeKeepingJob failed due to - commands ignored until end of transacti...
Keywords:
Status: CLOSED NOTABUG
Alias: None
Product: ovirt-engine-dwh
Classification: oVirt
Component: Database
Version: 4.0.6
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: high
Target Milestone: ovirt-4.0.7
: ---
Assignee: Shirly Radco
QA Contact: Pavel Stehlik
URL:
Whiteboard:
Depends On:
Blocks:
 
Reported: 2016-12-07 15:50 UTC by Eldad Marciano
Modified: 2020-05-14 15:32 UTC (History)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2016-12-12 12:10:41 UTC
oVirt Team: Metrics
sradco: needinfo? (kshukla)
sradco: ovirt-4.0.z?
sradco: ovirt-4.1?
sradco: planning_ack?
sradco: devel_ack?
sradco: testing_ack?



Description Eldad Marciano 2016-12-07 15:50:50 UTC
Description of problem:

2016-12-05 17:50:45|Krot3R|wmN1ng|TycDii|OVIRT_ENGINE_DWH|SampleTimeKeepingJob|Default|5|tWarn|tWarn_1|Can not sample data, oVirt Engine is not updating the statistics. Please check your oVirt Engine status.|9704
Exception in component tJDBCInput_5
Exception in component tJDBCInput_10
org.postgresql.util.PSQLException: ERROR: smallint out of range
        at org.postgresql.core.v3.QueryExecutorImpl.receiveErrorResponse(QueryExecutorImpl.java:2157)
        at org.postgresql.core.v3.QueryExecutorImpl.processResults(QueryExecutorImpl.java:1886)
        at org.postgresql.core.v3.QueryExecutorImpl.execute(QueryExecutorImpl.java:255)
        at org.postgresql.jdbc2.AbstractJdbc2Statement.execute(AbstractJdbc2Statement.java:555)
        at org.postgresql.jdbc2.AbstractJdbc2Statement.executeWithFlags(AbstractJdbc2Statement.java:403)
        at org.postgresql.jdbc2.AbstractJdbc2Statement.executeQuery(AbstractJdbc2Statement.java:283)
        at ovirt_engine_dwh.statisticssync_4_0.StatisticsSync.tJDBCInput_5Process(StatisticsSync.java:4056)
        at ovirt_engine_dwh.statisticssync_4_0.StatisticsSync$3.run(StatisticsSync.java:15979)
org.postgresql.util.PSQLException: ERROR: current transaction is aborted, commands ignored until end of transaction block
        at org.postgresql.core.v3.QueryExecutorImpl.receiveErrorResponse(QueryExecutorImpl.java:2157)
        at org.postgresql.core.v3.QueryExecutorImpl.processResults(QueryExecutorImpl.java:1886)
        at org.postgresql.core.v3.QueryExecutorImpl.execute(QueryExecutorImpl.java:255)
        at org.postgresql.jdbc2.AbstractJdbc2Statement.execute(AbstractJdbc2Statement.java:555)
        at org.postgresql.jdbc2.AbstractJdbc2Statement.executeWithFlags(AbstractJdbc2Statement.java:403)
        at org.postgresql.jdbc2.AbstractJdbc2Statement.executeQuery(AbstractJdbc2Statement.java:283)
        at ovirt_engine_dwh.statisticssync_4_0.StatisticsSync.tJDBCInput_10Process(StatisticsSync.java:8515)
        at ovirt_engine_dwh.statisticssync_4_0.StatisticsSync$5.run(StatisticsSync.java:16071)
2016-12-05 17:54:26|RHRRaC|wmN1ng|0zMypw|OVIRT_ENGINE_DWH|StatisticsSync|Default|6|Java Exception|tJDBCInput_10|org.postgresql.util.PSQLException:ERROR: current transaction is aborted, commands ignored until end of transaction block|1
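For readers of the log above: the second exception is most likely a consequence of the first. In PostgreSQL, once a statement fails inside a transaction block, later statements in that transaction are rejected until it is rolled back. The following is only a minimal sketch for telling the two apart by SQLSTATE, assuming the standard PostgreSQL codes 22003 (numeric value out of range) and 25P02 (in failed SQL transaction). The class and method names are hypothetical and are not part of the DWH code.

```java
import java.sql.SQLException;

/** Hypothetical helper, not part of ovirt-engine-dwh. */
public class PgErrorTriage {
    // Standard PostgreSQL SQLSTATE codes.
    static final String NUMERIC_OUT_OF_RANGE = "22003";
    static final String IN_FAILED_TRANSACTION = "25P02";

    /** True unless the error only reports that the transaction was already aborted. */
    public static boolean isRootCause(SQLException e) {
        return !IN_FAILED_TRANSACTION.equals(e.getSQLState());
    }

    /** Short label suitable for a one-line log message. */
    public static String describe(SQLException e) {
        String state = e.getSQLState();
        if (IN_FAILED_TRANSACTION.equals(state)) {
            return "follow-on error: an earlier statement aborted the transaction";
        }
        if (NUMERIC_OUT_OF_RANGE.equals(state)) {
            return "root cause: a value does not fit the target numeric type";
        }
        return "other error, SQLSTATE " + state;
    }
}
```

With such a check, only the "smallint out of range" error would need to be investigated; the "commands ignored until end of transaction block" error should disappear once the first one is resolved or the transaction is rolled back.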

Version-Release number of selected component (if applicable):
4.0.6-1

How reproducible:
100%

Steps to Reproduce:
1. Scale out the environment to 27 hosts and 2.2k VMs.

Actual results:
SampleTimeKeepingJob failed.

Expected results:
SampleTimeKeepingJob runs with no issues.

Additional info:

Comment 3 Yaniv Kaul 2016-12-09 16:20:54 UTC
Severity?

Comment 4 Eldad Marciano 2016-12-11 09:48:36 UTC
(In reply to Yaniv Kaul from comment #3)
> Severity?

high (Shirly already set it)

Comment 6 Eldad Marciano 2016-12-11 12:21:11 UTC
After chatting with Shirly, we will keep monitoring the issue.
It might be related to the monitoring lock we found in BZ https://bugzilla.redhat.com/show_bug.cgi?id=1364791

Comment 7 Yaniv Kaul 2016-12-12 12:10:41 UTC
(In reply to Eldad Marciano from comment #6)
> After chatting with Shirly, we will keep monitoring the issue.
> It might be related to the monitoring lock we found in BZ
> https://bugzilla.redhat.com/show_bug.cgi?id=1364791

Closing for the time being. Please re-open if relevant.

Comment 8 Eldad Marciano 2016-12-12 12:35:58 UTC
I'm not sure we want to close this bug; it means DWH can't sample data for some reason.
What about retry / false recovery logic, a nicer error message, and printing the stack trace only at debug level?
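To make the suggestion concrete, here is a rough sketch of what such retry logic could look like. It is an illustration under stated assumptions, not the DWH implementation: the `Sampler` interface, the `runWithRetry` method, and the attempt limit are all hypothetical. The short message is logged at WARNING, and the stack trace only at FINE (the debug level in java.util.logging).

```java
import java.sql.SQLException;
import java.util.logging.Level;
import java.util.logging.Logger;

/** Hypothetical sketch, not part of ovirt-engine-dwh. */
public class SamplingRetry {
    private static final Logger LOG = Logger.getLogger(SamplingRetry.class.getName());

    /** Assumed abstraction over one sampling pass and its transaction. */
    public interface Sampler {
        void sample() throws SQLException;
        void rollback() throws SQLException;
    }

    /** Returns true if a sampling pass succeeded within maxAttempts. */
    public static boolean runWithRetry(Sampler sampler, int maxAttempts) {
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                sampler.sample();
                return true;
            } catch (SQLException e) {
                // Short, readable message for the normal log.
                LOG.log(Level.WARNING, "Sampling attempt {0} of {1} failed: {2}",
                        new Object[] {attempt, maxAttempts, e.getMessage()});
                // Full stack trace only when debug logging is enabled.
                LOG.log(Level.FINE, "Stack trace for failed sampling attempt", e);
                try {
                    // Clear the aborted transaction so the next attempt is not rejected.
                    sampler.rollback();
                } catch (SQLException rollbackError) {
                    LOG.log(Level.WARNING, "Rollback failed: {0}", rollbackError.getMessage());
                    return false;
                }
            }
        }
        return false;
    }
}
```

A retry like this would only help with transient failures. If the same out-of-range value is read on every pass, each attempt would fail the same way, so the underlying data issue would still need a fix.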

