Created attachment 683561 [details]
dwh.log

Description of problem:
Table audit_log does not get any logs for the following exceptions. The ovirt-engine.dwhd log records exceptions caused by DNS connections being blocked with iptables.

Version-Release number of selected component (if applicable):
3.2/sf3

How reproducible:
always

Steps to Reproduce:
1. Install rhevm + dwh + reports.
2. Create entities: hosts, SD, VMs.
3. Apply the iptables rules below.
4. Examine the ovirt-engine.dwhd log.

Actual results:
The ovirt-engine.dwhd log records the exceptions, but there are no events/logs in the audit_log table.

Expected results:
The audit_log table should have events for the exceptions written to dwhd.log.

Additional info:
iptables rules:
Add rules in iptables
-A INPUT -s DNS-IP/32 -d RHEVM-IP/32 -p udp -m udp --sport 53 --dport 1024:65535 -m state --state ESTABLISHED -j DROP
-A INPUT -s DNS-IP/32 -d RHEVM-IP/32 -p tcp -m tcp --sport 53 --dport 1024:65535 -m state --state ESTABLISHED -j DROP
-A OUTPUT -p udp -m udp --dport 53 -j DROP
-A OUTPUT -s RHEVM-IP/32 -d DNS-IP/32 -p tcp -m tcp --sport 1024:65535 --dport 53 -m state --state NEW,ESTABLISHED -j DROP

=======================================
ovirt-engine-dwh.log exception
--------------------------------
2013-01-17 17:03:00|Lj9gd5|S80i59|0o0sDj|OVIRT_ENGINE_DWH|SampleTimeKeepingJob|Default|6|Java Exception|tJava_1|java.net.UnknownHostException:dbotzer-ydary.qa.lab.tlv.redhat.com|1
Exception in component tJava_1
java.net.UnknownHostException: dbotzer-ydary.qa.lab.tlv.redhat.com
    at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:178)
    at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:391)
    at java.net.Socket.connect(Socket.java:579)
    at sun.security.ssl.SSLSocketImpl.connect(SSLSocketImpl.java:618)
    at sun.security.ssl.BaseSSLSocketImpl.connect(BaseSSLSocketImpl.java:160)
    at sun.net.NetworkClient.doConnect(NetworkClient.java:180)
    at sun.net.www.http.HttpClient.openServer(HttpClient.java:378)
    at sun.net.www.http.HttpClient.openServer(HttpClient.java:473)
    at sun.net.www.protocol.https.HttpsClient.<init>(HttpsClient.java:270)
    at sun.net.www.protocol.https.HttpsClient.New(HttpsClient.java:327)
    at sun.net.www.protocol.https.AbstractDelegateHttpsURLConnection.getNewHttpClient(AbstractDelegateHttpsURLConnection.java:191)
    at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:931)
    at sun.net.www.protocol.https.AbstractDelegateHttpsURLConnection.connect(AbstractDelegateHttpsURLConnection.java:177)
    at sun.net.www.protocol.https.HttpsURLConnectionImpl.connect(HttpsURLConnectionImpl.java:153)
    at ovirt_engine_dwh.sampletimekeepingjob_3_2.SampleTimeKeepingJob.tJava_1Process(SampleTimeKeepingJob.java:1170)
    at ovirt_engine_dwh.sampletimekeepingjob_3_2.SampleTimeKeepingJob$1.run(SampleTimeKeepingJob.java:4309)
2013-01-17 17:04:00|DNIL8R|S80i59|0o0sDj|OVIRT_ENGINE_DWH|SampleTimeKeepingJob|Default|6|Java Exception|tJava_1|java.net.UnknownHostException:dbotzer-ydary.qa.lab.tlv.redhat.com|1
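
For anyone triaging the attached log, the pipe-delimited error lines above can be picked out from the stack-trace lines with a small script. This is only a rough sketch: the field names (`timestamp`, `project`, `job`, `origin`, and so on) are my own guesses from the position of each value in the sample line, not names taken from the DWH code, and the assumption that every error line has exactly 12 fields comes from the two samples in this report.

```python
from typing import NamedTuple, Optional


class DwhLogRecord(NamedTuple):
    # Field names are hypothetical labels inferred from the sample line.
    timestamp: str
    project: str
    job: str
    origin: str           # component that failed, e.g. "tJava_1"
    exception_class: str  # e.g. "java.net.UnknownHostException"
    detail: str           # text after the first ":" in the message field


def parse_dwh_line(line: str) -> Optional[DwhLogRecord]:
    """Return a record for a pipe-delimited error line, or None otherwise."""
    parts = line.rstrip("\n").split("|")
    if len(parts) != 12:
        # Stack-trace lines ("at ...") and other text contain no such layout.
        return None
    exception_class, _, detail = parts[10].partition(":")
    return DwhLogRecord(
        timestamp=parts[0],
        project=parts[4],
        job=parts[5],
        origin=parts[9],
        exception_class=exception_class,
        detail=detail,
    )


sample = (
    "2013-01-17 17:03:00|Lj9gd5|S80i59|0o0sDj|OVIRT_ENGINE_DWH|"
    "SampleTimeKeepingJob|Default|6|Java Exception|tJava_1|"
    "java.net.UnknownHostException:dbotzer-ydary.qa.lab.tlv.redhat.com|1"
)
record = parse_dwh_line(sample)
print(record)
```

Each record found this way is an error for which a matching event would be expected in audit_log.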
Created attachment 683562 [details] engine-logs
Created attachment 683563 [details] all-logs
audit_log does not get logs even if a view is deleted.
I deleted vm_configuration from the ovirt_engine_history DB, and I can't get any logs in audit_log. I waited more than 5 minutes.
---
2013-01-21 11:07:00|rvGQ5L|8slpb6|EbyCZc|OVIRT_ENGINE_DWH|SampleRunJobs|Default|6|Java Exception|tRunJob_1|java.lang.RuntimeException:Child job running failed|1
Exception in component tRunJob_1
java.lang.RuntimeException: Child job running failed
    at ovirt_engine_dwh.sampletimekeepingjob_3_2.SampleTimeKeepingJob.tRunJob_1Process(SampleTimeKeepingJob.java:2479)
    at ovirt_engine_dwh.sampletimekeepingjob_3_2.SampleTimeKeepingJob.tJDBCInput_2Process(SampleTimeKeepingJob.java:2322)
    at ovirt_engine_dwh.sampletimekeepingjob_3_2.SampleTimeKeepingJob.tJDBCConnection_1Process(SampleTimeKeepingJob.java:1484)
    at ovirt_engine_dwh.sampletimekeepingjob_3_2.SampleTimeKeepingJob.tJDBCConnection_2Process(SampleTimeKeepingJob.java:1386)
    at ovirt_engine_dwh.sampletimekeepingjob_3_2.SampleTimeKeepingJob.tJava_3Process(SampleTimeKeepingJob.java:1294)
    at ovirt_engine_dwh.sampletimekeepingjob_3_2.SampleTimeKeepingJob.tJava_1Process(SampleTimeKeepingJob.java:1199)
    at ovirt_engine_dwh.sampletimekeepingjob_3_2.SampleTimeKeepingJob$1.run(SampleTimeKeepingJob.java:4309)
2013-01-21 11:07:00|EbyCZc|8slpb6|RZ8cn0|OVIRT_ENGINE_DWH|SampleTimeKeepingJob|Default|6|Java Exception|tRunJob_1|java.lang.RuntimeException:Child job running failed|1
Created attachment 684122 [details] missingTable
Tested SF6. Is this the output I should get in the audit_log table?
------
"USER_VDC_LOGIN" - "User admin@internal logged in." - FALSE
----------------

And in dwh.log:
----------------
2013-02-14 10:18:04|5OWQs5|meBWew|8Bni46|OVIRT_ENGINE_DWH|SampleTimeKeepingJob|Default|6|Java Exception|tJava_1|java.net.UnknownHostException:dbotzer-vm-jrs7.qa.lab.tlv.redhat.com|1
Exception in component tJava_1
java.net.UnknownHostException: dbotzer-vm-jrs7.qa.lab.tlv.redhat.com
    at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:178)
    at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:391)
    at java.net.Socket.connect(Socket.java:579)
    at sun.security.ssl.SSLSocketImpl.connect(SSLSocketImpl.java:618)
    at sun.security.ssl.BaseSSLSocketImpl.connect(BaseSSLSocketImpl.java:160)
    at sun.net.NetworkClient.doConnect(NetworkClient.java:180)
    at sun.net.www.http.HttpClient.openServer(HttpClient.java:378)
    at sun.net.www.http.HttpClient.openServer(HttpClient.java:473)
    at sun.net.www.protocol.https.HttpsClient.<init>(HttpsClient.java:270)
    at sun.net.www.protocol.https.HttpsClient.New(HttpsClient.java:327)
    at sun.net.www.protocol.https.AbstractDelegateHttpsURLConnection.getNewHttpClient(AbstractDelegateHttpsURLConnection.java:191)
    at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:931)
    at sun.net.www.protocol.https.AbstractDelegateHttpsURLConnection.connect(AbstractDelegateHttpsURLConnection.java:177)
    at sun.net.www.protocol.https.HttpsURLConnectionImpl.connect(HttpsURLConnectionImpl.java:153)
    at ovirt_engine_dwh.sampletimekeepingjob_3_2.SampleTimeKeepingJob.tJava_1Process(SampleTimeKeepingJob.java:1170)
    at ovirt_engine_dwh.sampletimekeepingjob_3_2.SampleTimeKeepingJob$1.run(SampleTimeKeepingJob.java:4309)
2013-02-14 10:19:04|3KyRjP|meBWew|8Bni46|OVIRT_ENGINE_DWH|SampleTimeKeepingJob|Default|6|Java Exception|tJava_1|java.net.UnknownHostException:dbotzer-vm-jrs7.qa.lab.tlv.redhat.com|1
(In reply to comment #6)
> tested SF6,
> Is that the output I should get in audit_log table :
> ------
> "USER_VDC_LOGIN" - "User admin@internal logged in." - FALSE
> ----------------
>
> And in dwh.log:
> ----------------
> 2013-02-14 10:18:04|5OWQs5|meBWew|8Bni46|OVIRT_ENGINE_DWH|SampleTimeKeepingJob|Default|6|Java Exception|tJava_1|java.net.UnknownHostException:dbotzer-vm-jrs7.qa.lab.tlv.redhat.com|1
> Exception in component tJava_1
> java.net.UnknownHostException: dbotzer-vm-jrs7.qa.lab.tlv.redhat.com
>     at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:178)
>     at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:391)
>     at java.net.Socket.connect(Socket.java:579)
>     at sun.security.ssl.SSLSocketImpl.connect(SSLSocketImpl.java:618)
>     at sun.security.ssl.BaseSSLSocketImpl.connect(BaseSSLSocketImpl.java:160)
>     at sun.net.NetworkClient.doConnect(NetworkClient.java:180)
>     at sun.net.www.http.HttpClient.openServer(HttpClient.java:378)
>     at sun.net.www.http.HttpClient.openServer(HttpClient.java:473)
>     at sun.net.www.protocol.https.HttpsClient.<init>(HttpsClient.java:270)
>     at sun.net.www.protocol.https.HttpsClient.New(HttpsClient.java:327)
>     at sun.net.www.protocol.https.AbstractDelegateHttpsURLConnection.getNewHttpClient(AbstractDelegateHttpsURLConnection.java:191)
>     at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:931)
>     at sun.net.www.protocol.https.AbstractDelegateHttpsURLConnection.connect(AbstractDelegateHttpsURLConnection.java:177)
>     at sun.net.www.protocol.https.HttpsURLConnectionImpl.connect(HttpsURLConnectionImpl.java:153)
>     at ovirt_engine_dwh.sampletimekeepingjob_3_2.SampleTimeKeepingJob.tJava_1Process(SampleTimeKeepingJob.java:1170)
>     at ovirt_engine_dwh.sampletimekeepingjob_3_2.SampleTimeKeepingJob$1.run(SampleTimeKeepingJob.java:4309)
> 2013-02-14 10:19:04|3KyRjP|meBWew|8Bni46|OVIRT_ENGINE_DWH|SampleTimeKeepingJob|Default|6|Java Exception|tJava_1|java.net.UnknownHostException:dbotzer-vm-jrs7.qa.lab.tlv.redhat.com|1

It's fixed in SF7. Please look at the release notes.

Yaniv
Fixed, 3.2/SF7

Table audit_log contains the correct log:
"DWH_ERROR";9704;2;"ETL service sampling has encountered an error. Please consult the service log for more details."
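
If the verification needs to be repeated against an exported row, something like the following could check it. This is a sketch under assumptions: it treats the row as a semicolon-delimited export with double-quoted text fields, exactly as pasted above, and it only relies on field positions because the column names for the two numeric values are not given in this report. The helper name `parse_exported_row` is made up for illustration.

```python
import csv
import io
from typing import List


def parse_exported_row(text: str) -> List[str]:
    """Split one semicolon-delimited, double-quoted export row into fields."""
    reader = csv.reader(io.StringIO(text), delimiter=";", quotechar='"')
    return next(reader)


row = parse_exported_row(
    '"DWH_ERROR";9704;2;"ETL service sampling has encountered an error. '
    'Please consult the service log for more details."'
)

# Positions only: the meaning of fields 1 and 2 is not stated in this report.
is_dwh_error = row[0] == "DWH_ERROR"
print(is_dwh_error, row[-1])
```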
Found error in fix. Moving back to assigned. Yaniv
Fixed 3.2/SF10
------
The dwh properties are configured by default with timeBetweenErrorEvents=300000 (5min).
------
I get the log below every 5min, as expected. Table audit_log contains the correct log:
"ETL service sampling has encountered an error. Please consult the service log for more details."
------
I changed the setting to timeBetweenErrorEvents=60000 (1min) and then got the error every 1min. I left the errors coming for 1H.
------
The DWH log also contains the exception:
2013-03-21 09:53:17|T2q9Zm|h5Cx48|gaUbZm|OVIRT_ENGINE_DWH|SampleTimeKeepingJob|Default|6|Java Exception|tJava_1|java.net.UnknownHostException:xxx.xxx.xxx.com|1
Exception in component tJava_1
java.net.UnknownHostException: xxx.xxx.xxx.com
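
To make the expected counts concrete, here is a minimal sketch of the throttling behavior observed above. It is not the DWH implementation, which I have not looked at; the class `ErrorEventThrottle` and its method names are hypothetical. It only assumes that an error event is written when at least timeBetweenErrorEvents milliseconds have passed since the previous one, and that the failing job runs once a minute, as the timestamps in the earlier dwh.log excerpts suggest.

```python
from typing import Optional


class ErrorEventThrottle:
    """Allow one error event per interval (hypothetical helper)."""

    def __init__(self, time_between_error_events_ms: int = 300000) -> None:
        self.interval_ms = time_between_error_events_ms
        self.last_emitted_ms: Optional[int] = None

    def should_emit(self, now_ms: int) -> bool:
        # Emit the first error, then only once the interval has elapsed.
        if (
            self.last_emitted_ms is None
            or now_ms - self.last_emitted_ms >= self.interval_ms
        ):
            self.last_emitted_ms = now_ms
            return True
        return False


def events_in_one_hour(interval_ms: int) -> int:
    """Count emitted events when an error occurs every minute for 1H."""
    throttle = ErrorEventThrottle(interval_ms)
    return sum(
        1 for minute in range(60) if throttle.should_emit(minute * 60000)
    )


default_count = events_in_one_hour(300000)  # default, 5min
short_count = events_in_one_hour(60000)     # changed value, 1min
print(default_count, short_count)
```

Under these assumptions, an hour of continuous failures gives 12 audit_log events with the default setting and 60 with the 1min setting.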
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. http://rhn.redhat.com/errata/RHEA-2013-0926.html