Created attachment 1217416 [details] metrics pods logs Description of problem: # oc get pods when executed in openshift-infra project gives output as # oc get pods NAME READY STATUS RESTARTS AGE hawkular-cassandra-1-mp5gn 1/1 Running 0 2d hawkular-cassandra-2-z8rl0 1/1 Running 2 2d hawkular-metrics-2z5so 1/1 Running 0 2d hawkular-metrics-5srpo 1/1 Running 0 2d heapster-0npf8 1/1 Running 18 2d metrics-deployer-op83i 0/1 Completed 0 3d from where is visible that heapster pod was restarted many times for unknown reason. Version-Release number of selected component (if applicable): Openshiftrpm -qa | grep atomic atomic-openshift-dockerregistry-3.3.1.1-1.git.0.629a1d8.el7.x86_64 atomic-openshift-pod-3.3.1.1-1.git.0.629a1d8.el7.x86_64 atomic-openshift-clients-3.3.1.1-1.git.0.629a1d8.el7.x86_64 atomic-openshift-node-3.3.1.1-1.git.0.629a1d8.el7.x86_64 atomic-openshift-tests-3.3.1.1-1.git.0.629a1d8.el7.x86_64 atomic-openshift-clients-redistributable-3.3.1.1-1.git.0.629a1d8.el7.x86_64 tuned-profiles-atomic-openshift-node-3.3.1.1-1.git.0.629a1d8.el7.x86_64 atomic-openshift-master-3.3.1.1-1.git.0.629a1d8.el7.x86_64 tuned-profiles-atomic-2.7.1-3.el7.noarch atomic-openshift-3.3.1.1-1.git.0.629a1d8.el7.x86_64 atomic-openshift-sdn-ovs-3.3.1.1-1.git.0.629a1d8.el7.x86_64 and metrics images v.3.3 How reproducible: I have seen this issue when openshift metrics was supposed to monitor 15k pods across 220 nodes. Actual results: heapster pod fails Expected results: Additional info: log files for heapster / hawkular / cassandra attached to BZ
*** This bug has been marked as a duplicate of bug 1465532 ***