Description of problem:
I was running a light mixed workload against memory, network, and I/O, and monitoring it with Grafana. Grafana worked with no issues at the start of the run. When I checked in the morning, I received a popup stating: Templating Template variable service failed.......

The logs in the grafana pod show the following:

t=2022-01-20T12:43:50+0000 lvl=info msg="Database locked, sleeping then retrying" logger=sqlstore error="database is locked" retry=0
t=2022-01-20T12:43:50+0000 lvl=eror msg="Failed to login" logger=auth.proxy username=kube:admin message="failed to log in as user, specified in auth proxy header" error="user already exists" ignoreCache=false
t=2022-01-20T12:43:50+0000 lvl=eror msg="failed to log in as user, specified in auth proxy header" logger=context error="user already exists"
t=2022-01-20T12:43:50+0000 lvl=info msg="Request Completed" logger=context userId=0 orgId=0 uname= method=GET path=/api/datasources/proxy/1/api/v1/series status=407 remote_addr="10.20.116.6, 10.128.2.2" time_ms=14 size=1762 referer="https://grafana-openshift-monitoring.apps.pok-74.ocptest.pok.stglabs.ibm.com/d/cL_KOrJnz/node-exporter-use-method-node?orgId=1&refresh=30s&var-datasource=prometheus&var-cluster=&var-instance=worker-2.pok-74.ocptest.pok.stglabs.ibm.com"
t=2022-01-20T12:43:50+0000 lvl=info msg="Request Completed" logger=context userId=2 orgId=1 uname=kube:admin method=GET path=/api/datasources/proxy/1/api/v1/series status=403 remote_addr="10.20.116.6, 10.131.0.2" time_ms=35 size=86085 referer="https://grafana-openshift-monitoring.apps.pok-74.ocptest.pok.stglabs.ibm.com/d/cL_KOrJnz/node-exporter-use-method-node?orgId=1&refresh=30s&var-datasource=prometheus&var-cluster=&var-instance=worker-2.pok-74.ocptest.pok.stglabs.ibm.com"
t=2022-01-20T12:44:02+0000 lvl=eror msg="Dashboard not found" logger=context userId=2 orgId=1 uname=kube:admin error="Dashboard not found" remote_addr="10.20.116.6, 10.131.0.2"
t=2022-01-20T12:44:02+0000 lvl=info msg="Request Completed" logger=context userId=2 orgId=1 uname=kube:admin method=GET path=/api/dashboards/uid/cL_KOrJnz status=404 remote_addr="10.20.116.6, 10.131.0.2" time_ms=1 size=33 referer="https://grafana-openshift-monitoring.apps.pok-74.ocptest.pok.stglabs.ibm.com/d/cL_KOrJnz/node-exporter-use-method-node?orgId=1&refresh=30s&var-datasource=prometheus&var-cluster=&var-instance=worker-2.pok-74.ocptest.pok.stglabs.ibm.com"
.
.
.
t=2022-01-20T12:48:06+0000 lvl=info msg="Request Completed" logger=context userId=2 orgId=1 uname=kube:admin method=GET path=/api/datasources/proxy/1/api/v1/query_range status=403 remote_addr="10.20.116.6, 10.128.2.2" time_ms=2 size=86211 referer="https://grafana-openshift-monitoring.apps.pok-74.ocptest.pok.stglabs.ibm.com/d/0TCpKC1nz/node-exporter-use-method-node?orgId=1&refresh=30s"

Version-Release number of selected component (if applicable):
Client Version: 4.10.0-0.nightly-s390x-2022-01-17-171822
Server Version: 4.10.0-0.nightly-s390x-2022-01-17-171822
Kubernetes Version: v1.23.0+60f5a1c

How reproducible:
I am not sure what triggered the issue. In my case I had a mixed workload running overnight. I checked another one of my systems that has nothing running, and I can get to the Grafana dashboard there with no issues.

Steps to Reproduce:
1.
2.
3.

Actual results:
Grafana Dashboard is no longer working

Expected results:

Additional info:
I will attach the grafana pod log and the screenshot of the Grafana error popup.
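For anyone triaging a log like the excerpt above, the following is a minimal sketch (not part of the original report) that tallies log levels, error strings, and failed requests from Grafana's key=value log lines. The function names `parse_line` and `summarize` are hypothetical helpers, and the three sample lines are taken from the excerpt, with the long `referer` field left off the last one for brevity.

```python
import re
from collections import Counter

# Matches key=value pairs; a value is either a double-quoted string
# (which may contain spaces) or a run of non-space characters (possibly empty).
PAIR = re.compile(r'(\w+)=("(?:[^"\\]|\\.)*"|\S*)')


def parse_line(line):
    """Return a dict of the key=value fields found in one log line."""
    fields = {}
    for key, value in PAIR.findall(line):
        # Strip the surrounding quotes from quoted values.
        if len(value) >= 2 and value[0] == '"' and value[-1] == '"':
            value = value[1:-1]
        fields[key] = value
    return fields


def summarize(lines):
    """Count log levels, error strings, and (path, status) pairs with status >= 400."""
    levels, errors, failed = Counter(), Counter(), Counter()
    for line in lines:
        fields = parse_line(line)
        if not fields:
            continue  # skip blank lines and the ". . ." elision
        levels[fields.get("lvl", "?")] += 1
        if "error" in fields:
            errors[fields["error"]] += 1
        status = fields.get("status", "")
        if status.isdigit() and int(status) >= 400:
            failed[(fields.get("path", "?"), status)] += 1
    return levels, errors, failed


# Sample lines from the excerpt (referer omitted from the last line).
SAMPLE = [
    't=2022-01-20T12:43:50+0000 lvl=info msg="Database locked, sleeping then retrying" '
    'logger=sqlstore error="database is locked" retry=0',
    't=2022-01-20T12:43:50+0000 lvl=eror msg="failed to log in as user, specified in '
    'auth proxy header" logger=context error="user already exists"',
    't=2022-01-20T12:44:02+0000 lvl=info msg="Request Completed" logger=context userId=2 '
    'orgId=1 uname=kube:admin method=GET path=/api/dashboards/uid/cL_KOrJnz status=404 '
    'remote_addr="10.20.116.6, 10.131.0.2" time_ms=1 size=33',
]

if __name__ == "__main__":
    levels, errors, failed = summarize(SAMPLE)
    print("levels:", dict(levels))
    print("errors:", dict(errors))
    print("failed requests:", dict(failed))
```

To run it against a full log, pass an open file object to `summarize` instead of `SAMPLE`. The counts only show which errors dominate; they do not by themselves identify the root cause.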
Created attachment 1852226 [details] grafana pod log
Created attachment 1852227 [details] Grafana Error Pop Up screen shot
*** This bug has been marked as a duplicate of bug 2037891 ***