Description of problem:
OVN Octavia load balancers stop working. octavia.log contains the following errors:
Client-side error: Pool <UUID> is immutable and cannot be updated. format_exception /usr/lib/python3.6/site-packages/wsme/api.py:222
Possibly related: https://bugs.launchpad.net/neutron/+bug/1900763
Version-Release number of selected component (if applicable):
16.1.x
How reproducible:
This specific environment
Additional info:
I'll provide additional details in private comments.
Hi Ignacio,
Unfortunately, these errors were already present in even the oldest logs in the sosreports I found on supportshell, so I can't tell exactly which operation might have triggered them.
One possibility is that the subnet was not created properly, and when Kuryr/CNO went to allocate a VIP with that IP, it was already in use.
Another is that there was a loadbalancer create operation that was somehow stopped and left a stale DB entry.
I know there were some manual DB operations to clean things up, followed by a forced failover of (I think) this loadbalancer, which cleared things up.
All the log files I've looked at in the case show this failure (AllocateVIPException). To get further in the RCA, I think we'd need the logs from when the failure stopped, so we could see what operation happened at that point in time.
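One way to narrow down that point in time is to scan the Octavia logs for the earliest and latest occurrence of the exception. The snippet below is only a rough sketch: the helper name failure_window is made up for illustration, and it assumes each log line starts with a "YYYY-MM-DD HH:MM:SS.mmm" timestamp, which may not match how logging is configured in this environment.

```python
import re
from datetime import datetime

# Assumption: each log line begins with a timestamp such as
# "2020-10-21 12:34:56.789". Adjust the pattern if the format differs.
TS_RE = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})\.\d+")


def failure_window(lines, marker="AllocateVIPException"):
    """Return (earliest, latest) timestamps of lines containing marker.

    Hypothetical helper, not part of Octavia. Returns (None, None)
    when no timestamped line contains the marker.
    """
    stamps = []
    for line in lines:
        if marker not in line:
            continue
        match = TS_RE.match(line)
        if match is None:
            # Traceback continuation lines carry no timestamp; skip them.
            continue
        stamps.append(datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S"))
    if not stamps:
        return None, None
    return min(stamps), max(stamps)


# Example use against a log file (the path is a placeholder):
# with open("octavia.log", errors="replace") as fh:
#     print(failure_window(fh))
```

The latest timestamp returned marks roughly when the failure stopped, so the operations logged just after it are the ones worth looking at.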
Thanks,
-Brian