Description of problem:
OVN Octavia load balancers stop working. octavia.log contains the following errors:

  Client-side error: Pool <UUID> is immutable and cannot be updated. format_exception /usr/lib/python3.6/site-packages/wsme/api.py:222

Possibly related: https://bugs.launchpad.net/neutron/+bug/1900763

Version-Release number of selected component (if applicable):
16.1.x

How reproducible:
This specific environment

Additional info:
I'll provide additional details in private comments.
https://access.redhat.com/solutions/5858391 seems relevant.
Hi Ignacio,

Unfortunately, these errors were already occurring in even the oldest logs in the sosreports I found on supportshell, so I can't tell exactly which operation triggered them.

One possibility is that the subnet was not created properly, and when Kuryr/CNO went to allocate a VIP with that IP, it was already in use. Another is that a loadbalancer create operation was somehow interrupted and left a stale DB entry. I know there were some manual DB operations to clean things up, followed by a forced failover of (I think) this loadbalancer, which cleared things up.

All the log files I've looked at in the case show this failure (AllocateVIPException). To get further in the RCA, I think we'd need the logs from when the failure stopped, so we could see what operation happened at that point in time.

Thanks,
-Brian
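
To help find the point where the failure stopped, something like the following could be run over the rotated octavia.log files. This is only a minimal sketch and not part of the analysis above: it assumes each log record begins with a "YYYY-MM-DD HH:MM:SS.mmm" timestamp (the usual oslo.log layout), and `failure_window` is a hypothetical helper name.

```python
import re
from datetime import datetime

# Assumption: records start with an oslo.log-style timestamp,
# e.g. "2021-03-01 10:00:00.123 ...". Adjust if the format differs.
TIMESTAMP_RE = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})\.\d+")


def failure_window(lines, marker="AllocateVIPException"):
    """Return (first_seen, last_seen) for log lines containing marker.

    Returns None if no timestamped line contains the marker.
    """
    first = last = None
    for line in lines:
        if marker not in line:
            continue
        match = TIMESTAMP_RE.match(line)
        if match is None:
            # Traceback continuation lines carry no timestamp; skip them.
            continue
        stamp = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")
        if first is None or stamp < first:
            first = stamp
        if last is None or stamp > last:
            last = stamp
    return None if first is None else (first, last)
```

Passing an open file object (for example `failure_window(open("octavia.log"))`) yields the first and last time the marker was logged in that file; the last timestamp is where to start looking for the operation that cleared the failure.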