Description of problem: TALO couldn't keep up with the amount of reconcile events in large scale ZTP SNO tests. As the number of installed SNOs increases, it takes longer and longer for TALO to get back to a particular SNO and progress its upgrade sequence further. Eventually it becomes too much and TALO starts to give up on SNOs that have gone beyond the 4 hour limit. Version-Release number of selected component (if applicable): How reproducible: 100% Steps to Reproduce: 1. ZTP SNO deployment test at 50 clusters per hour or 100 cluster per hour 2. 3. Actual results: See description Expected results: It should be able to do 50 clusters per hour at minimum. Ideally 100 as well or even higher. Additional info:
Changed to verified to unblock backporting