Hack for test_delete_now rare failures

Until we have a better solution (either in kazoo or in nodepool) for the problem described in the long inline comment, let's prevent these rare test failures that have been plaguing us. Change-Id: Id43c2dba88f8dd3b8ce7b70be551e071ca4b1dd1
2018-02-19 16:20:46 -05:00 · 2018-02-19 16:20:46 -05:00 · 59d636740a
parent 796324f15e
commit 59d636740a
1 changed files with 11 additions and 0 deletions
--- a/nodepool/tests/test_commands.py
+++ b/nodepool/tests/test_commands.py
@ -226,6 +226,17 @@ class TestNodepoolCMD(tests.DBTestCase):
        configfile = self.setup_config('node.yaml')
        pool = self.useNodepool(configfile, watermark_sleep=1)
        self.useBuilder(configfile)
+
+        # (Shrews): This is a hack to avoid a race with the DeletedNodeWorker
+        # thread where it may see that our direct call to NodeDeleter.delete()
+        # has changed the node state to DELETING and lock the node during the
+        # act of deletion, but *after* the lock znode child has been deleted
+        # and *before* kazoo has fully removed the node znode itself. This race
+        # causes the rare kazoo.exceptions.NotEmptyError in this test because
+        # a new lock znode gets created (that the original delete does not see)
+        # preventing the node znode from being deleted.
+        pool.delete_interval = 5
+
        pool.start()
        self.waitForImage('fake-provider', 'fake-image')
        nodes = self.waitForNodes('fake-label')