DAOS-18593 test: replace sleep with retry in rebuild/interactive.py#17559
DAOS-18593 test: replace sleep with retry in rebuild/interactive.py#17559daltonbohning wants to merge 3 commits intomasterfrom
Conversation
|
Ticket title is 'rebuild/interactive.py: remove arbitrary sleep' |
|
Test stage Functional Hardware Large MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17559/1/execution/node/791/log |
Replace arbitrary sleep with a retry on expected DER_NONEXIST. Test-repeat: 10 Test-tag: RbldInteractive Skip-unit-tests: true Skip-fault-injection-test: true Signed-off-by: Dalton Bohning <dalton.bohning@hpe.com>
0f8abac to
eb0b70e
Compare
Test-repeat: 10 Test-tag: RbldInteractive Skip-unit-tests: true Skip-fault-injection-test: true Signed-off-by: Dalton Bohning <dalton.bohning@hpe.com>
|
This is improving a race condition where even when This sample run shows the DER_NONEXIST handling is working |
This reverts commit 7318e84. Test-repeat: 10 Test-tag: RbldInteractive Skip-unit-tests: true Skip-fault-injection-test: true Signed-off-by: Dalton Bohning <dalton.bohning@hpe.com>
b163b67 to
862b100
Compare
jamesanunez
left a comment
There was a problem hiding this comment.
I don't think you need 'break'. Please review.
Replace arbitrary sleep with a retry on expected DER_NONEXIST.
This improves a race condition where even when dmg pool query shows rebuild is busy,
it hasn't "actually" started yet.
So when dmg pool rebuild stop fails with DER_NONEXIST, we simply wait and retry.
Test-repeat: 10
Test-tag: RbldInteractive
Skip-unit-tests: true
Skip-fault-injection-test: true
Steps for the author:
After all prior steps are complete: