Test-infra: tide batches newest PRs instead of oldest

Created on 5 Apr 2018 · 10Comments · Source: kubernetes/test-infra

This means the same pool can be repeatedly tested and fail while the oldest PRs are tested and merged serially. Batches and serial tests should operate on the same queue, not different ends!

areprow kinbug

Source

rmmh

Most helpful comment

Why did we bound batches? Why not do log(N) batches of size 1,2,4,...log(N) ?

stevekuznetsov on 5 Apr 2018

👍3

All 10 comments

/kind bug
/cc @cjwagner @kargakis @stevekuznetsov

BenTheElder on 5 Apr 2018

This problem was introduced when we switched from unbounded batches to bounded batches. I think that all we need to do is sort the PRs by increasing number before looping through them in pickBatch().

cjwagner on 5 Apr 2018

Why did we bound batches? Why not do log(N) batches of size 1,2,4,...log(N) ?

stevekuznetsov on 5 Apr 2018

👍3

Why did we bound batches? Why not do log(N) batches of size 1,2,4,...log(N) ?

I would also like to know why we are bounding batches. This sounds more reasonable to me.

BenTheElder on 5 Apr 2018

I agree that we shouldn't be bounding batches, their size should be dynamic. Either way the batch size won't be the entire pool though, so we will still need to sort the PRs.

Multiple batches sounds good to me, but will require a bit of a rework of Tide. Currently it is designed to work with one pending batch per subpool.

cjwagner on 5 Apr 2018

We did it here: https://github.com/kubernetes/test-infra/pull/5953

Maybe we can just revert that?

stevekuznetsov on 6 Apr 2018

👍1

/cc @kargakis

stevekuznetsov on 6 Apr 2018

Unbounded batches is a separate issue although it caused this one. I am fine with reverting but we need to keep in mind that the bigger a batch is the higher chance of a test failure. Maybe a separate issue for enhancing the batch logic to trigger different sizes of batches?

This issue is fixed.