0.7.1
Linux
When a system job is evaluated, if a node has no free resources the task is queued. Later, if resources on the node become available the system task should be allocated.
Expected behaviour:
System jobs are removed from the Queued state and go into the Running state
Observed behaviour:
System jobs remain in the Queued state.
By porting the blocked evaluation logic from generic_scheduler.go into system_scheduler.go I'm able to get things to work as expected.
@maihde I think you identified the problem correctly. We do want to bring a lot of the improvements from the generic scheduler to the system scheduler. Hopefully we can use some of your work!
@dadgar thanks for the feedback. If you would like a pull-request for my patch, let me know.
@maihde would be nice to get that PR up
@jippi just opened up. Thanks.
Is there a timeline for the system scheduler rework that will include the fix for this?
@jippi @mwalters-workmarket my original pull request was closed because there was a major refactor planned to the schedulers and it was deemed easier to start afresh. If this feature is still on your roadmap I'd be happy to implement the feature against the current master and provide a new pull request.
would be cool to revive this
Following up here late, sorry. Is this still a relevant issue? I believe Nomad 0.9.4 fixed this issue with https://github.com/hashicorp/nomad/pull/5900 . Can someone confirm?
Closing this ticket, as it seems fixed and I'm unable to reproduce it now. Please re-open or open a new one if you believe this to be an error. Thanks!